Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Update TrainingPackage link + Add information about clock and reset management + how to preserve signals

...

All NanoXplore primitives are listed in nxLibrary-<variant_name>.vhdp available in the release archive. A documentation is also available in order to get information about all generics and io in Library guide.

It is recommended to only add the nxpackage in the work library and not component declarations as they are already declared in the package.

...

  1. Use a pad directly connected to the closest CKG (these pads are suffixed by _CLK). Either the user instantiates a PLL or a WFG in the design or the tool instantiates automatically a WFG in bypass mode.

  2. Use a common pad and use a buffer NX_BD in global_lowskew mode.

Please have a look at Training Package : Application Note Design/LowskewManagement project.

Global signals management

There are some signals with high fan out can be mapped into low-skew network and introduced some important delays. It could be the problem with synchronous signals load, set, reset, …

In order to avoid this problem, use rejectLowskew method.

Code Block
languagepy
p.rejectLowskew('inst1|reset_sync')

Memory initialization

There are several ways to initialize a memory (attribute, generic, python method).

...

Clock gating and switch

NanoXplore FPGAs get clock switch primitive NX_CKS in order to gate a clock with a dedicated techno dependent cell.

In addition, NG-ULTRA family FPGAs get a clock mux pritmitive, NX_GCK_U in MUX mode in order to switch between clocks with a dedicated techno dependent cell. For NG-MEDIUM and NG-LARGE family, mux must be done with 2 NX_CKS.

Please have a look at Training Package : Application Note Init Component/Ram project.

Memory Inference

In order to infer a memory instead of instantiating a NanoXplore primitive (advantage is the user can choose with the same RTL code to map this memory in RF, RAM, RAM_ECC, … thanks to NXpython constraints), it is recommended to follow TrainingPackage Design/MemInfer examples providing inference for ROM, SRAM, DPRAM with and without ECC.

Please have a look at Training Package Application Note Design/MemInfer project.

Hierarchy

It is recommended to divide the design in well sized modules depending on the function they are responsible for.

It will be very helpful to constrain the design by module then.

Project creation

NXmap is based on Python scripts that is to say a project is a class and all options and constraints are methods associated to this class.

The user can either create a project based on a NanoXplore template (recommended) or start it from scratch.

In case of NanoXplore template, please ask NanoXplore support team at support@nanoxplore.com.

In case of starting from scratch, it can be easier to create a project using the GUI in order to get the right basic NXpython methods.

The project needs the following database:

  • Top cell library and top cell name

  • All files added to associated library

  • Options and constraints

Note

Never add a nxLibrary file to your project. NXmap tool already get the needed files with NX primitives.

Whatever the project base, it is recommended to comply with the following rules:

  • Set options before any progress.

  • Set IO pads locations and parameters before any progress.

  • Save your project after each progress step.

  • Separate your project in categories (setup,synthesis, placing, routing, sta) and set your constraints at the right place.

Project check

Once the project is launched, it is necessary to check logs and reports in order to be sure there is not any issue leading to errors, undesired optimizations, …

Logs and report files analysis

Errors and warnings

First of all, check the project is successful until the required progress step.

If not, a message appears in the console. Sometimes, the message is clear enough, but it could not be. So errors.rpt and warning.rpt reports can be very useful in order to grab the issue.

Note there reports are generated only if there are messages to report to the user.

If reported messages are not clear enough to solve the issue, contact the NanoXplore support team at support@nanoxplore.com giving the CONTEXT code and an archive in order to reproduce the issue (RTL sources + scripts).

Even though the project is successful, it is advised to parse these files.

Complexity

After errors and warnings are solved or understood, check hierarchy.rpt and progress.rpt in order to check the number of instances is relevant with the estimation.

In order to get complexity module by module without confining it in a region, it is possible to use addModule method described in Nxmap user manual.

Refer to #Floor_planning_Complexity for more details.

IO Location

Check in progress.rpt none IO has been automatically placed by the design. It is marked with an exclamation sign.

Common Errors

There are typical types of errors :

  • Oversize : Not enough instance capacity in the area

  • Overflow : Not enough routing capacity in the area

  • Blending Tile : Unable to route a path

These errors can be solved by changing the following parameters/options :

  • If modules are constrained with constrainModule method, increase area size

  • Change the Seed used for P&R

  • Change the following options: CongestionEffort, DensityEffort, RoutingEffort

STA

Once the project is mature enough, Static Timing Analysis can be done in order to check how fast the design can operates.

STA tool can be launched from any step after Preparing (Placing 1/5) with the following example method:

Code Block
Timing_analysis = p.createAnalyzer()
Timing_analysis.launch({'conditions': 'typical', 'maximumSlack': 500, 'searchPathsLimit': 10})

It is advised to launch a STA after Preparing as it shows up most critical paths and the maximum frequency with most optimistic hypothesis. If maximum frequencies are not higher enough at this step, it won’t be after routing for sure.

Clock creation

The input clock frequencies and other parameters must be informed to NXmap for STA.

Please refer to createClock and createGeneratedClock described in Nxmap user manual.

If a PLL is used, PLL output clock frequencies are automatically computed by the software.

Special path and clock domains relationship

Declare all false paths and multi cycle paths in order to not take into account some paths between registers in some particular cases.

Please refer to addFalsePath and addMultiCyclePath in Nxmap user manual.

Clock groups can be created if clock domains are completely unrelated.

Timing files analysis

When launching the STA tool, the following files are created :

  • Summary_<progress_step>_<conditions>.timing

  • Violation_<progress_step>_<conditions>.timing

  • Timing_Constraints_Report_<progress_step>_<conditions>.timing

  • DOMAIN_Input_to_<clk>_<progress_step>_<conditions>.timing

  • DOMAIN_<clk1>_to_<clk2>_<progress_step>_<conditions>.timing

  • DOMAIN_<clk>_to_Output_<progress_step>_<conditions>.timing

Use the first 3 timing files in order to get an overview of the project and other ones to analyze paths.

Optimization

Mapping

Operator

Operators are divided in 3 categories :

  • Adder (ADD)

  • LessThan (LTN)

  • Multiplier (MUL)

LessThan, LessThanOrEqual, GreaterThan and GreaterThanOrEqual are all considered as LessThan operators.

It is possible to map this operators in LUT, Carry or DSP thanks to addMappingDirective method described in Nxmap user manual.

By default, adders are mapped into Carry and Multipliers in DSP. But it can sometimes be interesting to change default mapping directives.

Check #Mapping_Directive_Operator for information about constraint setting.

Memory

Memories can be mapped into logic elements (LUT/DFF), register files (RF), Memory Blocks (RAM) or Memory Blocks protected by EDAC correction thanks to addMappingDirective method described in Nxmap user manual.

By default, small memories (equal or less than 64x16) will be mapped into RF and bigger mapped into RAM. But it can sometimes be interesting to change default mapping directives.

It it highly recommended to size the design according to available depths and heights. Otherwise, too many memories will be instantiated and will affect the routing and STA performances.

In case of memory with reset assertion, the memory has to be mapped into DFF as RF or RAM does not have a reset input pin.

Check #Mapping_Directive_Memory for information about constraint setting.

Placing

In order to improve the maximum clock frequency for each clock domain, it is advised to follow the following steps:

  • Define a floor plan for the design depending on module relationships and pinout.

  • Apply the floor plan to your NXmap project using confineModule described in Nxmap user manual. Check #Floor_planning_Constrain_module for information about constraint setting.

  • If needed, repeat this process going deeper and deeper in the design hierarchy.

  • For last most critical paths, use constrainPath described in Nxmap user manualin order to create a region with only a few elements contained into the specified path. Check #Floor_planning_Constrain_path_between_registers for information about constraint setting.

It can also have a very positive impact to create unitary projects and reuse the routed projects as a blackbox in your final top project using addBlackBox described in Nxmap user manual.

It is also possible to place manually instances in a specified spot thanks to the following NXpython methods all described in Nxmap user manual:

How to use NXpython constrains methods

Mapping directive

...

In order to map an operator into LUT, CY or DSP, follow the following steps:

...

Launch your design for the first time without constraint.

...

Grab the operator model or instance in operators.rpt report. For instance, “ | Operator 'add_3u_3u' | : add_L25 (line 25 in …, model name is “add_3u_3u” and instance name is “add_L25”.

...

ClockSwitch project.

For NG-ULTRA family, NX_CKS is a NX_GCK_U in CKS mode.

Reset management

It is recommended to make global design reset going through the low-skew network. As clocks, they can come from an IO pad and be inserted in the low-skew network thanks to a WFG or if generated internally be guided to the low-skew network with a buffer.

All NanoXplore primitives are reset high sensitive but can manage either synchronous or asynchronous reset.

Thus, it is a better way to use directly active high reset in the design.

In case of active low reset for design processes, it is recommended to invert the input reset signal before low-skew insertion.

Global signals management

There are some signals with high fan out can be mapped into low-skew network and introduced some important delays. It could be the problem with synchronous signals load, set, reset, …

In order to avoid this problem, use rejectLowskew method.

Code Block
languagepy
p.rejectLowskew('inst1|reset_sync')

Memory initialization

There are several ways to initialize a memory (attribute, generic, python method).

It is recommended to rather use python method addMemoryInitialization described in /wiki/spaces/~814749387/pages/48660481 as it is compliant with inferred and instantiated memories.

Please have a look at Training Package : Application Note Init/Ram project.

Memory Inference

In order to infer a memory instead of instantiating a NanoXplore primitive (advantage is the user can choose with the same RTL code to map this memory in RF, RAM, RAM_ECC, … thanks to NXpython constraints), it is recommended to follow TrainingPackage Design/MemInfer examples providing inference for ROM, SRAM, DPRAM with and without ECC.

Please have a look at Training Package : Application Note Design/MemInfer project.

Avoid optimization of a signal

Because of instance optimization, the synthesis tool can merge or delete some design signals.

For some reasons, a user can ask the tool for avoiding this kind of optimization adding attributes in the RTL code.

syn_preserve attribute is dedicated to registers.

syn_keep/syn_noprune are equivalent and dedicated to logic signals.

Please have a look at Training Package : Application Note Attribute/SynKeep and Attribute/SynPreserve projects.

Hierarchy

It is recommended to divide the design in well sized modules depending on the function they are responsible for.

It will be very helpful to constrain the design by module then.

Project creation

NXmap is based on Python scripts that is to say a project is a class and all options and constraints are methods associated to this class.

The user can either create a project based on a NanoXplore template (recommended) or start it from scratch.

In case of NanoXplore template, please ask NanoXplore support team at support@nanoxplore.com.

In case of starting from scratch, it can be easier to create a project using the GUI in order to get the right basic NXpython methods.

The project needs the following database:

  • Top cell library and top cell name

  • All files added to associated library

  • Options and constraints

Note

Never add a nxLibrary file to your project. NXmap tool already get the needed files with NX primitives.

Whatever the project base, it is recommended to comply with the following rules:

  • Set options before any progress.

  • Set IO pads locations and parameters before any progress.

  • Save your project after each progress step.

  • Separate your project in categories (setup,synthesis, placing, routing, sta) and set your constraints at the right place.

Project check

Once the project is launched, it is necessary to check logs and reports in order to be sure there is not any issue leading to errors, undesired optimizations, …

Logs and report files analysis

Errors and warnings

First of all, check the project is successful until the required progress step.

If not, a message appears in the console. Sometimes, the message is clear enough, but it could not be. So errors.rpt and warning.rpt reports can be very useful in order to grab the issue.

Note there reports are generated only if there are messages to report to the user.

If reported messages are not clear enough to solve the issue, contact the NanoXplore support team at support@nanoxplore.com giving the CONTEXT code and an archive in order to reproduce the issue (RTL sources + scripts).

Even though the project is successful, it is advised to parse these files.

Complexity

After errors and warnings are solved or understood, check hierarchy.rpt and progress.rpt in order to check the number of instances is relevant with the estimation.

In order to get complexity module by module without confining it in a region, it is possible to use addModule method described in /wiki/spaces/~814749387/pages/48660481.

Refer to #Floor_planning_Complexity for more details.

IO Location

Check in progress.rpt none IO has been automatically placed by the design. It is marked with an exclamation sign.

Common Errors

There are typical types of errors :

  • Oversize : Not enough instance capacity in the area

  • Overflow : Not enough routing capacity in the area

  • Blending Tile : Unable to route a path

These errors can be solved by changing the following parameters/options :

  • If modules are constrained with constrainModule method, increase area size

  • Change the Seed used for P&R

  • Change the following options: CongestionEffort, DensityEffort, RoutingEffort

STA

Once the project is mature enough, Static Timing Analysis can be done in order to check how fast the design can operates.

STA tool can be launched from any step after Preparing (Placing 1/5) with the following example method:

Code Block
Timing_analysis = p.createAnalyzer()
Timing_analysis.launch({'conditions': 'typical', 'maximumSlack': 500, 'searchPathsLimit': 10})

It is advised to launch a STA after Preparing as it shows up most critical paths and the maximum frequency with most optimistic hypothesis. If maximum frequencies are not higher enough at this step, it won’t be after routing for sure.

Clock creation

The input clock frequencies and other parameters must be informed to NXmap for STA.

Please refer to createClock and createGeneratedClock described in /wiki/spaces/~814749387/pages/48660481.

If a PLL is used, PLL output clock frequencies are automatically computed by the software.

Special path and clock domains relationship

Declare all false paths and multi cycle paths in order to not take into account some paths between registers in some particular cases.

Please refer to addFalsePath and addMultiCyclePath in /wiki/spaces/~814749387/pages/48660481.

Clock groups can be created if clock domains are completely unrelated.

Timing files analysis

When launching the STA tool, the following files are created :

  • Summary_<progress_step>_<conditions>.timing

  • Violation_<progress_step>_<conditions>.timing

  • Timing_Constraints_Report_<progress_step>_<conditions>.timing

  • DOMAIN_Input_to_<clk>_<progress_step>_<conditions>.timing

  • DOMAIN_<clk1>_to_<clk2>_<progress_step>_<conditions>.timing

  • DOMAIN_<clk>_to_Output_<progress_step>_<conditions>.timing

Use the first 3 timing files in order to get an overview of the project and other ones to analyze paths.

Optimization

Mapping

Operator

Operators are divided in 3 categories :

  • Adder (ADD)

  • LessThan (LTN)

  • Multiplier (MUL)

LessThan, LessThanOrEqual, GreaterThan and GreaterThanOrEqual are all considered as LessThan operators.

It is possible to map this operators in LUT, Carry or DSP thanks to addMappingDirective method described in /wiki/spaces/~814749387/pages/48660481.

By default, adders are mapped into Carry and Multipliers in DSP. But it can sometimes be interesting to change default mapping directives.

Check #Mapping_Directive_Operator for information about constraint setting.

Memory

Memories can be mapped into logic elements (LUT/DFF), register files (RF), Memory Blocks (RAM) or Memory Blocks protected by EDAC correction thanks to addMappingDirective method described in /wiki/spaces/~814749387/pages/48660481.

By default, small memories (equal or less than 64x16) will be mapped into RF and bigger mapped into RAM. But it can sometimes be interesting to change default mapping directives.

It it highly recommended to size the design according to available depths and heights. Otherwise, too many memories will be instantiated and will affect the routing and STA performances.

In case of memory with reset assertion, the memory has to be mapped into DFF as RF or RAM does not have a reset input pin.

Check #Mapping_Directive_Memory for information about constraint setting.

Placing

In order to improve the maximum clock frequency for each clock domain, it is advised to follow the following steps:

It can also have a very positive impact to create unitary projects and reuse the routed projects as a blackbox in your final top project using addBlackBox described in /wiki/spaces/~814749387/pages/48660481.

It is also possible to place manually instances in a specified spot thanks to the following NXpython methods all described in /wiki/spaces/~814749387/pages/48660481:

How to use NXpython constrains methods

Mapping directive

Anchor
Mapping_directive_Operator
Mapping_directive_Operator
Operator

In order to map an operator into LUT, CY or DSP, follow the following steps:

  • Launch your design for the first time without constraint.

  • Grab the operator model or instance in operators.rpt report. For instance, “ | Operator 'add_3u_3u' | : add_L25 (line 25 in …, model name is “add_3u_3u” and instance name is “add_L25”.

  • Add the constraint specifying instance to map the operator, for instance p.addMappingDirective('getModels(add_3u_3u)','ADD','DSP')” or p.addMappingDirective('getInstances(add_L25)','ADD','DSP'), and relaunch the project.

  • Check in operators.rpt report the constraint matched with the desired instance.

Please have a look at Training Package : Application Note MappingDirective/Operator project.

...

  • Launch your design for the first time without constraint.

  • Grab the memory model in memories.rpt report. For instance, “ | Ram 'RAM_s_mem' Analysis:, model name is “RAM_s_mem”.

  • Add the constraint specifying instance to map the operator, for instance p.addMappingDirective('getModels(RAM_s_mem)','RAM','RAM_ECC')”, and relaunch the project.

  • Check in memories.rpt report the constraint matched with the desired instance.

Please have a look at Training Package : Application Note MappingDirective/Memory project.

...

Info

In case of direct primitive instance of the DSP, DSP name is the path to the DSP. For instance, p.addDSPLocation('module0|submodule1|DSP_INST_0','CGB[28x4]:L').

Please have a look at Training Package : Application Note PlacingConstraint/DspLocation project.

...

Info

In case of direct primitive instance of the RAM, RAM name is the path to the RAM. For instance, p.addRAMLocation('module0|submodule1|ram','CGB[28x4]').

Please have a look at Training Package : Application Note PlacingConstraint/RamLocation project.

...

Note

In order to place manually a WFG, location must comply with WFG type that is to say Core WFG (WFG_Cx) or Ring WFG (WFG_Rx). Mix WFG (WFG_Mx) can be chosen for either a Core or a Ring signal.

Please have a look at Training Package : Application Note PlacingConstraint/RingLocation project.

...

  • Launch your design for the first time without constraint.

  • Grab the register name in RegisterSummary.rpt report or in timing files. For instance, i_cpt_0|s_cpt_out_reg[5]”.

  • Add the constraint specifying TILE spot (TILE coordinates), for instance “p.setSite('i_cpt_0|s_cpt_out_reg[0]','TILE[2x2]'”, and relaunch the project.

  • Check in preplaced.rpt report the constraint matched with the desired instance.

Please have a look at Training Package : Application Note PlacingConstraint/Site project.

...

  • Launch your design for the first time without constraint.

  • Grab the module and instance name in hierarchy.rpt report. For instance, “| |-> timing_pipe(X2A98C8C6) [ GEN_HIER0|GEN_ROW[0].ROW_PIPE ]”, module name is timing_pipe(X2A98C8C6)” and instance name is “[ GEN_HIER0|GEN_ROW[0].ROW_PIPE ]”.

  • Add the constraint for instance “p.addModule('timing_pipe(X2A98C8C6)', 'GEN_HIER0.GEN_ROW[0].ROW_PIPE', 'GEN_HIER0_ROW-%')”, and relaunch the project.

  • Check in hierarchy.rpt report the constraint matched with the desired instance.

Please have a look at Training Package : Application Note PlacingConstraint/Region project.

...

  • Launch your design for the first time without constraint.

  • Grab the module name in hierarchy.rpt report. For instance, “| ~ |-> row_col_pipe(X212B9C19) [ GEN_HIER0 ]”, name is “|-> row_col_pipe(X212B9C19) [ GEN_HIER0 ]”.

  • Add the constraint specifying area coordinates, for instance “p.constrainModule('|-> row_col_pipe(X212B9C19) [ GEN_HIER0 ]','GEN_HIER0_ROW_M','Soft',9,6,2,3,'GEN_HIER0_ROW_R',False)”, and relaunch the project.

  • Check in hierarchy.rpt report the constraint matched with the desired instance.

Please have a look at Training Package : Application Note PlacingConstraint/Region project.

...

  • Define the macro IP as the top cell.

  • Define a minimum aperture and all needed constraints as it was a global project to reach specifications. Save the project file after routed steps.

  • Do not declare the macro IP entity file in the global project. Instead, add the macro IP as a blackbox, specifying coordinates of the top left corner of the macro IP aperture in the global project, for instance p.addBlackBox('switch_counter',IP','../switch_counter_preplaced.nym','g_inst.i_switch_counter_0:1x8').

  • Check in synthesize.log llog the constraint matched.

Please have a look at Training Package : Application Note PlacingConstraint/Preplace project.

...

developCKGs method is active by default, NXmap compute automatically PLL and WFG required frequencies if input clock is declared.

Please have a look at Training Package : Application Note StaConstraint/GeneratedClock project.

Anchor
Sta_constraints_

...

Generated_

...

clock_

...

declaration
Sta_constraints_

...

Generated_

...

clock_

...

declaration
Generated clock declaration

In order to declare a path between 2 registers as a false path, multi-cycle path, min or max delay pathclock in your project in order to get required frequencies in logs, follow the following steps:

  • Launch your design for the first time without constraint.

  • Grab the source and target register in DOMAIN_<clk1>_to_<clk2>_<progress_step>_<conditions>.timing. For instance, “module0|submodule0|pipe_reg[0].CK” and “module0|submodule0|pipe_reg[1].CK” , names are “module0|submodule0|pipe_reg[0]” and “module0|submodule0|pipe_reg[1]”.

  • Add the constraint specifying area coordinates, for instance “p.addFalsePath('getRegisters(module0|submodule0|pipe_reg[0])','getRegisters(module0|submodule0|pipe_reg[0])')”, and relaunch the project.

  • Check in DOMAIN_<clk1>_to_<clk2>_<progress_step>_<conditions>.timingt report the path no longer appears.

Please have a look at Training Package Application Note StaConstraint/FalsePath project.

How to improve STA results

STA tool

...

  • Grab the clock name getting the hierarchy path to the generated clock.

    • Add the constraint specifying clock parameters, for instance “p.createGeneratedClock(getClock('clk_main'),getRegisterClock('i_clock_0|counter_reg[0]'), 'clk_fabric',{'DivideBy': 2})”.

  • Check timing files are created for this new clock domain.

Please have a look at Training Package : Application Note StaConstraint/GeneratedClock project.

Anchor
Sta_constraints_Constrain_path_between_registers
Sta_constraints_Constrain_path_between_registers
Constrain path between registers

In order to declare a path between 2 registers as a false path, multi-cycle path, min or max delay path, follow the following steps:

  • Launch your design for the first time without constraint.

  • Grab the source and target register in DOMAIN_<clk1>_to_<clk2>_<progress_step>_<conditions>.timing. For instance, “module0|submodule0|pipe_reg[0].CK” and “module0|submodule0|pipe_reg[1].CK” , names are “module0|submodule0|pipe_reg[0]” and “module0|submodule0|pipe_reg[1]”.

  • Add the constraint specifying area coordinates, for instance “p.addFalsePath('getRegisters(module0|submodule0|pipe_reg[0])','getRegisters(module0|submodule0|pipe_reg[0])')”, and relaunch the project.

  • Check in DOMAIN_<clk1>_to_<clk2>_<progress_step>_<conditions>.timingt report the path no longer appears.

Please have a look at Training Package : Application Note StaConstraint/FalsePath project.

How to improve STA results

STA tool

It is recommended to launch the STA tool after Placing 1/5 step named “Preparing”. It allows to witness the number of logic elements crossed and check if it is possible to reach performances when the margin in the most optimistic scenario is high enough.

ClockCreation

Before launching STA tool, all constraints must be defined.

The user must define:

TimingDriven

TimingDriven option can be set with the following constraint:

...

  • Low: TimingDriven is disabled

  • Medium: 3 iterations

  • High: 6 iterations

The higher the number of iterations is, the higher STA performances can be high but the higher the runtime is.

Note

All clocks must be created before launching Place&Route with TimingDriven option. Otherwise, the tool does not get performances to reach.

Note

The TimingDriven efficiency is in continuous development and improvement.

FloorPlanning

...

  • iterations

  • High: 6 iterations

The higher the number of iterations is, the higher STA performances can be high but the higher the runtime is.

Note

All clocks must be created before launching Place&Route with TimingDriven option. Otherwise, the tool does not get performances to reach.

Note

The TimingDriven efficiency is in continuous development and improvement.

Floor planning

In order to give indications to the tool, the user can set up a floor-planning and pass it through the tool working directly on the whole design or working with unitary runs before integrating in the top design as specified in #Floor_planning_Preplace_ip.

First of all, ConstrainModule methods as explained in #Floor_planning_Constrain_module must be applied in order to concentrate parts of the design and place them close to other ones if interconnections between them.

In addition, ConstraintPath methods as explained in #Floor_planning_Constrain_path_between_registers must be applied if some path go trough multiple modules or for last critical paths.

Note

With this solution, the user must pay attention to interconnections between modules and between module and output interfaces.