verilator

Commit Graph

Author	SHA1	Message	Date
Michael Rogenmoser	2a1089cf25	Merge `08579ac2c0` into `3bc73cc768`	2026-03-03 12:49:52 -08:00
Geza Lore	97838325cd	Fix scheduling non-determinism (#7120 partial) (#7162 )	2026-03-01 07:44:59 -05:00
github action	08579ac2c0	Apply 'make format'	2026-02-23 22:50:37 +00:00
Michael Rogenmoser	50983e0727	Sched: Use gated triggers to prevent VIF convergence loops Replace the conditional VIF trigger approach (which compared old vs new values) with a gated trigger mechanism. Each VIF member trigger now has an "already fired" gate variable that prevents the trigger from re-firing on subsequent convergence iterations within the same eval call. The gate flags are cleared at the start of each eval. This avoids the infinite convergence loops without introducing circular ordering dependencies that the conditional approach caused for procedural assigns. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-23 19:08:25 +01:00
Geza Lore	505d33b35a	Support #0 delays with IEEE-1800 compliant semantics (#7079 ) This patch adds IEEE-1800 compliant scheduling support for the Inactive scheduling region used for #0 delays. Implementing this requires that all IEEE-1800 active region events are placed in the internal 'act' section. This has simulation performance implications. It prevents some optimizations (e.g. V3LifePost), which reduces single threaded performance. It also reduces the available work and parallelism in the internal 'nba' section, which reduced the effectiveness of multi-threading severely. Performance impact on RTLMeter when using scheduling adjusted to support proper #0 delays is ~10-20% slowdown in single-threaded mode, and ~100% (2x slower) with --threads 4. To avoid paying this performance penalty unconditionally, the scheduling is only adjusted if either: 1. The input contains a statically known #0 delay 2. The input contains a variable #x delay unknown at compile time If no #0 is present, but #x variable delays are, a ZERODLY warning is issued advising the use of '--no-sched-zero-delay' which is a promise by the user that none of the variable delays will evaluate to a zero delay at run-time. This warning is turned off if '--sched-zero-delay' is explicitly given. This is similar to the '--timing' option. If '--no-sched-zero-delay' was used at compile time, then executing a zero delay will fail at runtime. A ZERODLY warning is also issued if a static #0 if found, but the user specified '--no-sched-zero-delay'. In this case the scheduling is not adjusted to support #0, so executing it will fail at runtime. Presumably the user knows it won't be executed. The intended behaviour with all this is the following: No #0, no #var in the design (#constant is OK) -> Same as current behaviour, scheduling not adjusted, same code generated as before Has static #0 and '--no-sched-zero-delay' is NOT given: -> No warnings, scheduling adjusted so it just works, runs slow Has static #0 and '--no-sched-zero-delay' is given: -> ZERODLY on the #0, scheduling not adjusted, fails at runtime if hit No static #0, but has #var and no option is given: -> ZERODLY on the #var advising use of '--no-sched-zero-delay' or '--sched-zero-delay' (similar to '--timing'), scheduling adjusted assuming it can be a zero delay and it just works No static #0, but has #var and '--no-sched-zero-delay' is given: -> No warning, scheduling not adjusted, fails at runtime if zero delay No static #0, but has #var and '--sched-zero-delay' is given: -> No warning, scheduling adjusted so it just works	2026-02-16 03:55:55 +00:00
Igor Zaworski	446bec3d1a	Fix event triggering (#6932 )	2026-02-11 10:35:59 -08:00
Igor Zaworski	dc26dd601d	Fix internal error - virtual interface not found (#7010 )	2026-02-06 22:20:10 +00:00
Wilson Snyder	7c6c6a684b	Add SPDX copyright identifiers, and get 'reuse' clean. No functional change.	2026-01-26 20:24:34 -05:00
Todd Strader	bc3c5b32dd	Fix delayed initial assignment (#6929 )	2026-01-23 12:53:40 -05:00
Wilson Snyder	13327fa9c0	Copyright year update.	2026-01-01 07:22:09 -05:00
Geza Lore	d2ce5e62e7	Internals: Factor out --prof-exec section handling, add debug code	2025-11-25 10:08:03 +00:00
Geza Lore	2e502aead8	Internals: Make all scheduling region use a single trigger vector. (#6620 ) The 'act' region used to have 2 trigger vectors ('act' and 'pre'), now it uses a single "extended" trigger vector where the top bits are what used to be the used bits in the 'pre' trigger vector. Please see the description above `TriggerKit`. Also move the extra triggers from the low end to the high end in the trigger vectors.	2025-11-01 15:43:20 +00:00
Geza Lore	922223a9c3	Internals: Replace VlTriggerVec with unpacked array (#6616 ) Removed the VlTriggerVec type, and refactored to use an unpacked array of 64-bit words instead. This means the trigger vector and its operations are now the same as for any other unpacked array. The few special functions required for operating on a trigger vector are now generated in V3SchedTrigger as regular AstCFunc if needed. No functional change intended, performance should be the same.	2025-10-31 18:29:11 +00:00
Geza Lore	287fdb7312	Fix mis-ignoring virtual interface member triggers (#5116 reopened) (#6613 )	2025-10-29 17:27:15 -04:00
Geza Lore	cddbb5e095	Internals: Split some code from V3Sched.cpp Add V3SchedUtil.cpp that contains common small utility functions. Add V3SchedTrigger.cpp that contains functionality building the trigger mechanism code. No functional change, just code movement. Prep for some further work.	2025-10-27 21:18:47 +00:00
Geza Lore	60c532908e	Internals: Create if statements for triggers during scheduling (#6280 ) (#6581 ) The AstIf nodes conditional on events being triggered used to be created in V3Clock. Now it is in V3Sched*, in order to avoid having to pass AstActive in CFunc or MTask bodies. No functional change intended, some improved optimization due to simplifying timing triggers that were previously missed, also fixes what seems like a bug in the original timing commit code.	2025-10-27 10:41:30 +00:00
Geza Lore	d236e4c054	Internals: Cleanup scheduling sched_forks.tree used to be dumped before sched.tree, while it's basically after, so move transformForks in to a separate pass. Also extract inlined visitors in V3SchedTiming.	2025-10-26 09:47:09 +00:00
Wilson Snyder	68b227065e	Tests: Fix coverage holes from t_dist_docs_options	2025-10-25 11:00:25 -04:00
Geza Lore	ec91158130	Internals: Refactor AstCFunc internals (#6280 ) (#6578 ) - Delete 'finalsp'. It was used in one place, basically unnecessary and safe to remove. - Make 'argsp' a 'List[AstVar]'. This held before. It holds the function argument and return variables. - Replace 'intitsp' with 'varsp' and make it into 'List[AstVar]' to hold the function local variables. This was most of its use before. The few places we inserted statements here now moved into 'stmtsp' by inserting at the front of the list.	2025-10-21 16:37:32 +01:00
Geza Lore	cf275b6e58	Internals: Refactor text based Ast constructs (#6280 ) (#6571 ) Remove the large variety of ways raw "text" is represented in the Ast. Particularly, the only thing that represents a string to be emitted in the output is AstText. There are 5 AstNodes that can contain AstText, and V3Emit will throw an error if an AstText is encountered anywhere else: - AstCStmt: Internally generated procedural statements involving raw text. - AstCStmtUser: This is the old AstUCStmt, renamed so it sorts next to AstCStmt, as it's largely equivalent. We should never create this internally unless used to represent user input. It is used for $c, statements in the input, and for some 'systemc_* blocks. - AstCExpr: Internally generaged expression involving raw text. - AstCExprUser: This is the old AstUCFunc, renamed so it sorts next to AstCExpr. It is largely equivalent, but also has more optimizations disabled. This should never be created internally, it is only used for $c expressions in the input. - AstTextBlock: Use by V3ProtectLib only, to generate the hierarchical wrappers. Text "tracking" for indentation is always on for AstCStmt, AstCExpr, and AstTextBlock, as these are always generated by us, and should always be well formed. Tracking is always off for AstCStmtUser and AstCExprUser, as these contain arbitrary user input that might not be safe to parse for indentation. Remove subsequently redundant AstNodeSimpleText and AstNodeText types. This patch also fixes incorrect indentation in emitted waveform tracing functions, and makes the output more readable for hier block SV stubs. With that, all raw text nodes are handled as a proper AstNodeStmt or AstNodeExpr as required for #6280.	2025-10-21 12:41:29 +01:00
Geza Lore	61c64e4a3b	Internals: Make AstCExpr always cleanOut (#6280 ) (#6570 ) There was exactly one place in V3Task, handling DPI arguments when we relied on cleanOut of AstCExpr being false for masking. Made that code do the relevant masking via a few new run-time functions, which also eliminates some special cases in the relevant V3Task functions.	2025-10-19 09:44:33 +01:00
Geza Lore	603f4c615a	Improve Loop unrolling (#6480 ) (#6493 ) This patch implements #6480. All loop statements are represented using AstLoop and AstLoopTest. This necessitates rework of the loop unroller to handle loops of arbitrary form. To enable this, I have split the old unroller used for 'generate for' statements and moved it into V3Param, and subsequently rewrote V3Unroll to handle the new representation. V3Unroll can now unroll more complex loops, including with loop conditions containing multiple variable references or inlined functions. Handling the more generic code also requires some restrictions. If a loop contains any of the following, it cannot be unrolled: - A timing control that might suspend the loop - A non-inlined call to a non-pure function These constructs can change the values of variables in the loop, so are generally not safe to unroll if they are present. (We could still unroll if all the variables needed for unrolling are automatic, however we don't do that right now.) These restrictions seem ok in the benchmark suite, where the new unroller can generally unroll many more loops than before.	2025-09-29 15:25:25 +01:00
Wilson Snyder	3b623dc12e	Internals: Refactor to create VCMethod (#3715 ). No functional change intended.	2025-09-27 08:22:17 -04:00
Wilson Snyder	4ad1dde723	Internals: Emit newlines for AstCStmt automatically. No functional change intended.	2025-09-26 08:25:47 -04:00
Geza Lore	800af37975	Internals: Refactor generate construct Ast handling (#6280 ) (#6470 ) Internals: Refactor generate construct Ast handling (#6280) We introduce AstNodeGen, the common base class of AstGenBlock, AstGenCase, AstGenFor, and AstGenIf, which together represent all SV generate constructs. Subsequently remove AstNodeFor, AstNodeCase (AstCase is now directly derived from AstNodeStmt) and adjust internals to work on the new representation. Output is identical modulo hashes do to changed AstNode type ids, no functional change intended. Step towards #6280.	2025-09-23 19:49:01 +01:00
Geza Lore	e0e8503151	Internals: Make all AstBegin constructor arguments explicit (#6464 )	2025-09-20 13:16:03 -04:00
Krzysztof Bieganski	5349b51e71	Allow pure functions in sensitivity lists (#6393 ) Signed-off-by: Krzysztof Bieganski <kbieganski@antmicro.com>	2025-09-10 17:37:34 +02:00
Aleksander Kiryk	353a2e3d20	Fix gathering senaitivities from virtual interface members (#6325 )	2025-08-23 10:45:13 -04:00
Geza Lore	1c86ff0af2	Fix corner case bugs in module and variable inlining (#6322 ) There were a couple corner case bugs in V3Inline, and one in Dfg when dealing with inlining of modules/variables. V3Inline: - Invalid code generated when inlining an input that also had an assignment to it (Throws an ASSIGNIN, but this is sometimes reasonable to do, e.g. hiererchical reference to an unonnected input port) - Inlining (aliasing) publicly writeable input port. - Inlining forcable port connected to constant. Dfg: - Inining publicly writeable variables The tests that cover these are the same and fixing one will trigger the other bug, so fixing them all in one go. Also cleanup V3Inline to be less out of order and rely less on unique APIs only used by V3Inine (will remove those in follow up patch). Small step towards #6280.	2025-08-22 21:43:49 +01:00
Geza Lore	327d55d13d	Internals: Fix remaining cppcheck errors (#6319 ) Fixed the non const-related issue and added suppressions for the const ones. With that `make cppcheck` should be clean.	2025-08-21 09:43:37 +01:00
Geza Lore	0bf9fc270f	Iternals: Remove AstAssignPre/AstAssignPost (#6307 ) Replace with AstAlwaysPre/AstAlwaysPost with AstAssign under them. Step towards #6280	2025-08-19 09:27:59 +01:00
Wilson Snyder	88046c8063	Internals: Rename AstSenTree pointers to sentreep. No functional change intended except JSON.	2025-08-17 19:14:34 -04:00
Yilou Wang	9b99d9697f	Fix virtual interface member propagation (#6175 ) (#6184 )	2025-07-18 09:07:31 -04:00
Yilou Wang	1044398f95	Support member-level triggers for virtual interfaces (#5166 ) (#6148 )	2025-07-11 21:04:51 -04:00
Wilson Snyder	46c7b69c64	Internals: UINFO now includes newline itself. No functional change.	2025-05-22 20:29:32 -04:00
Wilson Snyder	15ebbd309f	Fix always processes ignoring $finish (#5971 ).	2025-05-02 07:36:42 -04:00
Geza Lore	4a2212949e	Fix change detection at time 0 (#5864 ) Initialize "previous value" variables in the static initializer function, instead of the 'initial' blocks function. Fixes #5499	2025-03-18 13:34:04 +00:00
Geza Lore	59cb53cfbc	Set trigger vector in whole words (#5857 ) Having many triggers still hits a bottleneck in LLVM leading to long compile times. Instead of setting triggers bit-wise, set them as a whole 64-bit word when possible. This improves C++ compile times by ~4x on some large designs and has minor run-time performance benefit.	2025-03-14 14:06:51 +00:00
Geza Lore	96bffd49aa	Fix invalidating variable caches in SenExprBulider (#5834 ) (#5835 )	2025-03-07 07:18:34 -05:00
Geza Lore	0133bc6b09	Do not use function locals in SenExprBuilder (#5822 ) Function locals are not safe here because we might need to split up the generated function. V3Localize can fix them later if safe.	2025-03-02 16:13:59 +00:00
Geza Lore	812861e7f2	Optimize splitting trigger computation and dump (#5798 )	2025-02-23 05:57:36 +10:00
Wilson Snyder	001c098e5a	Optimize empty function definition bodies (#5750 ).	2025-01-25 12:13:25 -05:00
Wilson Snyder	8fbb725f34	Copyright year update.	2025-01-01 08:30:25 -05:00
Bartłomiej Chmiel	72a47e16c1	Fix verilator_gantt for hierarchically Verilated models (#5700 )	2024-12-23 09:10:46 -06:00
Bartłomiej Chmiel	a668b7c658	Fix missing VlProcess handle in coroutines with splits (#5623 ) (#5650 ) Signed-off-by: Bartłomiej Chmiel <bchmiel@antmicro.com>	2024-12-02 05:43:26 -05:00
Geza Lore	3bc09d49fb	Generate one trigger per SenItem instead of per SenTree (#5483 )	2024-09-25 10:35:50 +01:00
Arkadiusz Kozdra	2cfec0ecc3	Support clocking blocks in virtual interfaces (#5235 )	2024-07-09 18:31:58 -04:00
Geza Lore	878204db73	Do not feed any empty logic into scheduling (#4972 )	2024-03-16 10:35:56 +00:00
Geza Lore	5a69321be3	Split V3Order into further part and decouple various components (#4953 ) Continuing the idea of decoupling the implementations of the various algorithms. The main points: -Move the former "processDomain" stuff, dealing with assigning combinational logic into the relevant sensitivity domains into V3OrderProcessDomains.cpp -Move the parallel code construction in V3OrderParallel.cpp (Could combine this with some parts of V3Partition - those not called from V3Partition::finalize - but that's not for this patch). -Move the serial code construction into V3OrderSerial.cpp -Factored the very small common code between the parallel and serial code construction (processMoveOneLogic) into V3OrderCFuncEmitter.cpp	2024-03-09 12:43:09 +00:00
Wilson Snyder	3a5248a919	Internals: Mark structs final/VL_NOT_FINAL. No functional change intended.	2024-01-20 15:06:46 -05:00

1 2 3

113 Commits