verilator

Commit Graph

Author	SHA1	Message	Date
Geza Lore	505d33b35a	Support #0 delays with IEEE-1800 compliant semantics (#7079 ) This patch adds IEEE-1800 compliant scheduling support for the Inactive scheduling region used for #0 delays. Implementing this requires that all IEEE-1800 active region events are placed in the internal 'act' section. This has simulation performance implications. It prevents some optimizations (e.g. V3LifePost), which reduces single threaded performance. It also reduces the available work and parallelism in the internal 'nba' section, which severely reduces the effectiveness of multi-threading. Performance impact on RTLMeter when using scheduling adjusted to support proper #0 delays is ~10-20% slowdown in single-threaded mode, and ~100% (2x slower) with --threads 4. To avoid paying this performance penalty unconditionally, the scheduling is only adjusted if either: 1. The input contains a statically known #0 delay 2. The input contains a variable #x delay unknown at compile time If no #0 is present, but #x variable delays are, a ZERODLY warning is issued advising the use of '--no-sched-zero-delay' which is a promise by the user that none of the variable delays will evaluate to a zero delay at run-time. This warning is turned off if '--sched-zero-delay' is explicitly given. This is similar to the '--timing' option. If '--no-sched-zero-delay' was used at compile time, then executing a zero delay will fail at runtime. A ZERODLY warning is also issued if a static #0 is found, but the user specified '--no-sched-zero-delay'. In this case the scheduling is not adjusted to support #0, so executing it will fail at runtime. Presumably the user knows it won't be executed. 
The intended behaviour with all this is the following: No #0, no #var in the design (#constant is OK) -> Same as current behaviour, scheduling not adjusted, same code generated as before Has static #0 and '--no-sched-zero-delay' is NOT given: -> No warnings, scheduling adjusted so it just works, runs slow Has static #0 and '--no-sched-zero-delay' is given: -> ZERODLY on the #0, scheduling not adjusted, fails at runtime if hit No static #0, but has #var and no option is given: -> ZERODLY on the #var advising use of '--no-sched-zero-delay' or '--sched-zero-delay' (similar to '--timing'), scheduling adjusted assuming it can be a zero delay and it just works No static #0, but has #var and '--no-sched-zero-delay' is given: -> No warning, scheduling not adjusted, fails at runtime if zero delay No static #0, but has #var and '--sched-zero-delay' is given: -> No warning, scheduling adjusted so it just works	2026-02-16 03:55:55 +00:00
Wilson Snyder	7c6c6a684b	Add SPDX copyright identifiers, and get 'reuse' clean. No functional change.	2026-01-26 20:24:34 -05:00
Wilson Snyder	13327fa9c0	Copyright year update.	2026-01-01 07:22:09 -05:00
Geza Lore	d1eda66668	Deprecate clocker attribute and --clk option (#6463 ) The only use for the clocker attribute and the AstVar::isUsedClock that is actually necessary today for correctness is to mark top level inputs of --lib-create blocks as being (or driving) a clock signal. Correctness of --lib-create (and hence hierarchical blocks) actually used to depend on having the right optimizations eliminate intermediate clocks (e.g.: V3Gate), when the top level port was not used directly in a sensitivity list, or marking top level signals manually via --clk or the clocker attribute. However V3Sched::partition already needs to trace through the logic to figure out what signals might drive a sensitivity list, so it can very easily mark all top level inputs as such. In this patch we remove the AstVar::attrClocker and AstVar::isUsedClock attributes, and replace them with AstVar::isPrimaryClock, automatically set by V3Sched::partition. This eliminates all need for manual annotation so we are deprecating the --clk/--no-clk options and the clocker/no_clocker attributes. This also eliminates the opportunity for any further mis-optimization similar to #6453. Regarding the other uses of the removed AstVar attributes: - As of 5.000, initial edges are triggered via a separate mechanism applied in V3Sched, so the use in V3EmitCFunc.cpp is redundant - Also as of 5.000, we can handle arbitrary sensitivity expressions, so the restriction on eliminating clock signals in V3Gate is unnecessary - Since the recent change when Dfg is applied after V3Scope, it does perform the equivalent of GateClkDecomp, so we can delete that pass.	2025-09-20 15:50:22 +01:00
Geza Lore	f39d6e6108	Deprecate sensitivity list on public_flat_rw attributes (#6443 ) These are no longer required for correct scheduling. They are still accepted for backward compatibility, but have no effect on simulation and are dropped in the front-end. Also removed the then redundant AstAlwaysPublic class. Fixes #6442	2025-09-16 22:38:53 +01:00
Geza Lore	0bf9fc270f	Internals: Remove AstAssignPre/AstAssignPost (#6307 ) Replace with AstAlwaysPre/AstAlwaysPost with AstAssign under them. Step towards #6280	2025-08-19 09:27:59 +01:00
Wilson Snyder	c90f9e53b7	Add ALWNEVER warning, for `always @*` that never executes (#6291 ) (#6303 )	2025-08-18 12:00:53 -04:00
Wilson Snyder	88046c8063	Internals: Rename AstSenTree pointers to sentreep. No functional change intended except JSON.	2025-08-17 19:14:34 -04:00
Wilson Snyder	46c7b69c64	Internals: UINFO now includes newline itself. No functional change.	2025-05-22 20:29:32 -04:00
Wilson Snyder	27d3eb5b7b	Fix UNOPTFLAT warnings with `--coverage-trace` and always_comb (#5821 ).	2025-03-02 20:02:55 -05:00
Wilson Snyder	098ee6fa7a	Internals: Cleanup some missing VL_RESTORERs. No functional change intended.	2025-02-27 21:18:27 -05:00
Wilson Snyder	8fbb725f34	Copyright year update.	2025-01-01 08:30:25 -05:00
Geza Lore	98206a4f04	Improve V3List user interface (#4996 )	2024-03-25 23:06:25 +00:00
Geza Lore	5a69321be3	Split V3Order into further parts and decouple various components (#4953 ) Continuing the idea of decoupling the implementations of the various algorithms. The main points: -Move the former "processDomain" stuff, dealing with assigning combinational logic into the relevant sensitivity domains into V3OrderProcessDomains.cpp -Move the parallel code construction in V3OrderParallel.cpp (Could combine this with some parts of V3Partition - those not called from V3Partition::finalize - but that's not for this patch). -Move the serial code construction into V3OrderSerial.cpp -Factored the very small common code between the parallel and serial code construction (processMoveOneLogic) into V3OrderCFuncEmitter.cpp	2024-03-09 12:43:09 +00:00
Wilson Snyder	e76f29e5ba	Copyright year update	2024-01-01 03:19:59 -05:00
Wilson Snyder	dc10118d3b	Fix C++20 compilation errors (#4670 )	2023-11-06 07:13:31 -05:00
Geza Lore	d60f180f43	Avoid double traversal of maps The typical find/if-not-exists-insert pattern can be achieved with 1 lookup instead of 2 using emplace with a sentinel value. Also, maps value-initialize their values when inserted with the [] operator; this is well defined, so there is no need to explicitly insert zeroes for integer values.	2023-10-28 13:41:43 +01:00
Wilson Snyder	b5828a7ce9	Fix header order botched by clang-format in recent commit.	2023-10-18 06:37:46 -04:00
github action	770cd24f27	Apply 'make format'	2023-10-18 02:50:27 +00:00
Wilson Snyder	431bb1ed16	Support compiling Verilator with gcc/clang precompiled headers (#4579 )	2023-10-17 22:49:28 -04:00
Mariusz Glebocki	28bd7e5b19	Rework multithreading handling to separate by code units that use/never use it. (#4228 )	2023-09-24 22:12:23 -04:00
Krzysztof Bieganski	ffbbd438ae	Internals: Use runtime type info instead of `dynamic_cast` for faster graph type checks (#4397 )	2023-08-31 18:00:53 -04:00
Wilson Snyder	add68130b8	Internals: Rename to dumpLevel(), to avoid confusion with make-a-dump()	2023-05-03 18:04:10 -04:00
Kamil Rakoczy	798d7346cf	Internals: Add VL_MT_SAFE attribute to functions that require locking. (#3805 )	2023-03-17 20:24:15 -04:00
Wilson Snyder	b24d7c83d3	Copyright year update	2023-01-01 10:18:39 -05:00
Wilson Snyder	c0499da28b	Spelling fixes	2022-12-23 11:32:38 -05:00
HungMingWu	196f3292d5	Improve V3Ast function usage ergonomics (#3650 ) Signed-off-by: HungMingWu <u9089000@gmail.com>	2022-10-21 14:12:12 +01:00
Geza Lore	ddb678cc5b	Merge branch 'master' into develop-v5	2022-09-22 17:33:36 +01:00
Geza Lore	af305bf280	Merge branch 'master' into develop-v5	2022-09-16 16:24:36 +01:00
Geza Lore	c266739e9f	Merge branch 'master' into develop-v5	2022-08-05 12:17:57 +01:00
Geza Lore	6a7bda6910	Correctly schedule combinational logic driven from DPI exports. Fixes #3429.	2022-07-14 15:35:49 +01:00
Geza Lore	282887d9c6	Fix code coverage holes Fixes #3422	2022-05-16 21:22:21 +01:00
Geza Lore	599d23697d	IEEE compliant scheduler (#3384 ) This is a major re-design of the way code is scheduled in Verilator, with the goal of properly supporting the Active and NBA regions of the SystemVerilog scheduling model, as defined in IEEE 1800-2017 chapter 4. With this change, all internally generated clocks should simulate correctly, and there should be no more need for the `clock_enable` and `clocker` attributes for correctness in the absence of Verilator generated library models (`--lib-create`). Details of the new scheduling model and algorithm are provided in docs/internals.rst. Implements #3278	2022-05-15 16:03:32 +01:00

33 Commits
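
The "intended behaviour" list in commit 505d33b35a reads as a small decision table. The sketch below restates it as a Python function. It is an illustration only, not Verilator's implementation: the names `Decision` and `decide_zero_delay` and the three-valued `sched_zero_delay` parameter are hypothetical. The case of no #0 and no variable delay combined with an explicit '--sched-zero-delay' is not spelled out in the message; the sketch assumes scheduling stays unadjusted there.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Decision:
    """Outcome of the #0 scheduling decision (hypothetical structure)."""
    adjust_scheduling: bool    # scheduling adjusted to support #0 delays
    zerodly_warning: bool      # a ZERODLY warning is issued
    may_fail_at_runtime: bool  # executing a zero delay would fail at runtime


def decide_zero_delay(has_static_zero: bool,
                      has_variable_delay: bool,
                      sched_zero_delay: Optional[bool]) -> Decision:
    """Restate the commit message's behaviour list.

    sched_zero_delay: None  -> neither option given
                      True  -> '--sched-zero-delay'
                      False -> '--no-sched-zero-delay'
    """
    # No #0 and no #var (constant delays are fine): nothing changes.
    # Assumption: this holds whichever option is given.
    if not has_static_zero and not has_variable_delay:
        return Decision(False, False, False)

    # User promised no zero delays: scheduling is not adjusted.
    # ZERODLY is issued only when a static #0 is actually present.
    if sched_zero_delay is False:
        return Decision(False, has_static_zero, True)

    # Option absent or '--sched-zero-delay': scheduling is adjusted.
    # ZERODLY is issued only for #var without a static #0 and without an option.
    warn = (not has_static_zero) and sched_zero_delay is None
    return Decision(True, warn, False)
```

For example, `decide_zero_delay(False, True, None)` models a design with only variable delays and no option given: scheduling is adjusted and a ZERODLY warning is issued.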