verilator

Commit Graph

Author	SHA1	Message	Date
Wilson Snyder	3f7bf3d2dc	Fix MSVC localtime_s (#3124).	2022-03-27 13:59:18 -04:00
Geza Lore	f9e69984ff	Set vlSymsp in modules at construction time. This ensures it's available from very early on. No functional change.	2022-03-27 16:10:20 +01:00
Geza Lore	b1b5b5dfe2	Improve run-time profiling. The --prof-threads option has been split into two independent options: 1. --prof-exec, for collecting verilator_gantt and other execution-related profiling data, and 2. --prof-pgo, for collecting data needed for PGO. The implementation of execution profiling is extricated from VlThreadPool and is now a separate class VlExecutionProfiler. This means --prof-exec can now be used for single-threaded models (though it does not measure a lot of things just yet). For consistency VerilatedProfiler is renamed VlPgoProfiler. Both VlExecutionProfiler and VlPgoProfiler are in verilated_profiler.{h/cpp}, but can be used completely independently. Also re-worked the execution profile format so it now only emits events without holding onto any temporaries. This is in preparation for some future optimizations that would be hindered by the introduction of function locals via AstText. Also removed the Barrier event. Clearing the profile buffers is not notably more expensive as the profiling records are trivially destructible.	2022-03-27 15:57:30 +02:00
Wilson Snyder	4eaa6fdd06	Internals: Use python pass appropriately. No functional change intended.	2022-03-26 15:57:52 -04:00
Yutetsu TAKATSUKASA	47226236f4	Internals: Resolve potential SEGV risk (#3350)	2022-03-13 18:13:51 +09:00
Drew Ranck	90fb2e5487	Fix ++/-- tree fix in case statements (#3346) (#3349).	2022-03-12 11:24:32 -05:00
Wilson Snyder	f211616a4c	Fix missing debug, and code cleanup in V3LinkInc.	2022-03-11 07:34:11 -05:00
github action	181b9a5795	Apply 'make format'	2022-03-06 22:17:42 +00:00
Wilson Snyder	9baf9c55c2	Commentary	2022-03-06 17:16:41 -05:00
Yutetsu TAKATSUKASA	999751c422	Count non-empty always blocks in V3Split (#3337). "Optimizations, Split always" in stats now means the number of newly added always. Co-authored-by: Wilson Snyder <wsnyder@wsnyder.org>	2022-03-06 12:56:34 +09:00
Wilson Snyder	22656d6fdd	Fix Vdeeptemp error with --threads and --compiler clang (#3338).	2022-03-05 20:17:36 -05:00
Wilson Snyder	90c61c79d6	Fix unnamedblk error on foreach (#3321).	2022-03-05 17:04:52 -05:00
Wilson Snyder	4ba3bff87f	Fix class stringification on wide arrays (#3312).	2022-03-05 16:32:30 -05:00
Wilson Snyder	c3dd6f5344	Fix public function arguments that are arrayed (#3316).	2022-03-05 16:19:53 -05:00
Geza Lore	3737d209f6	Keep recursive module list topologically (#3324). Fixes (#3324).	2022-03-05 15:04:13 +00:00
Todd Strader	29c4b0a141	Fix cast to array types (#3333)	2022-03-03 07:48:04 -05:00
Geza Lore	5b9806ae6d	Improve V3Combine - Always use a fast function to replace a slow one if available - Iterate to fixed point (i.e.: if combining made more functions identical, combine those too). This will be more useful in the future. - Use only single, const traversal	2022-02-27 20:40:58 +00:00
Geza Lore	665fa140a8	V3Combine: Fix crash if CCall in expression position	2022-02-27 12:52:40 +00:00
Yutetsu TAKATSUKASA	32f843a214	Internals: Don't show "Split always" statistics twice. (Split and Reorder were shown). (#3328)	2022-02-27 20:33:54 +09:00
github action	47069dfe52	Apply 'make format'	2022-02-27 07:53:05 +00:00
HungMingWu	43a84d7ad8	Internals: Fix VL_RESTORER behavior on passing a lvalue reference (#3326). Signed-off-by: HungMingWu <u9089000@gmail.com>	2022-02-27 07:52:11 +00:00
Geza Lore	decfa6bd7a	V3Order: Use unique ordinals per function name. This helps diffing generated code after reordering output, otherwise no functional change.	2022-02-16 18:36:40 +00:00
Geza Lore	8931bd37e2	Cleanup V3Changed and V3GenClk	2022-02-16 18:09:19 +00:00
Geza Lore	4b79d23d00	Replace SenTreeSet with generic collection. Introduce VNRef that can be used to wrap AstNode keys in STL collections, resulting in equality comparisons rather than identity comparisons. This can then replace the SenTreeSet data-structure.	2022-02-16 18:09:19 +00:00
github action	77fe7c426e	Apply 'make format'	2022-02-16 05:11:38 +00:00
Raynard Qiao	331c2244fc	Fixed signed number operation (#3294) (#3308)	2022-02-16 00:10:34 -05:00
Wilson Snyder	77e68acf54	Suppress WIDTH warning on negate using carry bit (#2395). [Peter Monsson]	2022-02-13 15:27:31 -05:00
Wilson Snyder	7a355d448a	Fix skipping public enum values with four-state values (#3303).	2022-02-10 19:27:28 -05:00
Geza Lore	fb9119ff49	Rename AstCFunc attribute for clarity. 'formCallTree' -> 'isFinal'. No functional change.	2022-01-28 16:18:50 +00:00
Geza Lore	26bdfc3474	Commentary	2022-01-21 05:53:42 +00:00
Wilson Snyder	0e91d8a10e	Internal: Rename for clarity. No functional change.	2022-01-19 19:14:09 -05:00
Wilson Snyder	434c3c3ef3	Removed the deprecated "fl" attribute in XML output; use "loc" attribute instead.	2022-01-17 16:22:07 -05:00
Wilson Snyder	21e05c43dd	Removed the deprecated lint_off flag -msg; use -rule instead.	2022-01-17 16:04:06 -05:00
Geza Lore	f8c0169e82	Implement 'forceable' attribute. Using the 'forceable' directive in a configuration file, or the /* verilator forceable */ metacomment on a variable declaration will generate additional public signals that allow the specified signals to be forced/released from the C++ code.	2022-01-16 15:31:37 +00:00
Geza Lore	539c9d4c63	Merge alternate 'force'/'release' implementation - Add more tests, including for tracing. - Apply some cleaner, more generic abstractions in the implementation. - Use clearer AstRelease which is not an assignment.	2022-01-16 15:31:37 +00:00
Geza Lore	b4d8220cbb	Deprecate --cdc (#3279)	2022-01-16 15:30:44 +00:00
Wilson Snyder	e931c6230a	Run EmitV test after all stages, and fix resulting fallout	2022-01-09 18:11:24 -05:00
Geza Lore	64a6e1ac8b	Add AstNode::foreach method for simple pre-order traversal (#3276)	2022-01-09 22:34:10 +00:00
Wilson Snyder	50094ca296	Internals: Add cpplint control file and related cleanups	2022-01-09 16:49:38 -05:00
Wilson Snyder	15b32dc140	Internals: cpplint cleanups. No functional change.	2022-01-08 12:01:39 -05:00
Wilson Snyder	441ecfedc9	Internals: Make all .h files compilable	2022-01-08 11:18:23 -05:00
HungMingWu	78147ee8d7	Fix compile error at GCC11 Fixes #3273 Signed-off-by: HungMingWu <u9089000@gmail.com>	2022-01-08 10:40:51 +00:00
Geza Lore	8c58612a3b	Improve V3Inline speed and memory consumption. Avoid cloning the module when inlining the last instance that references that module. This saves a lot of memory because it saves cloning singleton modules (those with a single instance), which we always inline. The top few levels of the hierarchy are often simple wrappers, including the one added by Verilator in V3LinkLevel::wrapTop. Cloning these and putting off deleting the originals can be very expensive because they often have a lot of contents inlined into them, so each layer of wrapper that is inlined would essentially add a whole new clone of the large top-level. Directly inlining the module for the last cell without cloning saves us from all this duplicate memory consumption and also from having to create the clones in the first place. Also added minor traversal speedups. This reduces the memory consumption of V3Inline by 80% and peak memory consumption of Verilator by about 66% on a large design, while speeding up the V3Inline pass by ~3.5x and the whole of Verilator by ~8% while producing identical output.	2022-01-07 12:11:10 +00:00
Geza Lore	56f9d244de	Cleanup V3Inline. No functional change.	2022-01-07 12:08:17 +00:00
Geza Lore	2ba9eb4228	Speed up TSP sort implementation - More efficient comparison by pre-computing sorting keys. - Remove work items in algorithms known to be redundant earlier. This greatly reduces data structure sizes. - Use V3GraphVertex->user() for state tracking instead of unordered_map; while both of these are constant time, they do add up. - In `makeMinSpanningTree`, instead of batch inserting outgoing edges of each visited vertex into an ordered set, keep an ordered set of sorted vectors of edges. This reduces the size of the ordered set significantly (it is now O(V) rather than O(E), and as the subject graph is a complete graph, V ~ sqrt(E), so this is a significant gain). - Use a vector + sorting in `perfectMatching` instead of an ordered set. This is faster on large working sets. This yields 3.8x speedup on the variable order pass and overall 14% verilation speed gain on a large design.	2022-01-07 12:05:52 +00:00
Geza Lore	9a8c878f2d	Avoid repeated traversal for SC text sections in emit when not needed. Repeatedly traversing whole modules in emit (due to file splitting) looking for `systemc_* sections can add up to a lot of time on large designs that have been flattened and need to be split into many files. Assuming `systemc_* is a rarely used feature, just don't bother if we don't need to. This gains a 9% verilation speed improvement on a large benchmark.	2022-01-07 12:05:50 +00:00
Wilson Snyder	41a563bdc8	Internal cleanups towards recursive functions (#3267)	2022-01-04 20:19:58 -05:00
Yutetsu TAKATSUKASA	4e5f30858b	Fix #3258 of internal error with inout port (#3268) * Tests: Modify t_tri_inout to reproduce #3258 * Set direction of __en according to its main signal direction * Update Changes	2022-01-05 08:37:20 +09:00
Wilson Snyder	b989ac6db5	Internals: Support linking recursive function calls (but not later stages)	2022-01-03 18:50:41 -05:00
Wilson Snyder	4d1f4bbf49	Backout last commit; is unstable.	2022-01-03 13:04:47 -05:00
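
Note on 4b79d23d00: the message describes wrapping AstNode keys so that STL collections compare them by equality rather than by identity. The following is a minimal sketch of that general idea only, not Verilator's code. Node, NodeRef, NodeRefHash, sameTree, hashTree and countDistinct are hypothetical names invented for illustration; the real VNRef interface is not shown in this log and may differ.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <initializer_list>
#include <string>
#include <unordered_set>

// Hypothetical stand-in for an AST node (NOT Verilator's AstNode).
struct Node {
    std::string kind;
    int value;
    // Structural equality: two distinct objects can describe the same tree.
    bool sameTree(const Node& other) const {
        return kind == other.kind && value == other.value;
    }
    // Hash consistent with sameTree(): equal trees produce equal hashes.
    std::size_t hashTree() const {
        return std::hash<std::string>{}(kind) ^ (std::hash<int>{}(value) << 1);
    }
};

// Hypothetical wrapper in the spirit of the VNRef described above:
// it stores a pointer but compares the pointed-to nodes structurally.
struct NodeRef {
    const Node* nodep;
    bool operator==(const NodeRef& other) const {
        return nodep->sameTree(*other.nodep);
    }
};

// Hash functor for NodeRef, delegating to the node's structural hash.
struct NodeRefHash {
    std::size_t operator()(const NodeRef& ref) const { return ref.nodep->hashTree(); }
};

// Counts structurally distinct nodes among the given pointers.
inline std::size_t countDistinct(std::initializer_list<const Node*> nodes) {
    std::unordered_set<NodeRef, NodeRefHash> seen;
    for (const Node* np : nodes) seen.insert(NodeRef{np});
    return seen.size();
}
```

With a plain std::unordered_set<const Node*>, two separately allocated but structurally identical nodes count as two keys. With the wrapper they collapse into one, which is the behavior a dedicated set type would otherwise have to implement itself.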

3251 Commits