verilator

Commit Graph

Author	SHA1	Message	Date
Wilson Snyder	173f57c636	Changed --no-merge-const-pool to -fno-merge-const-pool (#3436 ).	2022-06-03 19:41:59 -04:00
Geza Lore	b51f887567	Perform VCD tracing in parallel when using --threads (#3449 ) VCD tracing is now parallelized using the same thread pool as the model. We achieve this by breaking the top level trace functions into multiple top level functions (as many as --threads), and after emitting the time stamp to the VCD file on the main thread, we execute the tracing functions in parallel on the same thread pool as the model (which we pass to the trace file during registration), tracing into a secondary per thread buffer. The main thread will then stitch (memcpy) the buffers together into the output file. This makes the `--trace-threads` option redundant with `--trace`, which now only affects `--trace-fst`. FST tracing uses the previous offloading scheme. This obviously helps a lot in VCD tracing performance, and I have seen better than Amdahl speedup, namely I get 3.9x on XiangShan 4T (2.7x on OpenTitan 4T).	2022-05-29 19:08:39 +01:00
Geza Lore	b130a8cfeb	Add -DVM_TRACE_VCD in model builds with Make with --trace	2022-05-20 16:44:38 +01:00
Geza Lore	551bd284dd	Rename some internals related to multi-threaded tracing Rename the implementation internals of current multi-threaded tracing to be "offload mode". No functional change, nor user interface change intended.	2022-05-20 16:44:35 +01:00
Geza Lore	b1b5b5dfe2	Improve run-time profiling The --prof-threads option has been split into two independent options: 1. --prof-exec, for collecting verilator_gantt and other execution related profiling data, and 2. --prof-pgo, for collecting data needed for PGO The implementation of execution profiling is extricated from VlThreadPool and is now a separate class VlExecutionProfiler. This means --prof-exec can now be used for single-threaded models (though it does not measure a lot of things just yet). For consistency VerilatedProfiler is renamed VlPgoProfiler. Both VlExecutionProfiler and VlPgoProfiler are in verilated_profiler.{h/cpp}, but can be used completely independently. Also re-worked the execution profile format so it now only emits events without holding onto any temporaries. This is in preparation for some future optimizations that would be hindered by the introduction of function locals via AstText. Also removed the Barrier event. Clearing the profile buffers is not notably more expensive as the profiling records are trivially destructible.	2022-03-27 15:57:30 +02:00
Wilson Snyder	441ecfedc9	Internals: Make all .h files compilable	2022-01-08 11:18:23 -05:00
Wilson Snyder	ca42be982c	Copyright year update.	2022-01-01 08:26:40 -05:00
Wilson Snyder	899de9a282	Add --lib-create, similar to --protect-lib but without protections (#3200 ).	2021-11-14 09:39:31 -05:00
Geza Lore	cdeb6e792f	Add --instr-count-dpi option, change default to 200 This replaces the former static AstNode::INSTR_COUNT_DPI, and makes it user adjustable to fit the design. Fixes #3068.	2021-07-25 16:40:12 +01:00
Geza Lore	add3811f46	Internals: Fix debug prints racing with option parsing. debug() declared by VL_DEGUB_FUNC used to cache the result of the debug level lookup (which depends on options) in a static. This meant that if the debug() function was called before option parsing, the default debug level of 0 would be used for the rest of the program, even if a --debug option was given. Fixed by not caching the debug level until after option parsing is complete.	2021-07-10 12:57:40 +01:00
Wilson Snyder	61e2e55ba5	Internals: Fix coverage holes. No functional change.	2021-07-09 18:11:59 -04:00
Wilson Snyder	36599133bf	Add --prof-c to pass profiling to compiler (#3059 ).	2021-07-07 19:12:52 -04:00
Geza Lore	ec1c112791	Remove deprecated --inhibit-sim (#3035 )	2021-06-21 12:38:42 -04:00
Geza Lore	9eafca5e28	Remove deprecated --no-relative-cfuncs (#3024 )	2021-06-16 23:17:43 -04:00
Geza Lore	c207e98306	Implement a distinct constant pool (#3013 ) What previously used to be per module static constants created in V3Table and V3Prelim are now merged globally within the whole model and emitted as part of a separate constant pool. Members of the constant pool are global variables which are declared lazily when used (similar to loose methods).	2021-06-13 15:05:55 +01:00
Wilson Snyder	1e89392e76	Add --expand-limit argument (#3005 ).	2021-06-06 10:27:01 -04:00
Geza Lore	38cab569ed	Add --reloop-limit argument (#2960 ) Add --reloop-limit argument	2021-05-15 18:04:40 +01:00
Wilson Snyder	c62546c761	Add --coverage-max-width (#2853 ).	2021-03-29 18:54:51 -04:00
Yutetsu TAKATSUKASA	4e41c13501	Structurize option parser (#2809 ) Add V3OptionsParser that can suggest correct option. Co-authored-by: Wilson Snyder <wsnyder@wsnyder.org> Co-authored-by: github action <action@example.com>	2021-03-26 22:48:24 +09:00
Wilson Snyder	3a55600913	Internals: Restyle with C++11 using replacing typedef	2021-03-12 18:10:45 -05:00
Wilson Snyder	5a4e4b2dcd	Add -Oo to disable const bit tree (#2830 ).	2021-03-10 17:47:31 -05:00
Wilson Snyder	9483ebefae	Internal code coverage cleanups.	2021-03-07 21:05:15 -05:00
Wilson Snyder	be31fdcfe4	Use Google-style-guide header guard naming, to avoid __ prefix.	2021-03-03 21:57:07 -05:00
Wilson Snyder	bd602d0e2d	Copyright year update	2021-01-01 10:29:54 -05:00
Wilson Snyder	b6ded59c2b	Internals: Use and enforce class final for ~5% performance boost.	2020-11-18 21:32:16 -05:00
Wilson Snyder	44eb362a18	clang-tidy cleanups. No functional change intended.	2020-11-10 21:40:14 -05:00
Wilson Snyder	51b0963e61	Internals: Favor const for map keys. No functional change intended.	2020-10-30 18:00:40 -04:00
Wilson Snyder	cc134b38ee	Internal coverage: Misc cleanups	2020-09-07 13:11:44 -04:00
Wilson Snyder	993115d30a	Cleanup and test EmitV for internal coverage	2020-09-07 12:58:30 -04:00
Wilson Snyder	d2fac4aa2f	Internals: Add --debug-exit-uvm	2020-08-23 09:05:18 -04:00
Wilson Snyder	b67f1f0e94	Fix GCC warnings	2020-08-18 08:10:44 -04:00
Wilson Snyder	78aee6f4e7	C++11: Use sized enums (+4% performance).	2020-08-16 12:05:35 -04:00
Wilson Snyder	034737d2a8	C++11: Use member declaration initalizations (in nodes). No functional change intended.	2020-08-16 11:44:06 -04:00
Wilson Snyder	72d2cff0a1	C++11: Use member declaration initalizations. No functional change intended.	2020-08-16 11:44:06 -04:00
Yutetsu TAKATSUKASA	953a442827	Support hierarchical verilation using protect lib (#2206 )	2020-08-15 09:43:53 -04:00
Yutetsu TAKATSUKASA	30600bf1a3	Internals: Add const	2020-07-03 14:16:43 -04:00
Wilson Snyder	6e2d8df9e5	Tests: Add --debug-exit-parse	2020-06-08 22:10:55 -04:00
Wilson Snyder	6ce878cb0d	Fix some clang-tidy warnings	2020-06-01 23:16:17 -04:00
Geza Lore	656c460605	Add --dump-tree-addrids developer option	2020-05-31 20:21:55 +01:00
Geza Lore	fe306a36b8	Add MergeCond pass to combine assignments with ?: on rhs (#2376 ) This provides minor simulation performance benefit, but can provide large C++ compilation time improvement, notably with Clang (4x). This patch implements #2366 .	2020-05-30 21:09:05 +01:00
Geza Lore	18870f8b62	Remove remnants of long removed --trace-dups option See #2385	2020-05-30 20:16:40 +01:00
Wilson Snyder	ebda8f866c	Cleanup codacity and missing consts.	2020-05-28 21:04:36 -04:00
Wilson Snyder	279f21bb5b	Configure now enables SystemC if it is installed as a system headers.	2020-05-28 18:51:46 -04:00
Stefan Wallentowitz	dc90e6c3c3	Generate file with waivers (#2354 ) This adds the flag --generate-waivefile <filename>. This will generate a verilator config file with the proper lint_off statemens to turn off warnings emitted during this particular run. This feature can be used to start with using Verilator as linter and systematically capture all known lint warning for further elimination. It hopefully helps people turning of -Wno-fatal or -Wno-lint and gradually improve their code base. Signed-off-by: Stefan Wallentowitz <stefan.wallentowitz@hm.edu>	2020-05-26 20:38:14 +02:00
Wilson Snyder	29695adf70	Fix 10s/100s timeunits.	2020-05-11 08:15:52 -04:00
Geza Lore	dd967f7769	Improve trace buffer memory utilization and performance. Convert trace buffer to 32-bit entries, rather than a union containing a pointer type. Also tweaked trace entry layouts for a bit more performance. This gains another 10% on SweRV EH1 CoreMark.	2020-04-27 19:00:17 +01:00
Wilson Snyder	77915f78db	Add experimental-only option.	2020-04-21 20:45:23 -04:00
Geza Lore	c52f3349d1	Initial implementation of generic multithreaded tracing (#2269 ) The --trace-threads option can now be used to perform tracing on a thread separate from the main thread when using VCD tracing (with --trace-threads 1). For FST tracing --trace-threads can be 1 or 2, and --trace-fst --trace-threads 1 is the same a what --trace-fst-threads used to be (which is now deprecated). Performance numbers on SweRV EH1 CoreMark, clang 6.0.0, Intel i7-3770 @ 3.40GHz, IO to ramdisk, with numactl set to schedule threads on different physical cores. Relative speedup: --trace -> --trace --trace-threads 1 +22% --trace-fst -> --trace-fst --trace-threads 1 +38% (as --trace-fst-thread) --trace-fst -> --trace-fst --trace-threads 2 +93% Speed relative to --trace with no threaded tracing: --trace 1.00 x --trace --trace-threads 1 0.82 x --trace-fst 1.79 x --trace-fst --trace-threads 1 1.23 x --trace-fst --trace-threads 2 0.87 x This means FST tracing with 2 extra threads is now faster than single threaded VCD tracing, and is on par with threaded VCD tracing. You do pay for it in total compute though as --trace-fst --trace-threads 2 uses about 240% CPU vs 150% for --trace-fst --trace-threads 1, and 155% for --trace --trace threads 1. Still for interactive use it should be helpful with large designs.	2020-04-21 23:49:07 +01:00
James Hanlon	97cbc10925	Add --flaten for use with --xml-only (#2270 ).	2020-04-21 18:14:08 -04:00
James Hanlon	65cd4f6047	Fix comment and add to CONTRIBUTORS (#2270 ).	2020-04-21 18:11:53 -04:00

1 2 3 4 5

218 Commits