unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-10-20 14:58:16 +02:00

Author	SHA1	Message	Date
Laurent Vivier	e140aac281	target-m68k: eor can manage word and byte operands Backports commit eec37aec85af9f5fd59b534d20c86a775b8e7973 from qemu	2018-02-28 01:05:21 -05:00
Laurent Vivier	bc52777b00	target-m68k: add addressing modes to not Backports commit ea4f2a844132c81f1e6b51fed7019686ce4e3bc5 from qemu	2018-02-28 01:03:38 -05:00
Richard Henderson	549e31cc72	target-m68k: Inline addx, subx, negx And add opcodes for 680x0 Backports commit a665a820e5d46b1611f409fbc7a540fe1c6bf5c8 from qemu	2018-02-28 01:02:31 -05:00
Laurent Vivier	b796f934ff	target-m68k: add dbcc Backports commit beff27ab3a60d8abab4a166670ca79b3c0970005 from qemu	2018-02-28 00:45:39 -05:00
Laurent Vivier	977c3fe6c4	target-m68k: add addressing modes to scc Backports commit d5a3cf33f2f65069d2f79a6e349f0d8140f02bb4 from qemu	2018-02-28 00:43:30 -05:00
Laurent Vivier	77b1754376	target-m68k: add exg ops Backports commit 29cf437da4eeacb46cd7076014d06c85ca47c91d from qemu	2018-02-28 00:37:41 -05:00
Laurent Vivier	56882899be	target-m68k: add linkl Backports commit c630e436c0ed3adc3a858c328119daf6d1b3357f from qemu	2018-02-28 00:31:27 -05:00
Laurent Vivier	59d6a1a744	target-m68k: add bkpt instruction Backports commit 71600eda7cc48f03ea306bc69ed7e52ef1d9dd91 from qemu	2018-02-28 00:29:41 -05:00
Emilio G. Cota	22be035e60	target-arm: remove EXCP_STREX + cpu_exclusive_{test, info} The exception is not emitted anymore; remove it and the associated TCG variables. Backports commit 05188cc72f0399e99c92f608a8e7ca4c8e552c4b from qemu	2018-02-28 00:24:20 -05:00
Emilio G. Cota	cb92eea81a	target-arm: emulate aarch64's LL/SC using cmpxchg helpers Emulating LL/SC with cmpxchg is not correct, since it can suffer from the ABA problem. Portable parallel code, however, is written assuming only cmpxchg--and not LL/SC--is available. This means that in practice emulating LL/SC with cmpxchg is a viable alternative. The appended emulates LL/SC pairs in aarch64 with cmpxchg helpers. This works in both user and system mode. In usermode, it avoids pausing all other CPUs to perform the LL/SC pair. The subsequent performance and scalability improvement is significant, as the plots below show. They plot the throughput of atomic_add-bench compiled for ARM and executed on a 64-core x86 machine. Hi-res plots: http://imgur.com/a/JVc8Y atomic_add-bench: 1000000 ops/thread, [0,1] range 18 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 16 ++master +-H--+ ++ \|\| \| 14 ++ ++ \| \| \| 12 ++\| ++ \| \| \| 10 ++++ ++ 8 ++E ++ \|+++ \| 6 ++ \| ++ \| \| \| 4 ++ \| ++ \| \| \| 2 +H++E+--- ++ + \| +E++----+E+---+--+E+----++E+------+E+------+E++----+E+---+--+E\| 0 ++H-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,2] range 18 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 16 ++master +-H--+ ++ \| \| \| 14 ++E ++ \| \| \| 12 ++\| ++ \|+++ \| 10 ++ \| ++ 8 ++ \| ++ \| \| \| 6 ++ \| ++ \| \| \| 4 ++ \| ++ \| +E+--- \| 2 +H+ +E+-----+++ +++ +++ ---+E+-----+E+------+++ +++ + +E+---+--+E+----++E+------+E+--- ++++ +++ + +E\| 0 ++H-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,128] range 70 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 60 ++master +-H--+ +++ ---+E+-----+E+------+E+ \| +E+------E-------+E+--- \| \| --- +++ \| 50 ++ +++--- ++ \| -+E+ \| 40 ++ +++---- ++ \| E- \| \| --\| \| 30 ++ -- +++ ++ \| +E+ \| 20 ++E+ ++ \|E+ \| \| \| 10 ++ ++ + + + + + + + \| 0 +HH-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,1024] range 160 ++---------+---------+----------+---------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 140 ++master +-H--+ +++ +++ \| -+E+-----+E+-------E\| 120 ++ +++ ---- +++ \| +++ ----E-- \| 100 ++ --E--- +++ ++ \| +++ ---- +++ \| 80 ++ --E-- ++ \| ---- +++ \| \| -+E+ \| 60 ++ ---- +++ ++ \| +E+- \| 40 ++ -- ++ \| +E+ \| 20 +EE+ ++ +++ + + + + + + \| 0 +HH-H---H--+-----H---+----------+---------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads Backports commit 1dd089d0eec060dcd8478735114d98421d414805 from qemu	2018-02-28 00:21:27 -05:00
Emilio G. Cota	3546558f66	target-arm: emulate SWP with atomic_xchg helper Backports commit cf12bce088f22b92bf62ffa0d7f6a3e951e355a9 from qemu	2018-02-28 00:11:23 -05:00
Emilio G. Cota	ec14a00925	target-arm: emulate LL/SC using cmpxchg helpers Emulating LL/SC with cmpxchg is not correct, since it can suffer from the ABA problem. Portable parallel code, however, is written assuming only cmpxchg--and not LL/SC--is available. This means that in practice emulating LL/SC with cmpxchg is a viable alternative. The appended emulates LL/SC pairs in ARM with cmpxchg helpers. This works in both user and system mode. In usermode, it avoids pausing all other CPUs to perform the LL/SC pair. The subsequent performance and scalability improvement is significant, as the plots below show. They plot the throughput of atomic_add-bench compiled for ARM and executed on a 64-core x86 machine. Hi-res plots: http://imgur.com/a/aNQpB atomic_add-bench: 1000000 ops/thread, [0,1] range 9 ++---------+----------+----------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 8 +Emaster +-H--+ ++ \| \| \| 7 ++E ++ \| \| \| 6 ++++ ++ \| \| \| 5 ++ \| ++ 4 ++ \| ++ \| \| \| 3 ++ \| ++ \| \| \| 2 ++ \| ++ \|H++E+--- +++ ---+E+------+E+------+E\| 1 +++ +E+-----+E+------+E+------+E+------+E+-- +++ +++ ++ ++H+ + +++ + +++ ++++ + + + \| 0 ++--H----H-+-----H----+----------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,2] range 16 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| 14 ++master +-H--+ ++ \| \| \| 12 ++\| ++ \| E \| 10 ++\| ++ \| \| \| 8 ++++ ++ \|E+\| \| \| \| \| 6 ++ \| ++ \| \| \| 4 ++ \| ++ \| +E+--- +++ +++ +++ ---+E+------+E\| 2 +H+ +E+------E-------+E+-----+E+------+E+------+E+-- +++ + \| + +++ + ++++ + + + \| 0 ++H-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,128] range 70 ++---------+----------+---------+----------+----------+----------+---++ +cmpxchg +-E--+ + + + ++++ + \| 60 ++master +-H--+ ----E------+E+-------++ \| -+E+--- +++ +++ +E\| \| +++ ---- +++ ++\| 50 ++ +++ ---+E+- ++ \| -E--- \| 40 ++ ---+++ ++ \| +++--- \| \| -+E+ \| 30 ++ +++---- ++ \| +E+ \| 20 ++ +++-- ++ \| +E+ \| \|+E+ \| 10 +E+ ++ + + + + + + + \| 0 +HH-H----H-+-----H----+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 1000000 ops/thread, [0,1024] range 120 ++---------+---------+----------+---------+----------+----------+---++ +cmpxchg +-E--+ + + + + + \| \| master +-H--+ ++\| 100 ++ ----E+ \| +++ ---+E+--- ++\| \| --E--- +++ \| 80 ++ ---- +++ ++ \| ---+E+- \| 60 ++ -+E+-- ++ \| +++ ---- +++ \| \| -+E+- \| 40 ++ +++---- ++ \| +++ ---+E+ \| \| -+E+--- \| 20 ++ +E+ ++ \|+E+++ \| +E+ + + + + + + \| 0 +HH-H---H--+-----H---+----------+---------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads Backports commit 354161b37c6465a32073eac5f16fa35939af2bb4 from qemu	2018-02-28 00:07:44 -05:00
Richard Henderson	fd9933fbd5	target-arm: Rearrange aa32 load and store functions Stop specializing on TARGET_LONG_BITS == 32; unconditionally allocate a temp and expand with tcg_gen_extu_i32_tl. Split out gen_aa32_addr, gen_aa32_frob64, gen_aa32_ld_i32 and gen_aa32_st_i32 as separate interfaces. Backports commit 7f5616f53896a4e08ad37de3ac50d3a4cc8eff7a from qemu	2018-02-27 23:59:16 -05:00
Emilio G. Cota	3dc16ebca3	target-i386: remove helper_lock() It's been superseded by the atomic helpers. The use of the atomic helpers provides a significant performance and scalability improvement. Below is the result of running the atomic_add-test microbenchmark with: $ x86_64-linux-user/qemu-x86_64 tests/atomic_add-bench -o 5000000 -r $r -n $n , where $n is the number of threads and $r is the allowed range for the additions. The scenarios measured are: - atomic: implements x86' ADDL with the atomic_add helper (i.e. this patchset) - cmpxchg: implement x86' ADDL with a TCG loop using the cmpxchg helper - master: before this patchset Results sorted in ascending range, i.e. descending degree of contention. Y axis is Throughput in Mops/s. Tests are run on an AMD machine with 64 Opteron 6376 cores. atomic_add-bench: 5000000 ops/thread, [0,1] range 25 ++---------+----------+---------+----------+----------+----------+---++ + atomic +-E--+ + + + + + \| \|cmpxchg +-H--+ \| 20 +Emaster +-N--+ ++ \|\| \| \|++ \| \|\| \| 15 +++ ++ \|N\| \| \|+\| \| 10 ++\| ++ \|+\|+ \| \| \| -+E+------ +++ ---+E+------+E+------+E+-----+E+------+E\| \|+E+E+- +++ +E+------+E+-- \| 5 ++\|+ ++ \|+N+H+--- +++ \| ++++N+--+H++----+++ + +++ --++H+------+H+------+H++----+H+---+--- \| 0 ++---------+-----H----+---H-----+----------+----------+----------+---H+ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 5000000 ops/thread, [0,2] range 25 ++---------+----------+---------+----------+----------+----------+---++ ++atomic +-E--+ + + + + + \| \|cmpxchg +-H--+ \| 20 ++master +-N--+ ++ \|E\| \| \|++ \| \|\|E \| 15 ++\| ++ \|N\|\| \| \|+\|\| ---+E+------+E+-----+E+------+E\| 10 ++\| \| ---+E+------+E+-----+E+--- +++ +++ \|\|H+E+--+E+-- \| \|+++++ \| \| \|\| \| 5 ++\|+H+-- +++ ++ \|+N+ - ---+H+------+H+------ \| + +N+--+H++----+H+---+--+H+----++H+--- + + +H+---+--+H\| 0 ++---------+----------+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 5000000 ops/thread, [0,8] range 40 ++---------+----------+---------+----------+----------+----------+---++ ++atomic +-E--+ + + + + + \| 35 +cmpxchg +-H--+ ++ \| master +-N--+ ---+E+------+E+------+E+-----+E+------+E\| 30 ++\| ---+E+-- +++ ++ \| \| -+E+--- \| 25 ++E ---- +++ ++ \|+++++ -+E+ \| 20 +E+ E-- +++ ++ \|H\|+++ \| \|+\| +H+------- \| 15 ++H+ ---+++ +H+------ ++ \|N++H+-- +++--- +H+------++\| 10 ++ +++ - +++ ---+H+ +++ +H+ \| \| +H+-----+H+------+H+-- \| 5 ++\| +++ ++ ++N+N+--+N++ + + + + + \| 0 ++---------+----------+---------+----------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 5000000 ops/thread, [0,128] range 160 ++---------+---------+----------+---------+----------+----------+---++ + atomic +-E--+ + + + + + \| 140 +cmpxchg +-H--+ +++ +++ ++ \| master +-N--+ E--------E------+E+------++\| 120 ++ --\| \| +++ E+ \| -- +++ +++ ++\| 100 ++ - ++ \| +++- +++ ++\| 80 ++ -+E+ -+H+------+H+------H--------++ \| ---- ---- +++ H\| \| ---+E+-----+E+- ---+H+ ++\| 60 ++ +E+--- +++ ---+H+--- ++ \| --+++ ---+H+-- \| 40 ++ +E+-+H+--- ++ \| +H+ \| 20 +EE+ ++ +N+ + + + + + + \| 0 ++N-N---N--+---------+----------+---------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads atomic_add-bench: 5000000 ops/thread, [0,1024] range 350 ++---------+---------+----------+---------+----------+----------+---++ + atomic +-E--+ + + + + + \| 300 +cmpxchg +-H--+ +++ \| master +-N--+ +++ \|\| \| +++ \| ----E\| 250 ++ \| ----E---- ++ \| ----E--- \| ---+H\| 200 ++ -+E+--- +++ ---+H+--- ++ \| ---- -+H+-- \| \| +E+ +++ ---- +++ \| 150 ++ ---+++ ---+H+- ++ \| --- -+H+-- \| 100 ++ ---+E+ ---- +++ ++ \| +++ ---+E+-----+H+- \| \| -+E+------+H+-- \| 50 ++ +E+ ++ +EE+ + + + + + + \| 0 ++N-N---N--+---------+----------+---------+----------+----------+---++ 0 10 20 30 40 50 60 Number of threads hi-res: http://imgur.com/a/fMRmq For master I stopped measuring master after 8 threads, because there is little point in measuring the well-known performance collapse of a contended lock. Backports commit 37b995f6e7a1cb6fa378c5cd4217b9dd9e1fc98b from qemu	2018-02-27 23:43:22 -05:00
Emilio G. Cota	9d9b7dedac	target-i386: emulate XCHG using atomic helper Backports commit ea97ebe89f7a879ea9aba90140e40c29b5cbd653 from qemu	2018-02-27 23:40:20 -05:00
Emilio G. Cota	8f96b6beb9	target-i386: emulate LOCK'ed BTX ops using atomic helpers Backports commit cfe819d309d472f75fd129faf1d1064a2498326c from qemu	2018-02-27 23:39:21 -05:00
Emilio G. Cota	089965fa8d	target-i386: emulate LOCK'ed XADD using atomic helper Backports commit f53b01817f95781d2bcc8a82e057d1416601e13b from qemu	2018-02-27 23:06:28 -05:00
Emilio G. Cota	f9ed728f27	target-i386: emulate LOCK'ed NEG using cmpxchg helper Backports commit 8eb8c7385608b99bed6055a22d897ff727a6cb8e from qemu	2018-02-27 23:03:28 -05:00
Emilio G. Cota	fedeb0f93e	target-i386: emulate LOCK'ed NOT using atomic helper Backports commit 2a5fe8ae145ef7a3ab480922116d27efcc97b85d from qemu	2018-02-27 23:00:33 -05:00
Emilio G. Cota	05c94546d5	target-i386: emulate LOCK'ed INC using atomic helper Backports commit 60e573462fcdb83aa1a41e66a9f31dc8a4364399 from qemu	2018-02-27 22:56:05 -05:00
Emilio G. Cota	7c7b0fe746	target-i386: emulate LOCK'ed OP instructions using atomic helpers Backports commit a7cee522f3529c2fc85379237b391ea98823271e from qemu	2018-02-27 22:53:46 -05:00
Emilio G. Cota	a386368f82	target-i386: emulate LOCK'ed cmpxchg using cmpxchg helpers The diff here is uglier than necessary. All this does is to turn FOO into: if (s->prefix & PREFIX_LOCK) { BAR } else { FOO } where FOO is the original implementation of an unlocked cmpxchg. Backports commit ae03f8de45427042ecd10b0941a005f21ecc064c from qemu	2018-02-27 22:38:37 -05:00
Richard Henderson	b48508a6c1	tcg: Emit barriers with parallel_cpus Backports commit 91682118aa330aff7e8ef0cc685c32d101f49940 from qemu	2018-02-27 22:28:33 -05:00
Richard Henderson	064543a415	tcg: Add CONFIG_ATOMIC64 Allow qemu to build on 32-bit hosts without 64-bit atomic ops. Even if we only allow 32-bit hosts to multi-thread emulate 32-bit guests, we still need some way to handle the 32-bit guest using a 64-bit atomic operation. Do so by dropping back to single-step. Backports commit df79b996a7b21c6ea7847f7927a2e1a294b86c72 from qemu	2018-02-27 22:25:36 -05:00
Richard Henderson	da01e53757	tcg: Add atomic128 helpers Force the use of cmpxchg16b on x86_64. Wikipedia suggests that only very old AMD64 (circa 2004) did not have this instruction. Further, it's required by Windows 8 so no new cpus will ever omit it. If we truely care about these, then we could check this at startup time and then avoid executing paths that use it. Backports commit 7ebee43ee3e2fcd7b5063058b7ef74bc43216733 from qemu	2018-02-27 21:43:48 -05:00
Richard Henderson	5c0ce1b99c	tcg: Add atomic helpers Add all of cmpxchg, op_fetch, fetch_op, and xchg. Handle both endian-ness, and sizes up to 8. Handle expanding non-atomically, when emulating in serial. Backports commit c482cb117cc418115ca9c6d21a7a2315414c0a40 from qemu	2018-02-27 15:57:47 -05:00
Richard Henderson	0245f93c02	cputlb: Remove includes from softmmu_template.h We already include exec/address-spaces.h and exec/memory.h in cputlb.c; the include of qemu/timer.h appears to be a fossil. Backports commit 40978428853e2f7b4597ab2a9ffeb187333802dc from qemu	2018-02-27 12:40:43 -05:00
Richard Henderson	5c79851143	cputlb: Tidy some macros TGT_LE and TGT_BE are not size dependent and do not need to be redefined. The others are no longer used at all. Backports commit c86c6e4c80fee4d9423bedb10ba9e9c4aa68f861 from qemu	2018-02-27 12:36:25 -05:00
Richard Henderson	4da1cfb902	cputlb: Move most of iotlb code out of line Saves 2k code size off of a cold path. Backports commit 82a45b96a203a7403427183f1afd3d295222ff7d from qemu	2018-02-27 12:34:19 -05:00
Richard Henderson	5df7c9eec7	cputlb: Move probe_write out of softmmu_template.h Backports commit 3b08f0a92545ba06fbdeaae929a5172480300c33 from qemu	2018-02-27 12:25:24 -05:00
Yongbok Kim	79e4c001a9	softmmu: Add probe_write() Probe for whether the specified guest write access is permitted. If it is not permitted then an exception will be taken in the same way as if this were a real write access (and we will not return). Otherwise the function will return, and there will be a valid entry in the TLB for this access. Backports commit 3b4afc9e75ab1a95f33e41f462921093f8a109c4 from qemu	2018-02-27 12:20:50 -05:00
Richard Henderson	1c9c8d3f10	cputlb: Replace SHIFT with DATA_SIZE Backports commit dea2198201b3e0151d75b42774c51cf2ffe2ca4b from qemu	2018-02-27 12:00:33 -05:00
Richard Henderson	e35aacd5ae	tcg: Add EXCP_ATOMIC When we cannot emulate an atomic operation within a parallel context, this exception allows us to stop the world and try again in a serial context. Backports commit fdbc2b5722f6092e47181a947c90fd4bdcc1c121 from qemu Also backports parts of commit 02d57ea115b7669f588371c86484a2e8ebc369be	2018-02-27 11:57:58 -05:00
Richard Henderson	d5510a546f	int128: Add int128_make128 Allows Int128 to be used more generally, rather than having to begin with 64-bit inputs and accumulate. Backports commit 1edaeee0955fba7d834b7c8f4e372e7eae030745 from qemu	2018-02-27 11:06:33 -05:00
Richard Henderson	9084e5fe1b	int128: Use __int128 if available Backports commit 0846beb36641e8f0c3ee55a5bb84d468b653c852 from qemu	2018-02-27 11:03:06 -05:00
Richard Henderson	4fdbe94eea	exec: Avoid direct references to Int128 parts Backports commit 258dfaaad05a5fbe32a142b794e1df3e16501d0e from qemu	2018-02-27 11:01:43 -05:00
Richard Henderson	ba1c63572e	atomics: Add __nocheck atomic operations While the check against sizeof(void *) is appropriate for normal usage within qemu, there are places in which we want wider operaions and have checked for their existance. Backports commit 84bca3927b36fb1d9a2ca85cbbdf9023d2b84678 from qemu	2018-02-27 11:00:20 -05:00
Lioncash	a59eef391e	atomic: MSVC compatible equivalents to some functions	2018-02-27 10:56:04 -05:00
Emilio G. Cota	c837d76a86	atomics: add atomic_op_fetch variants This paves the way for upcoming work. Backports commit 83d0c719f837724d9e3963b078211b2242bdd2a5 from qemu	2018-02-27 10:28:27 -05:00
Emilio G. Cota	102a53aa50	atomics: add atomic_xor This paves the way for upcoming work. Backports commit 61696ddbdc74263ddb6869856772cfe355a5d3bd from qemu	2018-02-27 10:23:31 -05:00
Richard Henderson	3fe8d46a15	atomics: Add parameters to macros Making these functional rather than object macros will prevent later problems with complex macro expansion. Backports commit d1a9f2d12fcfc942924956fbe321aedf4226ccb7 from qemu	2018-02-27 10:21:35 -05:00
Richard Henderson	4168095fed	target-m68k: Optimize gen_flush_flags Backports commit 36f0399d46f2ccf4f6e7451ba46b1e8d0e9ab341 from qemu	2018-02-27 10:19:54 -05:00
Richard Henderson	7403e63f2f	target-m68k: Optimize some comparisons Backports commit 9d896621c1820fd8f437fac26fd7d2e0921091c3 from qemu	2018-02-27 10:15:20 -05:00
Richard Henderson	672a28173f	target-m68k: Use setcond for scc Backports commit b459e3eccfae7fe83e30187c391de00bccf4f51d from qemu	2018-02-27 10:11:35 -05:00
Richard Henderson	ed6feb9329	target-m68k: Introduce DisasCompare Backports commit 6a432295d73df91890dc70c4a94dcc4ba88ad1c3 from qemu	2018-02-27 10:08:32 -05:00
Richard Henderson	4e498cc54d	target-m68k: Reorg flags handling Separate all ccr bits. Continue to batch updates via cc_op. Backports commit 620c6cf66584bfbee90db84a7e87a6eabf230ca9 from qemu	2018-02-27 10:02:02 -05:00
Richard Henderson	121309a4d0	target-m68k: Reorg flags handling Separate all ccr bits. Continue to batch updates via cc_op. Signed-off-by: Richard Henderson <rth@twiddle.net> Fix gen_logic_cc() to really extend the size of the result. Fix gen_get_ccr(): update cc_op as it is used by the helper. Factorize flags computing and src/ccr cleanup Backports commit 620c6cf66584bfbee90db84a7e87a6eabf230ca9 from qemu	2018-02-27 09:30:32 -05:00
Richard Henderson	61ab9a42cd	target-m68k: Remove incorrect clearing of cc_x The CF docs certainly doesnt suggest this is true. Backports commit 18dd87f26bed46f22bb1b9536329c02de500f407 from qemu	2018-02-27 09:21:26 -05:00
Richard Henderson	187c2a9807	target-m68k: Some fixes to SR and flags management Backports commit 99c514485b1d7922c4ca1ed767fd45525de4701f from qemu	2018-02-27 09:19:21 -05:00
Richard Henderson	9493b29399	target-m68k: Print flags properly Backports commit 8e394ccabdb1e439aab092de6b9d2f26432e962f from qemu	2018-02-27 09:17:44 -05:00
Laurent Vivier	57ea90a91f	target-m68k: update CPU flags management Copied from target-i386 Backports commit 9fdb533fb129b19610941bd1e5dd93e7471a18f5 from qemu	2018-02-27 09:15:29 -05:00
Laurent Vivier	125675e334	target-m68k: don't update cc_dest in helpers Backports commit 91f90d7191f862ab27528dbdf76cee55c77f79cf from qemu	2018-02-27 09:04:51 -05:00
Laurent Vivier	12f9ba3fe4	target-m68k: update move to/from ccr/sr Backports commit 7c0eb318bdcc3667a861e7b0f140df0b6d9895e2 from qemu	2018-02-27 08:57:05 -05:00
Laurent Vivier	b8366d5b31	target-m68k: remove m68k_cpu_exec_enter() and m68k_cpu_exec_exit() Update cc_op directly from tcg_gen_insn_start() and restore_state_to_opc() Copied from target-i386 Backports commit 20a8856eba0980fbe9d2b8ed2b33ecdb9c9fe5ad from qemu	2018-02-27 08:53:02 -05:00
Laurent Vivier	a521f4f41d	target-m68k: Replace helper_xflag_lt with setcond Backports commit f9083519034aaa5ad5cd2c5727bd61c29bf60bc5 from qemu	2018-02-27 08:50:44 -05:00
Laurent Vivier	b079255576	target-m68k: allow to update flags with operation on words and bytes Backports commit 5dbb6784b7e2b833c036b4df58aa07067e35f476 from qemu	2018-02-27 08:47:12 -05:00
Laurent Vivier	f069762b61	target-m68k: REG() macro cleanup Backports commit bcc098b0c23b4dd902ff56987d769bd839677331 from qemu	2018-02-27 08:37:25 -05:00
Laurent Vivier	3d59fe56b3	target-m68k: set PAGE_BITS to 12 for m68k Backports commit 2b04e85a3401e13cb19b1de197e6c211eaadca4c from qemu	2018-02-27 08:36:09 -05:00
Laurent Vivier	292fc83c86	target-m68k: define operand sizes Backports commit 7ef25cdd6cee4fa468d6cb913fa064a6689faf7d from qemu	2018-02-27 08:35:13 -05:00
Laurent Vivier	2653165c63	target-m68k: introduce read_imXX() functions Read a 8, 16 or 32bit immediat constant. An immediate constant is stored in the instruction opcode and can be in one or two extension words. Backports commit 28b68cd79ef01e8b1f5bd26718cd8c09a12c625f from qemu	2018-02-27 08:32:04 -05:00
Laurent Vivier	d29cbb70b3	target-m68k: manage scaled index Scaled index is not supported by 68000, 68008, and 68010. EA = (bd + PC) + Xn.SIZE*SCALE + od Ignore it: M68000 FAMILY PROGRAMMER’S REFERENCE MANUAL 2.4 BRIEF EXTENSION WORD FORMAT COMPATIBILITY "If the MC68000 were to execute an instruction that encoded a scaling factor, the scaling factor would be ignored and would not access the desired memory address. The earlier microprocessors do not recognize the brief extension word formats implemented by newer processors. Although they can detect illegal instructions, they do not decode invalid encodings of the brief extension word formats as exceptions." Backports commit d8633620a112296fcf6a6ae9a1cbba614c0ca502 from qemu	2018-02-27 08:27:20 -05:00
Laurent Vivier	fa4a71a1bf	target-m68k: define m680x0 CPUs and features This patch defines height new features: - M68K_FEATURE_SCALED_INDEX, scaled address index register - M68K_FEATURE_LONG_MULDIV, 32bit multiply/divide - M68K_FEATURE_QUAD_MULDIV, 64bit multiply/divide - M68K_FEATURE_BCCL, long conditional branches - M68K_FEATURE_BITFIELD, bit field instructions - M68K_FEATURE_FPU, FPU instructions - M68K_FEATURE_CAS, cas instruction - M68K_FEATURE_BKPT, bkpt instruction Backports commit f076803bbf6ad1618f493f543faff97f3dd0c970 from qemu	2018-02-27 08:26:06 -05:00
John Paul Adrian Glaubitz	2fd7779aa5	target-m68k: Build the opcode table only once to avoid multithreading issues Backports commit b208525797b031c1be4121553e21746686318a38 from qemu	2018-02-27 08:14:35 -05:00
Laurent Vivier	fd84549b3e	target-m68k: fix DEBUG_DISPATCH Backports commit a1ff19302007986fa081738e88905a715bd68e2e from qemu	2018-02-27 08:07:21 -05:00
Daniel P. Berrange	83a5bf2d25	qapi: rename QmpOutputVisitor to QObjectOutputVisitor The QmpOutputVisitor has no direct dependency on QMP. It is valid to use it anywhere that one wants a QObject. Rename it to better reflect its functionality as a generic QAPI to QObject converter. The commit before previous renamed the files, this one renames C identifiers. Backports commit 7d5e199ade76c53ec316ab6779800581bb47c50a from qemu	2018-02-27 08:05:33 -05:00
Daniel P. Berrange	2949a90977	qapi: rename QmpInputVisitor to QObjectInputVisitor The QmpInputVisitor has no direct dependency on QMP. It is valid to use it anywhere that one has a QObject. Rename it to better reflect its functionality as a generic QObject to QAPI converter. The previous commit renamed the files, this one renames C identifiers. Backports commit 09e68369a88d7de0f988972bf28eec1b80cc47f9 from qemu	2018-02-26 15:54:15 -05:00
Daniel P. Berrange	228f122248	qapi: rename qmp--visitor* to qobject--visitor* The QMP visitors have no direct dependency on QMP. It is valid to use them anywhere that one has a QObject. Rename them to better reflect their functionality as a generic QObject to QAPI converter. This is the first of three parts: rename the files. The next two parts will rename C identifiers. The split is necessary to make git rename detection work. Backports commit b3db211f3c80bb996a704d665fe275619f728bd4 from qemu	2018-02-26 15:42:37 -05:00
Peter Maydell	1a850bcb19	target-arm: Implement new HLT trap for semihosting Version 2.0 of the semihosting specification introduces new trap instructions for AArch32: HLT 0xF000 for A32 and HLT 0x3C for T32. Implement these (in the same way we implement the existing HLT semihosting trap for A64). The old traps via SVC and BKPT are unaffected. Backports commit 19a6e31c9d2701ef648b70ddcfc3bf64cec8c37e from qemu	2018-02-26 15:28:45 -05:00
Peter Maydell	db8b0a82b1	cpu: Support a target CPU having a variable page size Support target CPUs having a page size which isn't knownn at compile time. To use this, the CPU implementation should: * define TARGET_PAGE_BITS_VARY * not define TARGET_PAGE_BITS * define TARGET_PAGE_BITS_MIN to the smallest value it might possibly want for TARGET_PAGE_BITS * call set_preferred_target_page_bits() in its realize function to indicate the actual preferred target page size for the CPU (and report any error from it) In CONFIG_USER_ONLY, the CPU implementation should continue to define TARGET_PAGE_BITS appropriately for the guest OS page size. Machines which want to take advantage of having the page size something larger than TARGET_PAGE_BITS_MIN must set the MachineClass minimum_page_bits field to a value which they guarantee will be no greater than the preferred page size for any CPU they create. Note that changing the target page size by setting minimum_page_bits is a migration compatibility break for that machine. For debugging purposes, attempts to use TARGET_PAGE_SIZE before it has been finally confirmed will assert. Backports commit 20bccb82ff3ea09bcb7c4ee226d3160cab15f7da from qemu	2018-02-26 12:29:08 -05:00
Vijaya Kumar K	a7229cc08a	translate-all.c: Compute L1 page table properties at runtime Remove L1 page mapping table properties computing statically using macros which is dependent on TARGET_PAGE_BITS. Drop macros V_L1_SIZE, V_L1_SHIFT, V_L1_BITS macros and replace with variables which are computed at early stage of VM boot. Removing dependency can help to make TARGET_PAGE_BITS dynamic. Backports commit 66ec9f49399f0a9fa13ee77c472caba0de2773fc from qemu	2018-02-26 11:46:58 -05:00
Vijaya Kumar K	3082b4e4ec	exec.c: Remove static allocation of sub_section of sub_page Allocate sub_section dynamically. Remove dependency on TARGET_PAGE_SIZE to make run-time page size detection for arm platforms. Backports commit 2615fabd42ea0078dd9e659bdb21a5b7a1f87a9a from qemu	2018-02-26 10:50:04 -05:00
Paolo Bonzini	eb75004013	memory: add a per-AddressSpace list of listeners This speeds up MEMORY_LISTENER_CALL noticeably. Right now, with many PCI devices you have N regions added to M AddressSpaces (M = # PCI devices with bus-master enabled) and each call looks up the whole listener list, with at least M listeners in it. Because most of the regions in N are BARs, which are also roughly proportional to M, the whole thing is O(M^3). This changes it to O(M^2), which is the best we can do without rewriting the whole thing. Backports commit 9a54635dcb51a3fcf7507af630168f514a8cd4e7 from qemu	2018-02-26 10:46:50 -05:00
Paolo Bonzini	4b06e8bbb7	memory: eliminate global MemoryListeners There is none, so just drop the code. Backports commit d45fa784cd0c111131696808d1168259d66b7519 from qemu	2018-02-26 10:19:28 -05:00
Paolo Bonzini	8734e13a73	tcg: try sti when moving a constant into a dead memory temp This comes from free from unifying tcg_reg_alloc_mov and tcg_reg_alloc_movi's handling of TEMP_VAL_CONST. It triggers often on moves to cc_dst, such as the following translation of "sub $0x3c,%esp": before: after: subl $0x3c,%ebp subl $0x3c,%ebp movl %ebp,0x10(%r14) movl %ebp,0x10(%r14) movl $0x3c,%ebx movl $0x3c,0x2c(%r14) movl %ebx,0x2c(%r14) Backports commit 0fe4fca4e1a5e06a270127dd80bb753d4dda61c6 from qemu	2018-02-26 10:08:47 -05:00
Paolo Bonzini	be00a3e100	target-i386: fix 32-bit addresses in LEA This was found with test-i386. The issue is that instructions such as addr32 lea (%eax), %rax did not perform a 32-bit extension, because the LEA translation skipped the gen_lea_v_seg step. That step does not just add segments, it also takes care of extending from address size to pointer size. Backports commit 620abfb004543404bef1953e25da2ad77352941a from qemu	2018-02-26 10:06:08 -05:00
Paolo Bonzini	8b239bd48b	atomic: base mb_read/mb_set on load-acquire and store-release This introduces load-acquire and store-release operations in QEMU. For now, just use them as an implementation detail of atomic_mb_read and atomic_mb_set. Since docs/atomics.txt documents that atomic_mb_read only synchronizes with an atomic_mb_set of the same variable, we can use the new implementation everywhere instead of seq-cst loads and stores. Backports commit 803cf26a9e019b5d2256a8edeb22e3538c4f3261 from qemu	2018-02-26 10:02:46 -05:00
Paolo Bonzini	fd7ef4c184	atomic: introduce smp_mb_acquire and smp_mb_release	2018-02-26 09:58:22 -05:00
Eduardo Habkost	b41bb81737	target-i386: Don't use cpu->migratable when filtering features When explicitly enabling unmigratable flags using "-cpu host" (e.g. "-cpu host,+invtsc"), the requested feature won't be enabled because cpu->migratable is true by default. This is inconsistent with all other CPU models, which don't have the "migratable" option, making "+invtsc" work without the need for extra options. This happens because x86_cpu_filter_features() uses cpu->migratable as an argument for x86_cpu_get_supported_feature_word(). This is not useful because: 2) on "-cpu host" it only makes QEMU disable features that were explicitly enabled in the command-line; 1) on all the other CPU models, cpu->migratable is already false. The fix is to just use 'false' as an argument to x86_cpu_get_supported_feature_word() in x86_cpu_filter_features(). Note that: * This won't change anything for people using using "-cpu host" or "-cpu host,migratable=<on\|off>" (with no extra features) because the x86_cpu_get_supported_feature_word() call on the cpu->host_features check uses cpu->migratable as argument. * This won't change anything for any CPU model except "host" because they all have cpu->migratable == false (and only "host" has the "migratable" property that allows it to be changed). * This will only change things for people using "-cpu host,+<feature>", where <feature> is a non-migratable feature. The only existing named non-migratable feature is "invtsc". In other words, this change will only affect people using "-cpu host,+invtsc" (that will now get what they asked for: the invtsc flag will be enabled). All other use cases are unaffected. Backports commit 46c032f3afcc05a0123914609f1003906ba63fda from qemu	2018-02-26 09:51:14 -05:00
Eduardo Habkost	4096ce0184	target-i386: x86_cpu_load_features() function When probing for CPU model information, we need to reuse the code that initializes CPUID fields, but not the remaining side-effects of x86_cpu_realizefn(). Move that code to a separate function that can be reused later. Backports commit 41f3d4d69a423dadb8431fda65d8d7c68c0de0fc from qemu	2018-02-26 09:49:34 -05:00
Eduardo Habkost	aa98c8a93f	target-i386: Move warning code outside x86_cpu_filter_features() x86_cpu_filter_features() will be reused by code that shouldn't print any warning. Move the warning code to a new x86_cpu_report_filtered_features() function, and call it from x86_cpu_realizefn(). Backports commit 8ca30e8673aff9bfcf8f969f8db4266b5f62e49c from qemu	2018-02-26 09:40:11 -05:00
Eduardo Habkost	08bfa41e1b	target-i386: xsave: Add FP and SSE bits to x86_ext_save_areas Instead of treating the FP and SSE bits as special cases, add them to the x86_ext_save_areas array. This will simplify the code that calculates the supported xsave components and the size of the xsave area. Backports commit e3c9022b4e2b6a4deb6518361d2bbf33522b9198 from qemu	2018-02-26 09:37:48 -05:00
Eduardo Habkost	54bd827472	target-i386: Register properties for feature aliases manually Instead of keeping the aliases inside the feature name arrays and require parsing the strings, just register alias properties manually. This simplifies the code for property registration and lookup. Backports commit 16d2fcaa509b1ca56eb2fcd8fe877279cf65cccc from qemu	2018-02-26 09:34:52 -05:00
Eduardo Habkost	b508b9e02a	target-i386: Remove underscores from feat_names arrays Instead of translating the feature name entries when adding property names, store the actual property names in the feature name array. For reference, here is the full list of functions that use FeatureWordInfo::feat_names: * x86_cpu_get_migratable_flags(): not affected, as it just check for non-NULL values. * report_unavailable_features(): informative only. It will start printing feature names with hyphens. * x86_cpu_list(): informative only. It will start printing feature names with hyphens * x86_cpu_register_feature_bit_props(): not affected, as it was already calling feat2prop(). Now we can remove the feat2prop() calls safely. So, the only user-visible effect of this patch are the new names being used in help and error messages for users. Backports commit fc7dfd205f3287893c436d932a167bffa30579c8 from qemu	2018-02-26 09:33:15 -05:00
Eduardo Habkost	6d1a7bccb5	target-i386: Disable VME by default with TCG VME is already disabled automatically when using TCG. So, instead of pretending it is there when reporting CPU model data on query-cpu-* QMP commands (making every CPU model to be reported as not runnable), we can disable it by default on all CPU models when using TCG. Do that by adding a tcg_default_props array that will work like kvm_default_props. Backports commit 04d99c3c61f4bdc0450dbeb6512b6dd743baca65 from qemu	2018-02-26 08:23:44 -05:00
Eduardo Habkost	594cbeaa06	target-i386: List CPU models using subclass list Instead of using the builtin_x86_defs array, use the QOM subclass list to list CPU models on "-cpu ?" and "query-cpu-definitions". Backports commit ee465a3ef77c2b2975ffa71c72208c05b3f3970d from qemu	2018-02-26 08:17:04 -05:00
Peter Maydell	200771d0ba	target-arm: Add trace events for the generic timers Backports commit 194cbc492bcc8f3f1868ec97a35146bc99c3c71c from qemu	2018-02-26 08:15:42 -05:00
Peter Maydell	158bfc109a	target-arm: Implement dummy MDCCINT_EL1 MDCCINT_EL1 is part of the DCC debugger communication channel between the CPU and an attached external debugger. QEMU doesn't implement this, but since Linux may try to access this register we need to provide at least a dummy implementation. Backports commit 5dbdc4342f479d799a1970dd5fd22e64c9dcd50d from qemu	2018-02-26 08:11:54 -05:00
Peter Maydell	f2dcb81b27	Fix masking of PC lower bits when doing exception returns In commit 9b6a3ea7a699594 store_reg() was changed to mask both bits 0 and 1 of the new PC value when in ARM mode. Unfortunately this broke the exception return code paths when doing a return from ARM mode to Thumb mode: in some of these we write a new CPSR including new Thumb mode bit via gen_helper_cpsr_write_eret(), and then use store_reg() to write the new PC. In this case if the new CPSR specified Thumb mode then masking bit 1 of the PC is incorrect (these code paths correspond to the v8 ARM ARM pseudocode function AArch32.ExceptionReturn(), which always aligns the new PC appropriately for the new instruction set state). Instead of using store_reg() in exception-return code paths, call a new store_pc_exc_ret() which stores the raw new PC value to env->regs[15], and then mask it appropriately in the subsequent helper_cpsr_write_eret() where the new env->thumb state is available. This fixes a bug introduced by 9b6a3ea7a699594 which caused crashes/hangs or otherwise bad behaviour for Linux when userspace was using Thumb. Backports commit fb0e8e79a9d77ee240dbca036fa8698ce654e5d1 from qemu	2018-02-26 08:09:28 -05:00
Thomas Hanson	c69ae10ca7	target-arm: Comments added to identify cases in a switch 3 cases in a switch in disas_exc() require reference to the ARM ARM spec in order to determine what case they're handling. Backports commit 957956b3013c8122a749dfe61a41aef8b4100e31 from qemu	2018-02-26 08:05:49 -05:00
Thomas Hanson	00d1803436	target-arm: Code changes to implement overwrite of tag field on PC load For BR, BLR and RET instructions, if tagged addresses are enabled, the tag field in the address must be cleared out prior to loading the address into the PC. Depending on the current EL, it will be set to either all 0's or all 1's. Backports commit 6feecb8b941f2d21e5645d0b6e0cdb776998121b from qemu	2018-02-26 08:04:00 -05:00
Thomas Hanson	2af4ca54e9	target-arm: Infrastucture changes to enable handling of tagged address loading into PC When capturing the current CPU state for the TB, extract the TBI0 and TBI1 values from the correct TCR for the current EL and then add them to the TB flags field. Then, at the start of code generation for the block, copy the TBI fields into the DisasContext structure. Backports commit 86fb3fa4ed5873b021a362ea26a021f4aeab1bb4 from qemu	2018-02-26 07:58:17 -05:00
Marc-André Lureau	be6e25bcc7	qapi: return a 'missing parameter' error The 'old' dispatch code returned a QERR_MISSING_PARAMETER for missing parameters, but the qapi qmp_dispatch() code uses QERR_INVALID_PARAMETER_TYPE. Improve qapi code to return QERR_MISSING_PARAMETER where appropriate. Fix expected error message in iotests. Backports commit 1382d4abdf9619985e4078e37e49e487cea9935e from qemu	2018-02-26 05:19:53 -05:00
Marc-André Lureau	ddc25c8aaf	qapi: assert list entry has a value This helps to figure out the expectations. Backports commit eac8e79ff749fc15e1dca4caccf1f38664ab4915 from qemu	2018-02-26 05:15:32 -05:00
Marc-André Lureau	bd469af15f	qapi: add assert about root value qiv->root should not be null, make that clearer with some assert. Backports commit 5d0cbbcfeb59e1e3f5ee7d26b8a215382f6d9abd from qemu	2018-02-26 05:15:01 -05:00
Marc-André Lureau	1a138915a5	qapi: Fix crash when 'any' or 'null' parameter is missing Unlike the other visit methods, visit_type_any() and visit_type_null() neglect to check whether qmp_input_get_object() succeeded. They crash when it fails. Reproducer: { "execute": "qom-set", "arguments": { "path": "/machine", "property": "rtc-time" } } Will crash with: qapi/qapi-visit-core.c:277: visit_type_any: Assertion `!err != !*obj' failed Broken in commit 5c678ee. Fix by adding the missing error checks. Backports commit c489780203f9b22aca5539ec7589b7140bdc951f from qemu	2018-02-26 05:13:54 -05:00
Alex Bennée	fbf6fb1e25	atomic.h: fix __SANITIZE_THREAD__ build Only very modern GCC's actually set this define when building with the ThreadSanitizer so this little typo slipped though. Backports commit 23ea7f57949f2f5934f4d5bbc29fe321b3a7067b from qemu	2018-02-26 05:12:17 -05:00
Alex Bennée	d4cb954102	cpu: atomically modify cpu->exit_request ThreadSanitizer picks up potential races although we already use barriers to ensure things are in the correct order when processing exit requests. For true C11 defined behaviour across threads we need to use relaxed atomic_set/atomic_read semantics to reassure tsan. Backports commit 027d9a7d2911e993cdcbd21c7c35d1dd058f05bb from qemu	2018-02-26 05:11:18 -05:00
Alex Bennée	e1cf9ca84a	qom/cpu: atomically clear the tb_jmp_cache The ThreadSanitizer rightly complains that something initialised with a normal access is later updated and read atomically. Backports commit ce7cf6a973f4b614162b9518954d441fa5e32fc6 from qemu	2018-02-26 05:09:05 -05:00
Alex Bennée	12d7e946a1	qom/object: update class cache atomically The idiom CPU_GET_CLASS(cpu) is fairly extensively used in various threads and trips of ThreadSanitizer due to the fact it updates obj->class->object_cast_cache behind the scenes. As this is just a fast-path cache there is no need to lock updates. However to ensure defined C11 behaviour across threads we need to use the plain atomic_read/set primitives and keep the sanitizer happy. Backports commit b6b3ccfda015dcd5ab50f70c189ee5cc6c622e91 from qemu	2018-02-26 05:06:40 -05:00
Alex Bennée	bf72733576	tcg/optimize: move default return out of if statement This is to appease sanitizer builds which complain that: "error: control reaches end of non-void function" Backports commit 550276ae0a88851edda2cb7fcdd64256dbb8e314 from qemu	2018-02-26 05:05:21 -05:00

1 2 3 4 5 ...

2105 Commits