unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-10-21 16:58:18 +02:00

Author	SHA1	Message	Date
Richard Henderson	bc55b3e570	target/arm: Implement SVE Predicate Count Group Backports commit 9ee3a611de28b8d0862fa687215b04b5aad20747 from qemu	2018-06-15 13:49:58 -04:00
Richard Henderson	bb930f35b0	target/arm: Implement SVE Partition Break Group Backports commit 35da316f5e847292ffbe7b6d16cd3988043dfe22 from qemu	2018-06-15 13:42:35 -04:00
Richard Henderson	ade246e87b	target/arm: Implement SVE Integer Compare - Immediate Group Backports commit 38cadeba0daf0f16cf2aeaa5b2752b26fb0676c5 from qemu	2018-06-15 13:35:40 -04:00
Richard Henderson	2969a38d61	target/arm: Implement SVE Integer Compare - Vectors Group Backports commit 757f9cff1b63895bfd6fc8d66a6e52d7c40baa7b from qemu	2018-06-15 13:29:15 -04:00
Richard Henderson	7211d415a4	target/arm: Implement SVE Select Vectors Group Backports commit d3fe4a29d754dee73cbf3cb7584db222981179ac from qemu	2018-06-15 13:17:47 -04:00
Richard Henderson	7698c1634e	target/arm: Implement SVE vector splice (predicated) Backports commit b48ff24098c72f86e187e6abb7e9ca4de40a7fb4 from qemu	2018-06-15 13:14:33 -04:00
Richard Henderson	7d930e8515	target/arm: Implement SVE reverse within elements Backports commit dae8fb9019d2aa6ccb151a19871df40de6c98e29 from qemu	2018-06-15 13:12:14 -04:00
Richard Henderson	0bb2fdd752	target/arm: Implement SVE conditionally broadcast/extract element Backports commit ef23cb726dc32375bc2fca7ac3e9f34816f6ee13 from qemu	2018-06-15 13:01:40 -04:00
Richard Henderson	8ba3bde59b	target/arm: Implement SVE compress active elements Backports commit 3ca879aeb3412bc2be35d01a7bedf5fada960b5d from qemu	2018-06-15 12:52:19 -04:00
Richard Henderson	d9ed221567	target/arm: Implement SVE Permute - Interleaving Group Backports commit 234b48e9c68759aea78ff5a1e49c2ba806cd1d83 from qemu	2018-06-15 12:49:42 -04:00
Richard Henderson	3722ab310b	target/arm: Implement SVE Permute - Predicates Group Backports commit d731d8cb3c74258669211f065c918353eb7b8f4a from qemu	2018-06-15 12:44:50 -04:00
Richard Henderson	c57ff23c56	target/arm: Implement SVE Permute - Unpredicated Group Backports commit 30562ab716bcec0bf718b47b5268949856b17604 from qemu	2018-06-15 12:37:56 -04:00
Peter Maydell	7a6ae26346	cputlb: Pass cpu_transaction_failed() the correct physaddr The API for cpu_transaction_failed() says that it takes the physical address for the failed transaction. However we were actually passing it the offset within the target MemoryRegion. We don't currently have any target CPU implementations of this hook that require the physical address; fix this bug so we don't get confused if we ever do add one. Backports commit 2d54f19401bc54b3b56d1cc44c96e4087b604b97 from qemu	2018-06-15 12:03:23 -04:00
Richard Henderson	6835b2dd13	target/arm: Implement SVE Permute - Extract Group Backports commit b94f8f60bd841c5b737185cd38263e26822f77ab from qemu	2018-05-20 05:26:55 -04:00
Richard Henderson	9917f0d536	target/arm: Implement SVE Integer Wide Immediate - Predicated Group Backports commit f25a2361539626721dbccce14c077cad03b2e72c from qemu	2018-05-20 05:24:04 -04:00
Richard Henderson	89038c1e4b	target/arm: Implement SVE Element Count Group Backports commit 24e82e68341e73ec0f65534c78c13fd03395b188 from qemu	2018-05-20 05:15:35 -04:00
Richard Henderson	0249ab3f7e	target/arm: Implement SVE floating-point trig select coefficient Backports commit a1f233f25fd502f9a5b40c14df1b4dbdda463487 from qemu	2018-05-20 05:05:20 -04:00
Richard Henderson	d6c18fc788	target/arm: Implement SVE floating-point exponential accelerator Backports commit 0762cd428fd7b471207f5cb5b4bd4bd8f141dbe0 from qemu	2018-05-20 05:01:16 -04:00
Richard Henderson	cb55a3acdb	target/arm: Implement SVE Compute Vector Address Group Backports commit 4b242d9c1b6beaf5c81d84e956243b614a4a1d84 from qemu	2018-05-20 04:57:18 -04:00
Richard Henderson	45e009269e	target/arm: Implement SVE Bitwise Shift - Unpredicated Group Backports commit d9d78dccc86eed10ccf1c8e1ac236e41ec330b06 from qemu	2018-05-20 04:51:58 -04:00
Richard Henderson	45a09e2f25	target/arm: Implement SVE Index Generation Group Backports commit 9a56c9c3a955b77fe436beef7ac03c76a65fa32d from qemu	2018-05-20 04:43:01 -04:00
Richard Henderson	1730d3cff0	target/arm: Implement SVE Integer Multiply-Add Group Backports commit 96a36e4a44bbf296ac212ed68ebf4e48d3dfb1f0 from qemu	2018-05-20 04:35:36 -04:00
Richard Henderson	32949156d2	target/arm: Implement SVE Integer Arithmetic - Unary Predicated Group Backports commit afac6d0467c1327ad2e30a3c35347fcf5a773742 from qemu	2018-05-20 04:31:18 -04:00
Lioncash	878b862a04	target/arm: Implement SVE bitwise shift by wide elements (predicated)	2018-05-20 03:10:24 -04:00
Richard Henderson	5aa51a3a74	target/arm: Implement SVE bitwise shift by vector (predicated) Backports commit 27721dbb7ae5e2a52f06588cf38854e4cbc613c0 from qemu	2018-05-20 03:07:02 -04:00
Richard Henderson	7bb3067b95	target/arm: Implement SVE bitwise shift by immediate (predicated) Backports commit ccd841c3d71db6943f8b6d3d56bd2abb548ba40c from qemu	2018-05-20 03:01:07 -04:00
Richard Henderson	837e39ea63	target/arm: Implement SVE Integer Reduction Group Excepting MOVPRFX, which isn't a reduction. Presumably it is placed within the group because of its encoding. Backports commit 047cec971d2791b206677b954227ea92ff7ee3db from qemu	2018-05-20 02:53:04 -04:00
Richard Henderson	331aabddeb	target/arm: Implement SVE Predicate Misc Group Backports commit 028e2a7b876631eff165cac59eb43bdb2dcc213b and f97cfd596ed9bd38644323cb61d19b85ac703c81 from qemu	2018-05-20 02:43:36 -04:00
Richard Henderson	65f74e3608	target/arm: Implement SVE Predicate Logical Operations Group Backports commit 516e246a1a292f6c6f6aad5451799accbb08acd9 from qemu	2018-05-20 01:35:59 -04:00
Lioncash	1eaa2e4571	target/arm: Implement SVE predicate test	2018-05-20 01:16:16 -04:00
Richard Henderson	49def4bbde	target/arm: Add SVE decode skeleton Including only 4, as-yet unimplemented, instruction patterns so that the whole thing compiles. Backports commit 38388f7ee3adc04a7e7246c04352451c4f8d00fb from qemu	2018-05-20 00:48:14 -04:00
Richard Henderson	d2d8e2fc33	target/arm: Introduce translate-a64.h Move some stuff that will be common to both translate-a64.c and translate-sve.c. Backports commit 8c71baedb8055beaa681823206ee3a74f9f8649a from qemu	2018-05-20 00:34:25 -04:00
Alex Bennée	40d57900bf	target/arm: convert conversion helpers to fpst/ahp_flag Instead of passing env and leaving it up to the helper to get the right fpstatus we pass it explicitly. There was already a get_fpstatus helper for neon for the 32 bit code. We also add a get_ahp_flag() for passing the state of the alternative FP16 format flag. This leaves scope for later tracking the AHP state in translation flags. Backports commit 486624fcd3eaca6165ab8401d73bbae6c0fb81c1 from qemu	2018-05-19 22:58:25 -04:00
Alex Bennée	80074e4745	target/arm: Implement FCMP for fp16 These were missed out from the rest of the half-precision work. Backports commit 7a1929256ea1a03df12625e75ed571c60dca5bfb from qemu	2018-05-15 22:24:39 -04:00
Richard Henderson	8436080518	target/arm: Implement FCVT (scalar, integer) for fp16 Backports commit 564a0632504fad840491aa9a59453f4e64a316c4 from qemu	2018-05-15 22:06:49 -04:00
Richard Henderson	67740bbc7f	target/arm: Fix float16 to/from int16 The instruction "ucvtf v0.4h, v0.4h, #2", with input 0x8000u, overflows the intermediate float16 to infinity before we have a chance to scale the output. Use float64 as the intermediate type so that no input argument (uint32_t in this case) can overflow or round before scaling. Given the declared argument, the signed int32_t function has the same problem. When converting from float16 to integer, using u/int32_t instead of u/int16_t means that the bounding is incorrect. Backports commit 88808a022c06f98d81cd3f2d105a5734c5614839 from qemu	2018-05-14 08:41:20 -04:00
Richard Henderson	688d0fd0ed	target/arm: Implement CAS and CASP Backports commit 44ac14b06fa33f60982923b6b8a3bf8dd2fea61d from qemu	2018-05-14 08:28:45 -04:00
Richard Henderson	de1708aadc	tcg: Introduce atomic helpers for integer min/max Given that this atomic operation will be used by both risc-v and aarch64, let's not duplicate code across the two targets. Backports commit 5507c2bf35aa6b4705939349184e71afd5e058b2 from qemu	2018-05-14 08:06:42 -04:00
Richard Henderson	eef66443b2	tcg: Introduce helpers for integer min/max These operations are re-invented by several targets so far. Several supported hosts have insns for these, so place the expanders out-of-line for a future introduction of tcg opcodes. Backports commit b87fb8cd9f9a0ba599ff79e7bf03222da02e5724 from qemu	2018-05-14 07:31:50 -04:00
Richard Henderson	2150745db4	tcg: Improve TCGv_ptr support Drop TCGV_PTR_TO_NAT and TCGV_NAT_TO_PTR internal macros. Add tcg_temp_local_new_ptr, tcg_gen_brcondi_ptr, tcg_gen_ext_i32_ptr, tcg_gen_trunc_i64_ptr, tcg_gen_extu_ptr_i64, tcg_gen_trunc_ptr_i32. Use inlines instead of macros where possible. Backports commit 5bfa803448638a45542441fd6b7cc1241403ea72 from qemu	2018-05-03 15:05:43 -04:00
Aaron Lindsay	99a0be89a8	target/arm: Add pre-EL change hooks Because the design of the PMU requires that the counter values be converted between their delta and guest-visible forms for mode filtering, an additional hook which occurs before the EL is changed is necessary. Backports commit b5c53d1b3886387874f8c8582b205aeb3e4c3df6 from qemu	2018-04-26 09:21:54 -04:00
Peter Maydell	7a3ee5fd95	target/arm: Honour MDCR_EL2.TDE when routing exceptions due to BKPT/BRK The MDCR_EL2.TDE bit allows the exception level targeted by debug exceptions to be set to EL2 for code executing at EL0. We handle this in the arm_debug_target_el() function, but this is only used for hardware breakpoint and watchpoint exceptions, not for the exception generated when the guest executes an AArch32 BKPT or AArch64 BRK instruction. We don't have enough information for a translate-time equivalent of arm_debug_target_el(), so instead make BKPT and BRK call a special purpose helper which can do the routing, rather than the generic exception_with_syndrome helper. Backports commit c900a2e62dd6dde11c8f5249b638caad05bb15be from qemu	2018-03-25 16:33:04 -04:00
Bharata B Rao	309b85548f	cpu: Convert cpu_index into a bitmap Currently CPUState::cpu_index is monotonically increasing and a newly created CPU always gets the next higher index. The next available index is calculated by counting the existing number of CPUs. This is fine as long as we only add CPUs, but there are architectures which are starting to support CPU removal, too. For an architecture like PowerPC which derives its CPU identifier (device tree ID) from cpu_index, the existing logic of generating cpu_index values causes problems. With the currently proposed method of handling vCPU removal by parking the vCPU fd in QEMU (Ref: http://lists.gnu.org/archive/html/qemu-devel/2015-02/msg02604.html), generating cpu_index this way will not work for PowerPC. This patch changes the way cpu_index is handed out by maintaining a bit map of the CPUs that tracks both addition and removal of CPUs. The CPU bitmap allocation logic is part of cpu_exec_init(), which is called by instance_init routines of various CPU targets. Newly added cpu_exec_exit() API handles the deallocation part and this routine is called from generic CPU instance_finalize. Note: This new CPU enumeration is for !CONFIG_USER_ONLY only. CONFIG_USER_ONLY continues to have the old enumeration logic. Backports commit b7bca7333411bd19c449147e8202ae6b0e4a8e09 from qemu	2018-03-21 08:06:07 -04:00
Igor Mammedov	f8eeacb280	Use cpu_create(type) instead of cpu_init(cpu_model) With all targets defining CPU_RESOLVING_TYPE, refactor cpu_parse_cpu_model(type, cpu_model) to parse_cpu_model(cpu_model) so that callers won't have to know the internal resolving cpu type. Place it in exec.c so it can be called from both the target-independent vl.c and *-user/main.c. That allows us to stop abusing the cpu type from MachineClass::default_cpu_type as the resolver class in vl.c, which was a confusing part of cpu_parse_cpu_model(). Also, with the new parse_cpu_model(), the last users of cpu_init() in null-machine.c and the bsd/linux-user targets can be switched to the cpu_create() API, and the cpu_init() API will be removed by a follow-up patch. With no users left, remove the MachineState::cpu_model field; new code should use MachineState::cpu_type instead and leave cpu_model parsing to generic code in vl.c. Backports commit 2278b93941d42c30e2950d4b8dff4943d064e7de from qemu	2018-03-20 14:20:30 -04:00
Peter Xu	a6ee6f1a87	qobject: introduce qobject_get_try_str() A quick way to fetch string from qobject when it's a QString. Backports commit b26ae1cb8eb0756524e322169138830b9b542311 from qemu	2018-03-20 11:10:03 -04:00
Peter Xu	6446b66dc7	qobject: introduce qstring_get_try_str() The only difference from qstring_get_str() is that it allows the qstring to be NULL. If so, NULL is returned. Backports commit 775932020dd6bd7e9c1acc0d7779677d8b4c094c from qemu	2018-03-20 11:08:40 -04:00
Marc-André Lureau	910d50be6b	qlit: add qobject_from_qlit() Instantiate a QObject* from a literal QLitObject. QLitObject only supports int64_t for now. uint64_t and double aren't implemented. Backports commit 3cf42b8b3af1bd61e736a9ca0f94806c7931ae56 from qemu	2018-03-20 10:30:41 -04:00
Eduardo Habkost	074865ff98	cpu: Generify CPU init functions Backports commits 2994fd96d986578a342f2342501b4ad30f6d0a85, 701e3c78ce45fa630ffc6826c4b9a4218954bc7f, and d1853231c60d16af78cf4d1608d043614bfbac0b from qemu	2018-03-20 08:21:51 -04:00
Peter Crosthwaite	ce1831bfb4	target-*: Don't redefine cpu_exec() This function needs to be converted to QOM hook and virtualised for multi-arch. This rename interferes, as cpu-qom will not have access to the renaming causing name divergence. This rename doesn't really do anything anyway so just delete it. Backports commit 8642c1b81e0418df066a7960a7426d85a923a253 from qemu	2018-03-20 07:02:47 -04:00
Paolo Bonzini	37117a74ed	qom: introduce object_class_get_list_sorted Unify half a dozen copies of very similar code (the only difference being whether comparisons were case-sensitive) and use it also in Tricore, which did not do any sorting of CPU model names. Backports commit 47c66009ab793241e8210b3018c77a9ce9506aa8 from qemu	2018-03-17 19:16:25 -04:00
Kevin Wolf	025e354370	qdict: Introduce qdict_rename_keys() A few block drivers will need to rename .bdrv_create options for their QAPIfication, so let's have a helper function for that. Backports commit bcebf102ccc3c6db327f341adc379fdf0673ca6b from qemu	2018-03-12 10:11:48 -04:00
Lioncash	a81439c7ca	exec: Drop unnecessary code for unicorn The dirty memory code isn't strictly necessary	2018-03-12 10:11:46 -04:00
Alexey Kardashevskiy	b90333a531	memory: Share special empty FlatView This shares a cached empty FlatView among address spaces. The empty FV is used every time a root MR renders into a FV without memory sections, which happens when the MR or its children are not enabled or are zero-sized. The empty_view is not NULL to keep the rest of the memory API intact; it also has a dispatch tree for the same reason. On a POWER8 guest with 255 CPUs, 255 virtio-net devices and 40 PCI bridges, this halves the number of FlatViews in use (557 -> 260) and dispatch tables (~800000 -> ~370000). In an unrelated experiment with 112 non-virtio devices on x86 ("-M pc"), only 4 FlatViews are alive, and about 2000 are created at startup. Backports commit 092aa2fc65b7a35121616aad8f39d47b8f921618 from qemu	2018-03-11 22:34:28 -04:00
Alexey Kardashevskiy	1fd8b64072	memory: Get rid of address_space_init_shareable Since FlatViews are shared now and ASes not, this gets rid of address_space_init_shareable(). This should cause no behavioural change. Backports commit b516572f31c0ea0937cd9d11d9bd72dd83809886 from qemu	2018-03-11 22:12:38 -04:00
Alexey Kardashevskiy	d9bc1bcc8c	memory: Rename mem_begin/mem_commit/mem_add helpers This renames some helpers to reflect better what they do. This should cause no behavioural change. Backports commit 8629d3fcb77e9775e44d9051bad0fb5187925eae from qemu	2018-03-11 21:36:50 -04:00
Alexey Kardashevskiy	aa2b76b4e8	memory: Switch memory from using AddressSpace to FlatView FlatView's will be shared between AddressSpace's and subpage_t and MemoryRegionSection cannot store AS anymore, hence this change. In particular, for: typedef struct subpage_t { MemoryRegion iomem; - AddressSpace *as; + FlatView *fv; hwaddr base; uint16_t sub_section[]; } subpage_t; struct MemoryRegionSection { MemoryRegion *mr; - AddressSpace *address_space; + FlatView *fv; hwaddr offset_within_region; Int128 size; hwaddr offset_within_address_space; bool readonly; }; This should cause no behavioural change. Backports commit 166206845f7fd75e720e6feea0bb01957c8da07f from qemu	2018-03-11 21:21:37 -04:00
Lioncash	1591f208c0	memory: Move AddressSpaceDispatch from AddressSpace to FlatView As we are going to share FlatView's between AddressSpace's, and AddressSpaceDispatch is a structure to perform quick lookup in FlatView, this moves ASD to FlatView. After previously open coded ASD rendering, we can also remove as->next_dispatch as the new FlatView pointer is stored on a stack and set to an AS atomically. flatview_destroy() is executed under RCU instead of address_space_dispatch_free() now. This makes mem_begin/mem_commit to work with ASD and mem_add with FV as later on mem_add will be taking FV as an argument anyway. This should cause no behavioural change. Backports commit 66a6df1dc6d5b28cc3e65db0d71683fbdddc6b62 from qemu	2018-03-11 20:40:24 -04:00
Eduardo Habkost	a7f59d7771	Use DEFINE_MACHINE() to register all machines Convert all machines to use DEFINE_MACHINE() instead of QEMUMachine automatically using a script. Backports commit e264d29de28c5b0be3d063307ce9fb613b427cc3 from qemu	2018-03-11 15:12:46 -04:00
Laurent Vivier	5fa3a97549	softfloat: use floatx80_infinity in softfloat Since f3218a8 ("softfloat: add floatx80 constants") floatx80_infinity is defined but never used. This patch updates floatx80 functions to use this definition. This allows to define a different default Infinity value on m68k: the m68k FPU defines infinity with all bits set to zero in the mantissa. Backports commit 0f605c889ca3fe9744166ad4149d0dff6dacb696 from qemu	2018-03-09 01:34:45 -05:00
Richard Henderson	abd86b2287	target/arm: Decode aa64 armv8.3 fcmla Backports commit d17b7cdcf4ea3e858ceee8b86fc8544bb71561e6 from qemu Also remember to commit vec_helper.	2018-03-09 01:05:02 -05:00
Richard Henderson	4b39a36416	target/arm: Decode aa64 armv8.3 fcadd Backports commit 1695cd61b08d4376c11e0658836c4f08b4fc3aa1 from qemu	2018-03-09 00:58:37 -05:00
Lioncash	12fd2cc113	target/arm: Decode aa64 armv8.1 three same extra	2018-03-09 00:10:09 -05:00
Richard Henderson	4f585f71fb	target/arm: Decode aa64 armv8.1 scalar three same extra Backports commit d9061ec3d27eb940402a7eafee3fb77ce1146ad4 from qemu	2018-03-09 00:02:23 -05:00
Alex Bennée	068143595e	arm/helper.c: re-factor rsqrte and add rsqrte_f16 Much like recpe the ARM ARM has simplified the pseudo code for the calculation, which is done using fixed-point 9-bit integer maths. So while adding f16 we can also clean this up to be a little less heavy on the floating point and just return the fractional part and leave the callers to do the final packing of the result. Backports commit d719cbc7641991d16b891ffbbfc3a16a04e37b9a from qemu Also removes a load of symbols that seem unnecessary from the header_gen script	2018-03-08 22:42:04 -05:00
Alex Bennée	fdb07713e6	arm/translate-a64: add FP16 FSQRT to simd_two_reg_misc_fp16 Backports commit b96a54c7e5576bd35b7d00d37b7929d2892d8cac from qemu	2018-03-08 21:57:35 -05:00
Alex Bennée	5f3864c2c2	arm/helper.c: re-factor recpe and add recepe_f16 It looks like the ARM ARM has simplified the pseudo code for the calculation, which is done using fixed-point 9-bit integer maths. So while adding f16 we can also clean this up to be a little less heavy on the floating point and just return the fractional part and leave the callers to do the final packing of the result. Backports commit 5eb70735af1c0b607bf2671a53aff3710cc1672f from qemu	2018-03-08 19:05:48 -05:00
Alex Bennée	7161c1ed52	arm/translate-a64: add FP16 SCVTF/UCVFT to simd_two_reg_misc_fp16	2018-03-08 18:48:25 -05:00
Alex Bennée	39a68548d1	arm/translate-a64: add FCVTxx to simd_two_reg_misc_fp16 This covers all the floating point convert operations. Backports commit 2df581304193d70eaf0d22cf4cb4613f74b6e59b from qemu	2018-03-08 18:25:29 -05:00
Alex Bennée	d5f002b39a	arm/translate-a64: add FP16 FPRINTx to simd_two_reg_misc_fp16 This adds the full range of half-precision floating point to integral instructions. Backports commit 6109aea2d954891027acba64a13f1f1c7463cfac from qemu	2018-03-08 18:21:58 -05:00
Alex Bennée	82ffaab7de	arm/translate-a64: add FP16 x2 ops for simd_indexed A bunch of the vectorised bitwise operations just operate on larger chunks at a time. We can do the same for the new half-precision operations by introducing some TWOHALFOP helpers which work on each half of a pair of half-precision operations at once. Hopefully all this hoop jumping will get simpler once we have generically vectorised helpers here. Backports commit 6089030c7322d8f96b54fb9904e53b0f464bb8fe from qemu	2018-03-08 18:08:39 -05:00
Alex Bennée	4b2577537b	arm/translate-a64: add FP16 FR[ECP/SQRT]S to simd_three_reg_same_fp16 As some of the constants here will also be needed elsewhere (specifically for the upcoming SVE support) we move them out to softfloat.h. Backports commit 026e2d6ef74000afb9049f46add4b94f594c8fb3 from qemu	2018-03-08 15:47:34 -05:00
Alex Bennée	a02b9b81a9	arm/translate-a64: add FP16 FMULA/X/S to simd_three_reg_same_fp16 Backports commit 2deb992b767d28035fac3b374c7730494ff0b43d from qemu Also backports the fp16 changes introduced in commit f566c0474a9b9bbd9ed248607e4007e24d3358c0	2018-03-08 15:42:48 -05:00
Alex Bennée	ba8df54753	arm/translate-a64: add FP16 F[A]C[EQ/GE/GT] to simd_three_reg_same_fp16 These use the generic float16_compare functionality which in turn uses the common float_compare code from the softfloat re-factor. Backports commit d32adeae1a71a8e71374fa48d3d6ab0ad4c23e94 from qemu	2018-03-08 12:59:37 -05:00
Alex Bennée	4a6a41d2c5	arm/translate-a64: add FP16 FADD/FABD/FSUB/FMUL/FDIV to simd_three_reg_same_fp16 The fprintf is only there for debugging as the skeleton is added to, it will be removed once the skeleton is complete. Backports commit 372087348d561e7f4051d7b32609bda417092ddf from qemu	2018-03-08 12:56:15 -05:00
Alex Bennée	af75074fe7	arm/translate-a64: implement half-precision F(MIN|MAX)(V|NMV) This implements the half-precision variants of the across vector reduction operations. This involves a re-factor of the reduction code which more closely matches the ARM ARM order (and handles 8 element reductions). Backports commit 807cdd504283c11addcd7ea95ba594bbddc86fe4 from qemu	2018-03-08 12:49:30 -05:00
Alex Bennée	283abedc68	fpu/softfloat: re-factor sqrt This is a little bit of a departure from softfloat's original approach as we skip the estimate step in favour of a straight iteration. There is a minor optimisation to avoid calculating more bits of precision than we need; however, this still brings a performance drop, especially for float64 operations. Backports commit c13bb2da9eedfbc5886c8048df1bc1114b285fb0 from qemu	2018-03-08 12:23:54 -05:00
Alex Bennée	e2fb4b40c3	fpu/softfloat: re-factor compare The compare function was already expanded from a macro. I keep the macro expansion but move most of the logic into a compare_decomposed. Backports commit 0c4c90929143a530730e2879204a55a30bf63758 from qemu	2018-03-08 12:21:20 -05:00
Alex Bennée	c38b64f8a9	fpu/softfloat: re-factor minmax Let's do the same re-factor treatment for minmax functions. I still use the MACRO trick to expand but now all the checking code is common. Backports commit 89360067071b1844bf745682e18db7dde74cdb8d from qemu	2018-03-08 12:18:35 -05:00
Alex Bennée	9b296329f6	fpu/softfloat: re-factor scalbn This is one of the simpler manipulations you could make to a floating point number. Backports commit 0bfc9f195209593e91a98cf2233753f56a2e5c02 from qemu	2018-03-08 12:16:19 -05:00
Alex Bennée	b389a8c7c4	fpu/softfloat: re-factor int/uint to float These are considerably simpler as the lower order integers can just use the higher order conversion function. As the decomposed fractional part is a full 64 bits, rounding and inexact handling come from the pack functions. Backports commit c02e1fb80b553d47420f7492de4bc590c2461a86 from qemu	2018-03-08 12:13:09 -05:00
Alex Bennée	acb4b1d5b1	fpu/softfloat: re-factor float to int/uint We share the common int64/uint64_pack_decomposed function across all the helpers and simply limit the final result depending on the final size. Backports commit ab52f973a504f8de0c5df64631ba4caea70a7d9e from qemu	2018-03-08 12:07:20 -05:00
Alex Bennée	b82253adce	fpu/softfloat: re-factor round_to_int We can now add float16_round_to_int and use the common round_decomposed and canonicalize functions to have a single implementation for float16/32/64 round_to_int functions. Backports commit dbe4d53a590f5689772b683984588b3cf6df163e from qemu	2018-03-08 11:56:59 -05:00
Alex Bennée	d92d5c6910	fpu/softfloat: re-factor muladd We can now add float16_muladd and use the common decompose and canonicalize functions to have a single implementation for float16/32/64 muladd functions. Backports commit d446830a3aac33e7221e361dad3ab1e1892646cb from qemu	2018-03-08 10:55:40 -05:00
Alex Bennée	5ea008e178	fpu/softfloat: re-factor div We can now add float16_div and use the common decompose and canonicalize functions to have a single implementation for float16/32/64 versions. Backports commit cf07323d494f4bc225e405688c2e455c3423cc40 from qemu	2018-03-08 10:25:07 -05:00
Alex Bennée	2bb86e1efc	fpu/softfloat: re-factor mul We can now add float16_mul and use the common decompose and canonicalize functions to have a single implementation for float16/32/64 versions. Backports commit 74d707e2cc1e406068acad8e5559cd2584b1073a from qemu	2018-03-08 10:21:15 -05:00
Alex Bennée	58defd9bc0	fpu/softfloat: re-factor add/sub We can now add float16_add/sub and use the common decompose and canonicalize functions to have a single implementation for float16/32/64 add and sub functions. Backports commit 6fff216769cf7eaa3961c85dee7a72838696d365 from qemu	2018-03-08 10:17:41 -05:00
Alex Bennée	8110bc8264	fpu/softfloat: implement float16_squash_input_denormal This will be required when expanding the MINMAX() macro for 16 bit/half-precision operations. Backports commit 210cbd4910ae9e41e0a1785b96890ea2c291b381 from qemu	2018-03-08 09:44:20 -05:00
Paolo Bonzini	c88064b52c	memory: remove memory_region_test_and_clear_dirty It is unused after g364fb has been converted to use DirtyBitmapSnapshot. Backports commit 77302fb5df05ffca9f41b5b54e3b67c601719d57 from qemu	2018-03-08 09:02:06 -05:00
Marc-André Lureau	c51622c4ce	qlit: rename compare_litqobj_to_qobj() to qlit_equal_qobject() compare_litqobj_to_qobj() lacks a qlit_ prefix. Moreover, "compare" suggests -1, 0, +1 for less than, equal and greater than. The function actually returns non-zero for equal, zero for unequal. Rename to qlit_equal_qobject(). Its return type will be cleaned up in the next patch. Backports commit 60cc2eb7afd40b9cbaa35a5e0b54f365ac6e49f1 from qemu	2018-03-07 17:14:55 -05:00
Ard Biesheuvel	85e6d710e4	target/arm: implement SM4 instructions This implements emulation of the new SM4 instructions that have been added as an optional extension to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit b6577bcd251ca0d57ae1de149e3c706b38f21587 from qemu	2018-03-07 08:57:53 -05:00
Ard Biesheuvel	78d15a9cd0	target/arm: implement SM3 instructions This implements emulation of the new SM3 instructions that have been added as an optional extension to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit 80d6f4c6bbb718f343a832df8dee15329cc7686c from qemu	2018-03-07 08:53:47 -05:00
Ard Biesheuvel	0ef74f6d6d	target/arm: implement SHA-512 instructions This implements emulation of the new SHA-512 instructions that have been added as an optional extension to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit 90b827d131812d7f0a8abb13dba1942a2bcee821 from qemu	2018-03-07 08:39:49 -05:00
Richard Henderson	b3e89e9996	tcg/i386: Add vector operations The x86 vector instruction set is extremely irregular. With newer editions, Intel has filled in some of the blanks. However, we don't get many 64-bit operations until SSE4.2, introduced in 2009. The subsequent edition was for AVX1, introduced in 2011, which added three-operand addressing, and adjusts how all instructions should be encoded. Given the relatively narrow 2 year window between possible to support and desirable to support, and to vastly simplify code maintenance, I am only planning to support AVX1 and later cpus. Backports commit 770c2fc7bb70804ae9869995fd02dadd6d7656ac from qemu	2018-03-07 08:07:40 -05:00
Richard Henderson	ac4d051b05	tcg: Add generic vector helpers with a scalar operand Use dup to convert a non-constant scalar to a third vector. Add addition, multiplication, and logical operations with an immediate. Add addition, subtraction, multiplication, and logical operations with a non-constant scalar. Allow for the front-end to build operations in which the scalar operand comes first. Backports commit 22fc3527034678489ec554e82fd52f8a7f05418e from qemu	2018-03-06 16:10:09 -05:00
Richard Henderson	57bdf0faa2	tcg: Add generic helpers for saturating arithmetic No vector ops as yet. SSE only has direct support for 8- and 16-bit saturation; handling 32- and 64-bit saturation is much more expensive. Backports commit f49b12c6e6a75a5bd109bcbbda072b24e5fb8dfd from qemu	2018-03-06 16:10:09 -05:00
Richard Henderson	ab8579123e	tcg: Add generic vector ops for multiplication Backports commit 3774030a3e523689df24a7ed22854ce7a06b0116 from qemu	2018-03-06 16:10:08 -05:00
Richard Henderson	f9c4930ecd	tcg: Add generic vector ops for comparisons Backports commit 212be173f01e85e6589fd76676827953a84a732b from qemu	2018-03-06 16:09:38 -05:00
Richard Henderson	577ee114c3	tcg: Add generic vector ops for constant shifts Opcodes are added for scalar and vector shifts, but considering the varied semantics of these do not expose them to the front ends. Do go ahead and provide them in case they are needed for backend expansion. Backports commit d0ec97967f940bbc11dced83422b39c224127f1e from qemu	2018-03-06 14:03:30 -05:00
Richard Henderson	64365612bf	tcg: Add generic vector expanders Backports commit db432672dc50ed86dda17ac821b7eb07411a90af from qemu	2018-03-06 13:42:52 -05:00
Richard Henderson	b9cd924fa5	tcg: Add types and basic operations for host vectors Nothing uses or enables them yet. Backports commit d2fd745fe8b9ac574d28b7ac63c39f6529749bd2 from qemu	2018-03-06 12:13:32 -05:00
Richard Henderson	7fe5f620df	tcg: Dynamically allocate TCGOps With no fixed array allocation, we can't overflow a buffer. This will be important as optimizations related to host vectors may expand the number of ops used. Use QTAILQ to link the ops together. Backports commit 15fa08f8451babc88d733bd411d4c94976f9d0f8 from qemu	2018-03-05 16:34:40 -05:00
Marc-André Lureau	ffa45adb57	memory: remove unused memory_region_set_global_locking() This was never used since its introduction in commit 196ea13104f8 ("memory: Add global-locking property to memory regions"). Backports commit e2fbe20851ceec5ccd7b539a89db0420393fb85d from qemu	2018-03-05 14:14:43 -05:00
Peter Maydell	8fe6b6c308	target/arm: Implement TT instruction Implement the TT instruction which queries the security state and access permissions of a memory location. Backports commit 5158de241b0fb344a6c948dfcbc4e611ab5fafbe from qemu	2018-03-05 13:48:31 -05:00
Peter Maydell	e312993f1f	target/arm: Implement BLXNS Implement the BLXNS instruction, which allows secure code to call non-secure code. Backports commit 3e3fa230e3b8ffe119f14ba57a6bc677a411be57 from qemu	2018-03-05 03:31:59 -05:00
Peter Maydell	c7b5fccfb8	target/arm: Prepare for CONTROL.SPSEL being nonzero in Handler mode In the v7M architecture, there is an invariant that if the CPU is in Handler mode then the CONTROL.SPSEL bit cannot be nonzero. This in turn means that the current stack pointer is always indicated by CONTROL.SPSEL, even though Handler mode always uses the Main stack pointer. In v8M, this invariant is removed, and CONTROL.SPSEL may now be nonzero in Handler mode (though Handler mode still always uses the Main stack pointer). In preparation for this change, change how we handle this bit: rename switch_v7m_sp() to the now more accurate write_v7m_control_spsel(), and make it check both the handler mode state and the SPSEL bit. Note that this implicitly changes the point at which we switch active SP on exception exit from before we pop the exception frame to after it. Backports commit de2db7ec894f11931932ca78cd14a8d2b1389d5b from qemu	2018-03-05 01:29:54 -05:00
Peter Xu	0741c3880a	qom: provide root container for internal objs We have object_get_objects_root() to keep user created objects, however no place for objects that will be used internally. Create such a container for internal objects. Backports commit 7c47c4ead75d0b733ee8f2f51fd1de0644cc1308 from qemu	2018-03-05 01:16:50 -05:00
Richard Henderson	7b68a8f0ca	tcg: Add tcg_op_supported Backports commit be0f34b5840312bbe9627c2b9f68a25f32903dae from qemu	2018-03-04 23:20:28 -05:00
Richard Henderson	31b8b67cd3	tcg: Move USE_DIRECT_JUMP discriminator to tcg/cpu/tcg-target.h Replace the USE_DIRECT_JUMP ifdef with a TCG_TARGET_HAS_direct_jump boolean test. Replace the tb_set_jmp_target1 ifdef with an unconditional function tb_target_set_jmp_target. While we're touching all backends, add a parameter for tb->tc_ptr; we're going to need it shortly for some backends. Move tb_set_jmp_target and tb_add_jump from exec-all.h to cpu-exec.c. Backports commit a85833933628384d74ec412024d55cf012640287 from qemu	2018-03-04 21:52:35 -05:00
Peter Maydell	2070ef1c37	boards.h: Define new flag ignore_memory_transaction_failures Define a new MachineClass field ignore_memory_transaction_failures. If this flag is true then the CPU will ignore memory transaction failures which should cause the CPU to take an exception due to an access to an unassigned physical address; the transaction will instead return zero (for a read) or be ignored (for a write). This should be set only by legacy board models which rely on the old RAZ/WI behaviour for handling devices that QEMU does not yet model. New board models should instead use "unimplemented-device" for all memory ranges where the guest will attempt to probe for a device that QEMU doesn't implement and a stub device is required. We need this for ARM boards, where we're about to implement support for generating external aborts on memory transaction failures. Too many of our legacy board models rely on the RAZ/WI behaviour and we would break currently working guests when their "probe for device" code provoked an external abort rather than a RAZ. Backports commit ed860129acd3fcd0b1e47884e810212aaca4d21b from qemu	2018-03-04 21:27:15 -05:00
Peter Maydell	4b816fe0aa	target/arm: Implement BXNS, and banked stack pointers Implement the BXNS v8M instruction, which is like BX but will do a jump-and-switch-to-NonSecure if the branch target address has bit 0 clear. This is the first piece of code which implements "switch to the other security state", so the commit also includes the code to switch the stack pointers around, which is the only complicated part of switching security state. BLXNS is more complicated than just "BXNS but set the link register", so we leave it for a separate commit. Backports commit fb602cb726b3ebdd01ef3b1732d74baf9fee7ec9 from qemu	2018-03-04 21:21:23 -05:00
Lluís Vilanova	74d437827b	target/arm: [tcg] Port to generic translation framework Backports commit 2316922420da6fd0d1ffb5557d0cdcc5958bcf44 from qemu	2018-03-04 20:28:06 -05:00
Lluís Vilanova	ed7225e685	tcg: Add generic translation framework Backports commit bb2e0039dc07177f928f9fe24758967da02d60a2 from qemu	2018-03-04 14:31:16 -05:00
Peter Maydell	3bd5694a0a	memory: Rename memory_region_init_rom() and _rom_device() to _nomigrate() Rename memory_region_init_rom() to memory_region_init_rom_nomigrate() and memory_region_init_rom_device() to memory_region_init_rom_device_nomigrate(). Backports commit b59821a95bd1d7cb4697fd7748725c910582e0e7 from qemu	2018-03-03 22:29:01 -05:00
Peter Maydell	7b0027a828	memory: Rename memory_region_init_ram() to memory_region_init_ram_nomigrate() Rename memory_region_init_ram() to memory_region_init_ram_nomigrate(). This leaves the way clear for us to provide a memory_region_init_ram() which does handle migration. Backports commit 1cfe48c1ce219b60a9096312f7a61806fae64ab3 from qemu	2018-03-03 22:25:39 -05:00
Thomas Huth	cf5d583ef0	cpu: Introduce a wrapper for tlb_flush() that can be used in common code Commit 1f5c00cfdb8114c ("qom/cpu: move tlb_flush to cpu_common_reset") moved the call to tlb_flush() from the target-specific reset handlers into the common code qom/cpu.c file, and protected the call with "#ifdef CONFIG_SOFTMMU" to avoid that it is called for linux-user only targets. But since qom/cpu.c is common code, CONFIG_SOFTMMU is never defined here, so the tlb_flush() was simply never executed anymore. Fix it by introducing a wrapper for tlb_flush() in a file that is re-compiled for each target, i.e. in translate-all.c. Backports commit 2cd53943115be5118b5b2d4b80ee0a39c94c4f73 from qemu	2018-03-03 21:24:55 -05:00
Lioncash	0ef338aa71	Fix building for multi-arch targets	2018-03-03 21:14:08 -05:00
Emilio G. Cota	d3ada2feb5	tcg: allocate TB structs before the corresponding translated code Allocating an arbitrarily-sized array of tbs results in either (a) a lot of memory wasted or (b) unnecessary flushes of the code cache when we run out of TB structs in the array. An obvious solution would be to just malloc a TB struct when needed, and keep the TB array as an array of pointers (recall that tb_find_pc() needs the TB array to run in O(log n)). Perhaps a better solution, which is implemented in this patch, is to allocate TB's right before the translated code they describe. This results in some memory waste due to padding to have code and TBs in separate cache lines--for instance, I measured 4.7% of padding in the used portion of code_gen_buffer when booting aarch64 Linux on a host with 64-byte cache lines. However, it can allow for optimizations in some host architectures, since TCG backends could safely assume that the TB and the corresponding translated code are very close to each other in memory. See this message by rth for a detailed explanation: https://lists.gnu.org/archive/html/qemu-devel/2017-03/msg05172.html Subject: Re: GSoC 2017 Proposal: TCG performance enhancements Backports commit 6e3b2bfd6af488a896f7936e99ef160f8f37e6f2 from qemu	2018-03-03 17:05:49 -05:00
Emilio G. Cota	8f4f15e5f5	tcg: Introduce goto_ptr opcode and tcg_gen_lookup_and_goto_ptr Instead of exporting goto_ptr directly to TCG frontends, export tcg_gen_lookup_and_goto_ptr(), which calls goto_ptr with the pointer returned by the lookup_tb_ptr() helper. This is the only use case we have for goto_ptr and lookup_tb_ptr, so having this function is very convenient. Furthermore, it trivially allows us to avoid calling the lookup helper if goto_ptr is not implemented by the backend. Backports commit cedbcb01529cb6cf9a2289cdbebbc63f6149fc18 from qemu	2018-03-02 21:05:18 -05:00
Dr. David Alan Gilbert	55d79cf4c0	RAMBlocks: qemu_ram_is_shared Provide a helper to say whether a RAMBlock was created as a shared mapping. Backports commit 463a4ac23bcf0f0b65c850fa66f5ae6e43edd243 from qemu	2018-03-02 13:05:35 -05:00
Lioncash	18a229a69f	Resolve symbol errors with softfloat	2018-03-02 09:25:05 -05:00
KONRAD Frederic	c5730ff194	tcg: add options for enabling MTTCG We know there will be cases where MTTCG won't work until additional work is done in the front/back ends to support. It will however be useful to be able to turn it on. As a result MTTCG will default to off unless the combination is supported. However the user can turn it on for the sake of testing. Backports commit 8d4e9146b3568022ea5730d92841345d41275d66 from qemu	2018-03-02 09:25:01 -05:00
Julian Brown	cc217b0c90	arm: Correctly handle watchpoints for BE32 CPUs In BE32 mode, sub-word size watchpoints can fail to trigger because the address of the access is adjusted in the opcode helpers before being compared with the watchpoint registers. This patch reverses the address adjustment before performing the comparison with the help of a new CPUClass hook. This version of the patch augments and tidies up comments a little. Backports commit 40612000599e52e792d23c998377a0fa429c4036 from qemu	2018-03-02 00:24:33 -05:00
Jean-Christophe DUBOIS	0aa0b849c2	ARM: Factor out ARM on/off PSCI control functions Split ARM on/off function from PSCI support code. This will allow to reuse these functions in other code. Backports commit 825482adde1f971cbddf27e15fb4453ab3fae994 from qemu	2018-03-01 23:31:47 -05:00
Richard Henderson	4bec129626	tcg/i386: Handle ctpop opcode Backports commit 993508e43e6d180e9ba9b747a9657eac69aec5bb from qemu	2018-03-01 18:49:43 -05:00
Richard Henderson	5f6e7bbdbd	tcg: Add opcode for ctpop The number of actual invocations of ctpop itself does not warrant an opcode, but it is very helpful for POWER7 to use in generating an expansion for ctz. Backports commit a768e4e99247911f00c5c0267c12d4e207d5f6cc from qemu	2018-03-01 18:26:41 -05:00
Richard Henderson	01b3c6273a	target-arm: Use clrsb helper Backports commit bc21dbcc1203ae6bb536f832c46a3b5e22a73451 from qemu	2018-03-01 18:16:56 -05:00
Richard Henderson	fff7ca4617	tcg: Add helpers for clrsb The number of actual invocations does not warrant an opcode, and the backends generating it. But at least we can eliminate redundant helpers. Backports commit 086920c2c8008f125fd38781072fa25c3ad158ea from qemu	2018-03-01 18:14:11 -05:00
Richard Henderson	9cde8bfc44	target-arm: Use clz opcode Backports commit 7539a012f614b724426ac9360238f3281d928a3f from qemu	2018-03-01 16:13:26 -05:00
Richard Henderson	2cf34e1b55	tcg: Add clz and ctz opcodes Backports commit 0e28d0063bbd9e59a981ea2d20f82f30c5d956a8 from qemu	2018-03-01 16:04:11 -05:00
Richard Henderson	9f2fcaaf27	tcg: Add deposit_z expander While we don't require a new opcode, it is handy to have an expander that knows the first source is zero. Backports commit 07cc68d52852bf47dea7c402b46ddd28248d4212 from qemu	2018-03-01 13:29:24 -05:00
Richard Henderson	8e0585dcb1	tcg: Add field extraction primitives Adds tcg_gen_extract_* and tcg_gen_sextract_* for extraction of fixed position bitfields, much like we already have for deposit. Backports commit 7ec8bab3deae643b1ce579c2d65a244f30708330 from qemu	2018-03-01 13:21:30 -05:00
Jason Wang	fdca6292a1	exec: introduce address_space_get_iotlb_entry() This patch introduces a helper to query the iotlb entry for a possible iova. This will be used by later device IOTLB API to enable the capability for a dataplane (e.g vhost) to query the IOTLB. Backports commit 052c8fa9983f553fdfa0d61034774070dd639c2b from qemu	2018-03-01 13:05:08 -05:00
Paolo Bonzini	81ad780e5e	exec: introduce MemoryRegionCache Device models often have to perform multiple accesses to a single memory region that is known in advance, but would like to use "DMA-style" functions instead of address_space_map/unmap. This can happen for example when the data has to undergo endianness conversion. Introduce a new data structure to cache the result of address_space_translate without forcing usage of a host address like address_space_map does. Backports commit 1f4e496e1fc2eb6c8bf377a0f9695930c380bfd3 from qemu	2018-03-01 10:50:30 -05:00
Richard Henderson	f5a35908da	tcg: Add tcg_gen_mulsu2_{i32,i64,tl} This multiply has one signed input and one unsigned input, producing the full double-width result. Backports commit 5087abfb7dfd1d368ae6939420057036b4d8e509 from qemu	2018-03-01 08:39:37 -05:00
Emilio G. Cota	cb92eea81a	target-arm: emulate aarch64's LL/SC using cmpxchg helpers Emulating LL/SC with cmpxchg is not correct, since it can suffer from the ABA problem. Portable parallel code, however, is written assuming only cmpxchg--and not LL/SC--is available. This means that in practice emulating LL/SC with cmpxchg is a viable alternative. The appended emulates LL/SC pairs in aarch64 with cmpxchg helpers. This works in both user and system mode. In usermode, it avoids pausing all other CPUs to perform the LL/SC pair. The subsequent performance and scalability improvement is significant, as the plots show. They plot the throughput of atomic_add-bench compiled for ARM and executed on a 64-core x86 machine. Hi-res plots: http://imgur.com/a/JVc8Y [ASCII plots omitted: atomic_add-bench, 1000000 ops/thread, for the ranges [0,1], [0,2], [0,128] and [0,1024]; x-axis: number of threads (0 to 60); series: cmpxchg and master.] Backports commit 1dd089d0eec060dcd8478735114d98421d414805 from qemu	2018-02-28 00:21:27 -05:00
Richard Henderson	064543a415	tcg: Add CONFIG_ATOMIC64 Allow qemu to build on 32-bit hosts without 64-bit atomic ops. Even if we only allow 32-bit hosts to multi-thread emulate 32-bit guests, we still need some way to handle the 32-bit guest using a 64-bit atomic operation. Do so by dropping back to single-step. Backports commit df79b996a7b21c6ea7847f7927a2e1a294b86c72 from qemu	2018-02-27 22:25:36 -05:00
Richard Henderson	da01e53757	tcg: Add atomic128 helpers Force the use of cmpxchg16b on x86_64. Wikipedia suggests that only very old AMD64 (circa 2004) did not have this instruction. Further, it's required by Windows 8 so no new cpus will ever omit it. If we truly care about these, then we could check this at startup time and then avoid executing paths that use it. Backports commit 7ebee43ee3e2fcd7b5063058b7ef74bc43216733 from qemu	2018-02-27 21:43:48 -05:00
Richard Henderson	5c0ce1b99c	tcg: Add atomic helpers Add all of cmpxchg, op_fetch, fetch_op, and xchg. Handle both endian-ness, and sizes up to 8. Handle expanding non-atomically, when emulating in serial. Backports commit c482cb117cc418115ca9c6d21a7a2315414c0a40 from qemu	2018-02-27 15:57:47 -05:00
Yongbok Kim	79e4c001a9	softmmu: Add probe_write() Probe for whether the specified guest write access is permitted. If it is not permitted then an exception will be taken in the same way as if this were a real write access (and we will not return). Otherwise the function will return, and there will be a valid entry in the TLB for this access. Backports commit 3b4afc9e75ab1a95f33e41f462921093f8a109c4 from qemu	2018-02-27 12:20:50 -05:00
Richard Henderson	e35aacd5ae	tcg: Add EXCP_ATOMIC When we cannot emulate an atomic operation within a parallel context, this exception allows us to stop the world and try again in a serial context. Backports commit fdbc2b5722f6092e47181a947c90fd4bdcc1c121 from qemu Also backports parts of commit 02d57ea115b7669f588371c86484a2e8ebc369be	2018-02-27 11:57:58 -05:00
Daniel P. Berrange	83a5bf2d25	qapi: rename QmpOutputVisitor to QObjectOutputVisitor The QmpOutputVisitor has no direct dependency on QMP. It is valid to use it anywhere that one wants a QObject. Rename it to better reflect its functionality as a generic QAPI to QObject converter. The commit before previous renamed the files, this one renames C identifiers. Backports commit 7d5e199ade76c53ec316ab6779800581bb47c50a from qemu	2018-02-27 08:05:33 -05:00
Daniel P. Berrange	2949a90977	qapi: rename QmpInputVisitor to QObjectInputVisitor The QmpInputVisitor has no direct dependency on QMP. It is valid to use it anywhere that one has a QObject. Rename it to better reflect its functionality as a generic QObject to QAPI converter. The previous commit renamed the files, this one renames C identifiers. Backports commit 09e68369a88d7de0f988972bf28eec1b80cc47f9 from qemu	2018-02-26 15:54:15 -05:00
Peter Maydell	db8b0a82b1	cpu: Support a target CPU having a variable page size Support target CPUs having a page size which isn't known at compile time. To use this, the CPU implementation should: * define TARGET_PAGE_BITS_VARY * not define TARGET_PAGE_BITS * define TARGET_PAGE_BITS_MIN to the smallest value it might possibly want for TARGET_PAGE_BITS * call set_preferred_target_page_bits() in its realize function to indicate the actual preferred target page size for the CPU (and report any error from it) In CONFIG_USER_ONLY, the CPU implementation should continue to define TARGET_PAGE_BITS appropriately for the guest OS page size. Machines which want to take advantage of having the page size something larger than TARGET_PAGE_BITS_MIN must set the MachineClass minimum_page_bits field to a value which they guarantee will be no greater than the preferred page size for any CPU they create. Note that changing the target page size by setting minimum_page_bits is a migration compatibility break for that machine. For debugging purposes, attempts to use TARGET_PAGE_SIZE before it has been finally confirmed will assert. Backports commit 20bccb82ff3ea09bcb7c4ee226d3160cab15f7da from qemu	2018-02-26 12:29:08 -05:00
Vijaya Kumar K	a7229cc08a	translate-all.c: Compute L1 page table properties at runtime Remove L1 page mapping table properties computing statically using macros which is dependent on TARGET_PAGE_BITS. Drop macros V_L1_SIZE, V_L1_SHIFT, V_L1_BITS macros and replace with variables which are computed at early stage of VM boot. Removing dependency can help to make TARGET_PAGE_BITS dynamic. Backports commit 66ec9f49399f0a9fa13ee77c472caba0de2773fc from qemu	2018-02-26 11:46:58 -05:00
Thomas Hanson	2af4ca54e9	target-arm: Infrastructure changes to enable handling of tagged address loading into PC When capturing the current CPU state for the TB, extract the TBI0 and TBI1 values from the correct TCR for the current EL and then add them to the TB flags field. Then, at the start of code generation for the block, copy the TBI fields into the DisasContext structure. Backports commit 86fb3fa4ed5873b021a362ea26a021f4aeab1bb4 from qemu	2018-02-26 07:58:17 -05:00
Pranith Kumar	5e44ce9be8	Introduce TCGOpcode for memory barrier This commit introduces the TCGOpcode for memory barrier instruction. This opcode takes an argument which is the type of memory barrier which should be generated. Backports commit f65e19bc2c9e8358e634d309606144ac2a3c2936 from qemu	2018-02-26 03:02:41 -05:00
Alex Williamson	5db45219c9	memory: Replace skip_dump flag with ram_device Setting skip_dump on a MemoryRegion allows us to modify one specific code path, but the restriction we're trying to address encompasses more than that. If we have a RAM MemoryRegion backed by a physical device, it not only restricts our ability to dump that region, but also affects how we should manipulate it. Here we recognize that MemoryRegions do not change to sometimes allow dumps and other times not, so we replace setting the skip_dump flag with a new initializer so that we know exactly the type of region to which we're applying this behavior. Backports commit ca83f87a66d19fdaabf23d4f5ebb49396fe232c1 from qemu	2018-02-25 23:00:45 -05:00
Richard Henderson	ede1cae3dc	tcg: Lower indirect registers in a separate pass Rather than rely on recursion during the middle of register allocation, lower indirect registers to loads and stores off the indirect base into plain temps. For an x86_64 host, with sufficient registers, this results in identical code, modulo the actual register assignments. For an i686 host, with insufficient registers, this means that temps can be (temporarily) spilled to the stack in order to satisfy an allocation. This as opposed to the possibility of not being able to spill, to allocate a register for the indirect base, in order to perform a spill. Backports commit 5a18407f55ade924aa6397c9a043a9ffd59645fe from qemu	2018-02-25 22:32:28 -05:00
Lioncash	17c54e2702	header_gen: alphabetize general symbols	2018-02-25 19:07:20 -05:00
Lioncash	fa10382007	header_gen: alphabetize aarch64 symbols	2018-02-25 19:00:01 -05:00

350 Commits