unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-10-22 04:48:16 +02:00

Author	SHA1	Message	Date
Richard Henderson	02fd7e2472	target/arm: Decode PAuth within system hint space Backports commit 7c94c8343c6a0eea1633a65ed27987b6a71b9089 from qemu	2019-01-22 15:33:27 -05:00
Richard Henderson	e6ffbc22c2	target/arm: Add PAuth active bit to tbflags There are 5 bits of state that could be added, but to save space within tbflags, add only a single enable bit. Helpers will determine the rest of the state at runtime. Backports commit 0816ef1bfcd3ac53e7454b62ca436727887f6056 from qemu	2019-01-22 15:15:59 -05:00
Richard Henderson	4d8b7a9967	target/arm: Convert ARM_TBFLAG_* to FIELDs Use "register" TBFLAG_ANY to indicate shared state between A32 and A64, and "registers" TBFLAG_A32 & TBFLAG_A64 for fields that are specific to the given cpu state. Move ARM_TBFLAG_BE_DATA to shared state, instead of its current placement within "Bit usage when in AArch32 state". Backports commit aad821ac4faad369fad8941d25e59edf2514246b from qemu	2019-01-13 19:21:18 -05:00
Richard Henderson	8816550c10	target/arm: Implement the ARMv8.1-LOR extension Provide a trivial implementation with zero limited ordering regions, which causes the LDLAR and STLLR instructions to devolve into the LDAR and STLR instructions from the base ARMv8.0 instruction set. Backports commit 2d7137c10fafefe40a0a049ff8a7bd78b66e661f from qemu	2018-12-18 04:36:58 -05:00
Peter Maydell	5aa5ebbcc9	target/arm: Remove can't-happen if() from handle_vec_simd_shli() In handle_vec_simd_shli() we have a check: if (size > 3 && !is_q) { unallocated_encoding(s); return; } However this can never be true, because we calculate int size = 32 - clz32(immh) - 1; where immh is a 4 bit field which we know cannot be all-zeroes. So the clz32() return must be in {28,29,30,31} and the resulting size is in {0,1,2,3}, and "size > 3" is never true. This unnecessary code confuses Coverity's analysis: in CID 1396476 it thinks we might later index off the end of an array because the condition implies that we might have a size > 3. Remove the code, and instead assert that the size is in [0..3], since the decode that enforces that is somewhat distant from this function. Backports commit f6c98f91f56031141a47f86225fdc30f0f9f84fb from qemu	2018-11-11 08:37:16 -05:00
Richard Henderson	985acb9cde	target/arm: Use gvec for NEON_3R_VTST_VCEQ, NEON_3R_VCGT, NEON_3R_VCGE Move cmtst_op expanders from translate-a64.c. Backports commit ea580fa312674c1ba82a8b137caf42b0609ce3e3 from qemu	2018-11-10 11:03:42 -05:00
Richard Henderson	5d9c0e52bf	target/arm: Use gvec for NEON_3R_VML Move mla_op and mls_op expanders from translate-a64.c. Backports commit 4a7832b095b9ce97a815749a13516f5cfb3c5dd4 from qemu	2018-11-10 10:58:44 -05:00
Richard Henderson	79bbb7c730	target/arm: Use gvec for VSRI, VSLI Move shi_op and sli_op expanders from translate-a64.c. Backports commit f3cd8218d1d3e534877ce3f3cb61c6757d10f9df from qemu	2018-11-10 10:53:28 -05:00
Lioncash	edb36c7505	target/arm: Use gvec for VSRA	2018-11-10 10:32:29 -05:00
Richard Henderson	0965b9513a	target/arm: Use gvec for NEON_3R_LOGIC insns Move expanders for VBSL, VBIT, and VBIF from translate-a64.c. Backports commit eabcd6faa90461e0b7463f4ebe75b8d050487c9c from qemu	2018-11-10 10:06:13 -05:00
Richard Henderson	931b49fb06	target/arm: Promote consecutive memory ops for aa64 For a sequence of loads or stores from a single register, little-endian operations can be promoted to an 8-byte op. This can reduce the number of operations by a factor of 8. Backports commit 87f9a7f0c8d5122c36743885158782c2348a6d21 from qemu	2018-11-10 09:46:04 -05:00
Richard Henderson	e6707b900c	target/arm: Use tcg_gen_gvec_dup_i64 for LD[1-4]R Backports commit 10e0b33c676b4e8ac80d5929980f4fa6be617c5a from qemu	2018-11-10 09:41:14 -05:00
Richard Henderson	74aba4ba51	target/arm: Don't call tcg_clear_temp_count This is done generically in translator_loop. Backports commit 7108e255c2d95b44c9dfee8075d0d6fb391281a8 from qemu	2018-11-10 09:40:06 -05:00
Richard Henderson	9bbc412c66	target/arm: Hoist address increment for vector memory ops This can reduce the number of opcodes required for certain complex forms of load-multiple (e.g. ld4.16b). Backports commit a7d8143aed2268f147cc1abfebc848ed6282a313 from qemu	2018-11-10 09:39:20 -05:00
Richard Henderson	03ec90f39b	target/arm: Convert v8.2-fp16 from feature bit to aa64pfr0 test Backports commit 5763190fa8705863b4b725aa1657661a97113eb4 from qemu	2018-11-10 08:34:32 -05:00
Richard Henderson	0286f9525d	target/arm: Convert sve from feature bit to aa64pfr0 test Backports commit cd208a1c3923bc097ec55c5b207d79294ab0e719 from qemu	2018-11-10 08:27:35 -05:00
Richard Henderson	4221703f18	target/arm: Convert v8 extensions from feature bits to isar tests Most of the v8 extensions are self-contained within the ISAR registers and are not implied by other feature bits, which makes them the easiest to convert. Backports commit 962fcbf2efe57231a9f5df0ae0f40c05e35628ba from qemu	2018-11-10 08:17:57 -05:00
Richard Henderson	a37f24aa11	target/arm: Adjust aarch64_cpu_dump_state for system mode SVE Use the existing helpers to determine if (1) the fpu is enabled, (2) sve state is enabled, and (3) the current sve vector length. Backports commit ced3155141755ba244c988c72c4bde32cc819670 from qemu	2018-10-08 14:15:15 -04:00
Lioncash	b62e892b20	mips: Use DisasContext for parameters in place of TCGContext where applicable This is more future-friendly with qemu's main repo, as it's more generic.	2018-10-06 04:37:28 -04:00
Lioncash	47b45f1bc2	arm: Take DisasContext as a parameter instead of TCGContext where applicable This is more future-friendly with qemu, as it's more generic.	2018-10-06 04:17:12 -04:00
Richard Henderson	0136ca773f	target/arm: Fix aa64 FCADD and FCMLA decode These insns require u=1; failed to include that in the switch cases. This probably happened during one of the rebases just before final commit. Fixes: d17b7cdcf4e Backports commit b8a4a96db3639e17ab5e5cdc14fca4b19fbf5b3b from qemu	2018-08-17 14:06:01 -04:00
Richard Henderson	1d3cf8a0b0	target/arm: Dump SVE state if enabled Also fold the FPCR/FPSR state onto the same line as PSTATE, and mention but do not dump disabled FPU state. Backports commit 2bf5f3f91bb4e3faa2a19aec042138a938afbf6a from qemu	2018-08-17 13:52:28 -04:00
Richard Henderson	6d81235ebb	target/arm: Fix SVE system register access checks Leave ARM_CP_SVE, removing ARM_CP_FPU; the sve_access_check produced by the flag already includes fp_access_check. If we also check ARM_CP_FPU the double fp_access_check asserts. Backports commit 11d7870b1b4d038d7beb827f3afa72e284701351 from qemu	2018-07-03 05:07:53 -04:00
Richard Henderson	a325de6685	target/arm: Implement ARMv8.2-DotProd We've already added the helpers with an SVE patch, all that remains is to wire up the aa64 and aa32 translators. Enable the feature within -cpu max for CONFIG_USER_ONLY. Backports commit 26c470a7bb4233454137de1062341ad48947f252 from qemu	2018-07-03 04:55:43 -04:00
Richard Henderson	281deae0a9	target/arm: Pass index to AdvSIMD FCMLA (indexed) For aa64 advsimd, we had been passing the pre-indexed vector. However, sve applies the index to each 128-bit segment, so we need to pass in the index separately. For aa32 advsimd, the fp32 operation always has index 0, but we failed to interpret the fp16 index correctly. Backports commit 2cc99919a81a62589a4a6b0f365eabfead1db1a7 from qemu	2018-07-03 04:27:10 -04:00
Richard Henderson	10e2b13650	tcg: Pass tb and index to tcg_gen_exit_tb separately Do the cast to uintptr_t within the helper, so that the compiler can type check the pointer argument. We can also do some more sanity checking of the index argument. Backports commit 07ea28b41830f946de3841b0ac61a3413679feb9 from qemu	2018-06-07 11:56:32 -04:00
Richard Henderson	49def4bbde	target/arm: Add SVE decode skeleton Including only 4, as-yet unimplemented, instruction patterns so that the whole thing compiles. Backports commit 38388f7ee3adc04a7e7246c04352451c4f8d00fb from qemu	2018-05-20 00:48:14 -04:00
Richard Henderson	d2d8e2fc33	target/arm: Introduce translate-a64.h Move some stuff that will be common to both translate-a64.c and translate-sve.c. Backports commit 8c71baedb8055beaa681823206ee3a74f9f8649a from qemu	2018-05-20 00:34:25 -04:00
Alex Bennée	40d57900bf	target/arm: convert conversion helpers to fpst/ahp_flag Instead of passing env and leaving it up to the helper to get the right fpstatus we pass it explicitly. There was already a get_fpstatus helper for neon for the 32 bit code. We also add a get_ahp_flag() for passing the state of the alternative FP16 format flag. This leaves scope for later tracking the AHP state in translation flags. Backports commit 486624fcd3eaca6165ab8401d73bbae6c0fb81c1 from qemu	2018-05-19 22:58:25 -04:00
Alex Bennée	070276faf6	target/arm: Fix sqrt_f16 exception raising We are meant to explicitly pass fpst, not cpu_env. Backports commit 905edee9101c54cda5b72286b7f7607cf1c3c4d1 from qemu	2018-05-15 22:29:54 -04:00
Alex Bennée	f8e1f71df9	target/arm: Implement FMOV (immediate) for fp16 All the hard work is already done by vfp_expand_imm, we just need to make sure we pick up the correct size. Backports commit 6ba28ddb9be37bdb67e3e38007a53ccbdcd010df from qemu	2018-05-15 22:28:46 -04:00
Alex Bennée	cd76e7aaaa	target/arm: Implement FCSEL for fp16 These were missed out from the rest of the half-precision work. Backports commit ace97feef3613194900d4eb9ffc6819b840fbaeb from qemu	2018-05-15 22:26:53 -04:00
Alex Bennée	80074e4745	target/arm: Implement FCMP for fp16 These were missed out from the rest of the half-precision work. Backports commit 7a1929256ea1a03df12625e75ed571c60dca5bfb from qemu	2018-05-15 22:24:39 -04:00
Richard Henderson	eeab666292	target/arm: Implement FP data-processing (3 source) for fp16 We missed all of the scalar fp16 fma operations. Backports commit 95f9864fde6078e2d2c036a07cc4fe44f199be96 from qemu	2018-05-15 22:19:42 -04:00
Richard Henderson	a614dbb3c7	target/arm: Implement FP data-processing (2 source) for fp16 We missed all of the scalar fp16 binary operations. Backports commit b8f5171cf01420a9f0ee895c5591e9b9914f391a from qemu	2018-05-15 22:14:43 -04:00
Richard Henderson	60dfdb724b	target/arm: Introduce and use read_fp_hreg Backports commit 3d99d931266eaeaf7e83703a53f32232cd6faad7 from qemu	2018-05-15 22:10:51 -04:00
Richard Henderson	9b42d01480	target/arm: Implement FCVT (scalar, fixed-point) for fp16 Backports commit 2752728016bef06e7c9cfb961019272859beeca4 from qemu	2018-05-15 22:08:07 -04:00
Richard Henderson	8436080518	target/arm: Implement FCVT (scalar, integer) for fp16 Backports commit 564a0632504fad840491aa9a59453f4e64a316c4 from qemu	2018-05-15 22:06:49 -04:00
Richard Henderson	75643ab1cf	target/arm: Early exit after unallocated_encoding in disas_fp_int_conv No sense in emitting code after the exception. Backports commit 8c738d430796edeae5e13d6daf0895c02c62bd54 from qemu	2018-05-15 21:55:42 -04:00
Richard Henderson	bcaceb9bc7	target/arm: Implement FMOV (general) for fp16 Adding the fp16 moves to/from general registers. Backports commit 68130236e30a1ec64363f4915349feee181bfbc1 from qemu	2018-05-15 21:54:32 -04:00
Richard Henderson	5902f32abf	target/arm: Clear SVE high bits for FMOV Use write_fp_dreg and clear_vec_high to zero the bits that need zeroing for these cases. Backports commit 9a9f1f59521f46e8ff4527d9a2b52f83577e2aa3 from qemu	2018-05-14 08:43:55 -04:00
Richard Henderson	67740bbc7f	target/arm: Fix float16 to/from int16 The instruction "ucvtf v0.4h, v0.4h, #2", with input 0x8000u, overflows the intermediate float16 to infinity before we have a chance to scale the output. Use float64 as the intermediate type so that no input argument (uint32_t in this case) can overflow or round before scaling. Given the declared argument, the signed int32_t function has the same problem. When converting from float16 to integer, using u/int32_t instead of u/int16_t means that the bounding is incorrect. Backports commit 88808a022c06f98d81cd3f2d105a5734c5614839 from qemu	2018-05-14 08:41:20 -04:00
Richard Henderson	e403957a5e	target/arm: Implement vector shifted FCVT for fp16 While we have some of the scalar paths for FCVT for fp16, we failed to decode the fp16 version of these instructions. Backports commit d0ba8e74acd299b092786ffc30b306638d395a9e from qemu	2018-05-14 08:36:54 -04:00
Richard Henderson	ad6c191d96	target/arm: Implement vector shifted SCVF/UCVF for fp16 While we have some of the scalar paths for *CVF for fp16, we failed to decode the fp16 version of these instructions. Backports commit a6117fae4576edfe7a5a5b802a742c33112c0993 from qemu	2018-05-14 08:31:29 -04:00
Richard Henderson	688d0fd0ed	target/arm: Implement CAS and CASP Backports commit 44ac14b06fa33f60982923b6b8a3bf8dd2fea61d from qemu	2018-05-14 08:28:45 -04:00
Richard Henderson	b23c543e1a	target/arm: Fill in disas_ldst_atomic This implements all of the v8.1-Atomics instructions except for compare-and-swap, which is decoded elsewhere. Backports commit 74608ea45434c9b07055b21885e093528c5ed98c from qemu	2018-05-14 08:18:37 -04:00
Richard Henderson	7ae8671b5e	target/arm: Introduce ARM_FEATURE_V8_ATOMICS and initial decode The insns in the ARMv8.1-Atomics are added to the existing load/store exclusive and load/store reg opcode spaces. Rearrange the top-level decoders for these to accommodate. The Atomics insns themselves still generate Unallocated. Backports commit 68412d2ecedbab5a43b0d346cddb27e00d724aff from qemu	2018-05-14 08:15:52 -04:00
Richard Henderson	b2af557a0f	target/arm: Use new min/max expanders The generic expanders replace nearly identical code in the translator. Backports commit ecb8ab8d71aab770555a6972428b711400a27248 from qemu	2018-05-14 07:34:52 -04:00
Emilio G. Cota	d26bf1d446	translator: merge max_insns into DisasContextBase While at it, use int for both num_insns and max_insns to make sure we have same-type comparisons. Backports commit b542683d77b4f56cef0221b267c341616d87bce9 from qemu	2018-05-11 13:59:17 -04:00
Richard Henderson	5940a36394	target/arm: Tidy condition in disas_simd_two_reg_misc Path analysis shows that size == 3 && !is_q has been eliminated. Fixes: Coverity CID1385853 Backports commit a8766e3172c1671cab297c1ef4566a3c5d094822 from qemu	2018-05-08 08:26:31 -04:00
Richard Henderson	cb324fd039	target/arm: Tidy conditions in handle_vec_simd_shri The (size > 3 && !is_q) condition is identical to the preceding test of bit 3 in immh; eliminate it. For the benefit of Coverity, assert that size is within the bounds we expect. Fixes: Coverity CID1385846 Fixes: Coverity CID1385849 Fixes: Coverity CID1385852 Fixes: Coverity CID1385857 Backports commit 8dae46970532afcf93470b00e83ca9921980efc3 from qemu	2018-05-08 08:25:37 -04:00
Peter Maydell	7a3ee5fd95	target/arm: Honour MDCR_EL2.TDE when routing exceptions due to BKPT/BRK The MDCR_EL2.TDE bit allows the exception level targeted by debug exceptions to be set to EL2 for code executing at EL0. We handle this in the arm_debug_target_el() function, but this is only used for hardware breakpoint and watchpoint exceptions, not for the exception generated when the guest executes an AArch32 BKPT or AArch64 BRK instruction. We don't have enough information for a translate-time equivalent of arm_debug_target_el(), so instead make BKPT and BRK call a special purpose helper which can do the routing, rather than the generic exception_with_syndrome helper. Backports commit c900a2e62dd6dde11c8f5249b638caad05bb15be from qemu	2018-03-25 16:33:04 -04:00
Victor Kamensky	ecd2ecb590	arm/translate-a64: treat DISAS_UPDATE as variant of DISAS_EXIT In OE project 4.15 linux kernel boot hang was observed under single cpu aarch64 qemu. Kernel code was in a loop waiting for vtimer arrival, spinning in TC generated blocks, while interrupt was pending unprocessed. This happened because when qemu tried to handle vtimer interrupt target had interrupts disabled, as result flag indicating TCG exit, cpu->icount_decr.u16.high, was cleared but arm_cpu_exec_interrupt function did not call arm_cpu_do_interrupt to process interrupt. Later when target reenabled interrupts, it happened without exit into main loop, so following code that waited for result of interrupt execution run in infinite loop. To solve the problem instructions that operate on CPU sys state (i.e enable/disable interrupt), and marked as DISAS_UPDATE, should be considered as DISAS_EXIT variant, and should be forced to exit back to main loop so qemu will have a chance processing pending CPU state updates, including pending interrupts. This change brings consistency with how DISAS_UPDATE is treated in aarch32 case. Backports commit a75a52d62418dafe462be4fe30485501d1010bb9 from qemu	2018-03-25 16:27:27 -04:00
Lioncash	0dd13de42f	target/arm/translate-a64: Correct bad merge	2018-03-12 11:17:33 -04:00
Richard Henderson	abd86b2287	target/arm: Decode aa64 armv8.3 fcmla Backports commit d17b7cdcf4ea3e858ceee8b86fc8544bb71561e6 from qemu Also remember to commit vec_helper.	2018-03-09 01:05:02 -05:00
Richard Henderson	4b39a36416	target/arm: Decode aa64 armv8.3 fcadd Backports commit 1695cd61b08d4376c11e0658836c4f08b4fc3aa1 from qemu	2018-03-09 00:58:37 -05:00
Richard Henderson	152c9484bd	target/arm: Decode aa64 armv8.1 scalar/vector x indexed element Backports commit d345df7a3f1336ceb0537c1fa0a7261030426768 from qemu	2018-03-09 00:12:00 -05:00
Lioncash	12fd2cc113	target/arm: Decode aa64 armv8.1 three same extra	2018-03-09 00:10:09 -05:00
Richard Henderson	4f585f71fb	target/arm: Decode aa64 armv8.1 scalar three same extra Backports commit d9061ec3d27eb940402a7eafee3fb77ce1146ad4 from qemu	2018-03-09 00:02:23 -05:00
Richard Henderson	774cbded7a	target/arm: Refactor disas_simd_indexed size checks The integer size check was already outside of the opcode switch; move the floating-point size check outside as well. Unify the size vs index adjustment between fp and integer paths. Backports commit 449f264b1749ac0e59c58bbc2eacdb3dc302c2bf from qemu	2018-03-08 23:53:39 -05:00
Richard Henderson	1fd2644738	target/arm: Refactor disas_simd_indexed decode Include the U bit in the switches rather than testing separately. Backports commit 5f81b1de43259ed0969e62a7419ab9dd9da2c5c0 from qemu	2018-03-08 23:44:03 -05:00
Alex Bennée	6e41113897	arm/translate-a64: add all single op FP16 to handle_fp_1src_half This includes FMOV, FABS, FNEG, FSQRT and FRINT[NPMZAXI]. We re-use existing helpers to achieve this. Backports commit c2c08713a6a5846bbe601d4d1b4f9708ba77efdc from qemu	2018-03-08 23:44:02 -05:00
Alex Bennée	c6c8a1cccc	arm/translate-a64: implement simd_scalar_three_reg_same_fp16 This covers the encoding group: Advanced SIMD scalar three same FP16 As all the helpers are already there it is simply a case of calling the existing helpers in the scalar context. Backports commit 7c93b7741b29b3ffda81a6e9525771b4409db99f from qemu	2018-03-08 23:44:02 -05:00
Alex Bennée	dd29452046	arm/translate-a64: add all FP16 ops in simd_scalar_pairwise I only needed to do a little light re-factoring to support the half-precision helpers. Backports commit 5c36d89567cfd049a7c59ff219639f788225068f from qemu	2018-03-08 23:44:02 -05:00
Alex Bennée	8bbabd7eb3	arm/translate-a64: add FP16 FMOV to simd_mod_imm Only one half-precision instruction has been added to this group. Backports commit 70b4e6a445715519ae55179dc54f6e961ab30c27 from qemu	2018-03-08 23:43:52 -05:00
Alex Bennée	b117df18df	arm/translate-a64: add FP16 FRSQRTE to simd_two_reg_misc_fp16 Backports commit c625ff95070e3ef96bd007de744e1d97c881efeb from qemu	2018-03-08 22:45:39 -05:00
Alex Bennée	fdb07713e6	arm/translate-a64: add FP16 FSQRT to simd_two_reg_misc_fp16 Backports commit b96a54c7e5576bd35b7d00d37b7929d2892d8cac from qemu	2018-03-08 21:57:35 -05:00
Alex Bennée	6102a61b14	arm/translate-a64: add FP16 FRCPX to simd_two_reg_misc_fp16 We go with the localised helper. Backports commit 986950283837f697b35782b9ac3bc99fca614640 from qemu	2018-03-08 19:15:23 -05:00
Alex Bennée	4ea310c131	arm/translate-a64: add FP16 FRECPE Now we have added f16 during the re-factoring we can simply call the helper. Backports commit fbd06e1e4b6566b4d727f9e553c819d034942f68 from qemu	2018-03-08 19:12:06 -05:00
Alex Bennée	c590ff441c	arm/translate-a64: add FP16 FNEG/FABS to simd_two_reg_misc_fp16 Neither of these operations alter the floating point status registers so we can do a pure bitwise operation, either squashing any sign bit (ABS) or inverting it (NEG). Backports commit 15f8a233c8c023dbc77b6fe6cd7c79eac9bee263 from qemu	2018-03-08 18:51:35 -05:00
Alex Bennée	7161c1ed52	arm/translate-a64: add FP16 SCVTF/UCVFT to simd_two_reg_misc_fp16	2018-03-08 18:48:25 -05:00
Alex Bennée	8ac9e3cff2	arm/translate-a64: add FP16 FCMxx (zero) to simd_two_reg_misc_fp16 I re-use the existing handle_2misc_fcmp_zero handler and tweak it slightly to deal with the half-precision case. Backports commit 7d4dd1a73a023f75c893623710e43743501b318e from qemu	2018-03-08 18:32:36 -05:00
Alex Bennée	39a68548d1	arm/translate-a64: add FCVTxx to simd_two_reg_misc_fp16 This covers all the floating point convert operations. Backports commit 2df581304193d70eaf0d22cf4cb4613f74b6e59b from qemu	2018-03-08 18:25:29 -05:00
Alex Bennée	d5f002b39a	arm/translate-a64: add FP16 FPRINTx to simd_two_reg_misc_fp16 This adds the full range of half-precision floating point to integral instructions. Backports commit 6109aea2d954891027acba64a13f1f1c7463cfac from qemu	2018-03-08 18:21:58 -05:00
Alex Bennée	33eda0f5d4	arm/translate-a64: initial decode for simd_two_reg_misc_fp16 This actually covers two different sections of the encoding table: Advanced SIMD scalar two-register miscellaneous FP16 Advanced SIMD two-register miscellaneous (FP16) The difference between the two is covered by a combination of Q (bit 30) and S (bit 28). Notably the FRINTx instructions are only available in the vector form. This is just the decode skeleton which will be filled out by later patches. Backports commit 5d432be6fd6efe37833ac82623c3abd35117b421 from qemu	2018-03-08 18:14:04 -05:00
Alex Bennée	82ffaab7de	arm/translate-a64: add FP16 x2 ops for simd_indexed A bunch of the vectorised bitwise operations just operate on larger chunks at a time. We can do the same for the new half-precision operations by introducing some TWOHALFOP helpers which work on each half of a pair of half-precision operations at once. Hopefully all this hoop jumping will get simpler once we have generically vectorised helpers here. Backports commit 6089030c7322d8f96b54fb9904e53b0f464bb8fe from qemu	2018-03-08 18:08:39 -05:00
Alex Bennée	38815b2901	arm/translate-a64: add FP16 FMULX/MLS/FMLA to simd_indexed The helpers use the new re-factored muladd support in SoftFloat for the float16 work. Backports commit 5d265064cf30daaacce5a4ce9945fc573015fb5f from qemu	2018-03-08 15:56:20 -05:00
Alex Bennée	c6fda07628	arm/translate-a64: add FP16 pairwise ops simd_three_reg_same_fp16 This includes FMAXNMP, FADDP, FMAXP, FMINNMP, FMINP. Backports commit 7a2c6e618156674cf9eac8bf36e79f674fbf974e from qemu	2018-03-08 15:50:56 -05:00
Alex Bennée	4b2577537b	arm/translate-a64: add FP16 FR[ECP/SQRT]S to simd_three_reg_same_fp16 As some of the constants here will also be needed elsewhere (specifically for the upcoming SVE support) we move them out to softfloat.h. Backports commit 026e2d6ef74000afb9049f46add4b94f594c8fb3 from qemu	2018-03-08 15:47:34 -05:00
Alex Bennée	a02b9b81a9	arm/translate-a64: add FP16 FMULA/X/S to simd_three_reg_same_fp16 Backports commit 2deb992b767d28035fac3b374c7730494ff0b43d from qemu Also backports the fp16 changes introduced in commit f566c0474a9b9bbd9ed248607e4007e24d3358c0	2018-03-08 15:42:48 -05:00
Alex Bennée	ba8df54753	arm/translate-a64: add FP16 F[A]C[EQ/GE/GT] to simd_three_reg_same_fp16 These use the generic float16_compare functionality which in turn uses the common float_compare code from the softfloat re-factor. Backports commit d32adeae1a71a8e71374fa48d3d6ab0ad4c23e94 from qemu	2018-03-08 12:59:37 -05:00
Alex Bennée	4a6a41d2c5	arm/translate-a64: add FP16 FADD/FABD/FSUB/FMUL/FDIV to simd_three_reg_same_fp16 The fprintf is only there for debugging as the skeleton is added to, it will be removed once the skeleton is complete. Backports commit 372087348d561e7f4051d7b32609bda417092ddf from qemu	2018-03-08 12:56:15 -05:00
Alex Bennée	2f850606e9	arm/translate-a64: initial decode for simd_three_reg_same_fp16 This is the initial decode skeleton for the Advanced SIMD three same instruction group. The fprintf is purely to aid debugging as the additional instructions are added. It will be removed once the group is complete. Backports commit 376e8d6cda985df31c8561db4b7ea365b6fe6f87 from qemu	2018-03-08 12:53:23 -05:00
Alex Bennée	fe74abd307	arm/translate-a64: handle_3same_64 comment fix We do implement all the opcodes. Backports commit 3840d219b433507f04a685120ff770ce4e06c55d from qemu	2018-03-08 12:51:01 -05:00
Alex Bennée	af75074fe7	arm/translate-a64: implement half-precision F(MIN\|MAX)(V\|NMV) This implements the half-precision variants of the across vector reduction operations. This involves a re-factor of the reduction code which more closely matches the ARM ARM order (and handles 8 element reductions). Backports commit 807cdd504283c11addcd7ea95ba594bbddc86fe4 from qemu	2018-03-08 12:49:30 -05:00
Alex Bennée	27d8d01566	target/arm/helper: pass explicit fpst to set_rmode As the rounding mode is now split between FP16 and the rest of floating point we need to be explicit when tweaking it. Instead of passing the CPU env we now pass the appropriate fpst pointer directly. Backports commit 9b04991686785e18b18a36d193b68f08f7c91648 from qemu	2018-03-08 12:41:54 -05:00
Alex Bennée	996f38056f	target/arm/cpu.h: add additional float_status flags Half-precision flush to zero behaviour is controlled by a separate FZ16 bit in the FPCR. To handle this we pass a pointer to fp_status_fp16 when working on half-precision operations. The value of the presented FPCR is calculated from an amalgam of the two when read. Backports commit d81ce0ef2c4f1052fcdef891a12499eca3084db7 from qemu	2018-03-08 12:34:39 -05:00
Richard Henderson	1f71084740	target/arm: Handle SVE registers when using clear_vec_high When storing to an AdvSIMD FP register, all of the high bits of the SVE register are zeroed. Therefore, call it more often with is_q as a parameter. Backports commit 4ff55bcb0ee6452b768835f86d94bd727185f812 from qemu	2018-03-08 09:32:33 -05:00
Richard Henderson	07b928eca4	target/arm: Enforce access to ZCR_EL at translation This also makes sure that we get the correct ordering of SVE vs FP exceptions. Backports commit 490aa7f13a2ad31f92205879c4dc2387b602ef14 from qemu	2018-03-08 09:17:33 -05:00
Richard Henderson	d5c4d3e3c3	target/arm: Enforce FP access to FPCR/FPSR Backports commit fe03d45f9e9baa89e8c4da50de771767d5d48990 from qemu	2018-03-08 09:14:52 -05:00
Richard Henderson	02516c53ff	target/arm: Add SVE state to TB->FLAGS Add both SVE exception state and vector length. Backports commit 1db5e96c54d8b3d1df0a6fed6771390be6b010da from qemu	2018-03-07 11:44:32 -05:00
Richard Henderson	834e3a1d04	target/arm: Expand vector registers for SVE Change vfp.regs as a uint64_t to vfp.zregs as an ARMVectorReg. The previous patches have made the change in representation relatively painless. Backports commit c39c2b9043ec59516c80f2c6f3e8193e99d04d4b from qemu	2018-03-07 11:33:49 -05:00
Ard Biesheuvel	85e6d710e4	target/arm: implement SM4 instructions This implements emulation of the new SM4 instructions that have been added as an optional extension to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit b6577bcd251ca0d57ae1de149e3c706b38f21587 from qemu	2018-03-07 08:57:53 -05:00
Ard Biesheuvel	78d15a9cd0	target/arm: implement SM3 instructions This implements emulation of the new SM3 instructions that have been added as an optional extension to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit 80d6f4c6bbb718f343a832df8dee15329cc7686c from qemu	2018-03-07 08:53:47 -05:00
Ard Biesheuvel	72078a7674	target/arm: implement SHA-3 instructions This implements emulation of the new SHA-3 instructions that have been added as an optional extensions to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit cd270ade74ea86467f393a9fb9c54c4f1148c28f from qemu	2018-03-07 08:44:47 -05:00
Ard Biesheuvel	66b8b01f09	target/arm: implement SHA-3 instructions This implements emulation of the new SHA-3 instructions that have been added as an optional extensions to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit cd270ade74ea86467f393a9fb9c54c4f1148c28f from qemu	2018-03-07 08:41:40 -05:00
Ard Biesheuvel	0ef74f6d6d	target/arm: implement SHA-512 instructions This implements emulation of the new SHA-512 instructions that have been added as an optional extensions to the ARMv8 Crypto Extensions in ARM v8.2. Backports commit 90b827d131812d7f0a8abb13dba1942a2bcee821 from qemu	2018-03-07 08:39:49 -05:00
Richard Henderson	16a0a3e156	target/arm: Use vector infrastructure for aa64 orr/bic immediate Backports commit 064e265d5680e5c605d6ee8370fc1e8da094e66d from qemu	2018-03-06 16:17:42 -05:00
Richard Henderson	c5c8488928	target/arm: Use vector infrastructure for aa64 multiplies Backports commit 0c7c55c492c918b6275baa3fee8b176c31465e3c from qemu	2018-03-06 16:14:47 -05:00
Richard Henderson	955fec9300	target/arm: Use vector infrastructure for aa64 compares Backports commit 79d61de6bdc3980f0efef85f7539e129ab8a4a40 from qemu	2018-03-06 16:10:10 -05:00


190 Commits