unicorn

mirror of https://github.com/yuzu-emu/unicorn.git synced 2024-10-21 11:28:18 +02:00

Author	SHA1	Message	Date
Alex Bennée	9d83300f3e	fpu: rename softfloat-specialize.h -> .inc.c This is not a normal header and should only be included in the main softfloat.c file to bring in the various target specific specialisations. Indeed as it contains non-inlined C functions it is not even a legal header. Rename it to match our included C convention. Backports commit 00f43279a3e5e7ea3a0fa853157863663e838e2e from qemu	2019-11-18 21:12:30 -05:00
Alex Bennée	dbddafe2df	fpu: replace LIT64 with UINT64_C macros In our quest to eliminate the home rolled LIT64 macro we fixup usage inside the softfloat code. While we are at it we remove some of the extraneous spaces to closer fit the house style. Backports commit e932112420f063776f2b9d9e5512830cd6890a7a from qemu	2019-11-18 20:57:12 -05:00
Alex Bennée	6eb3c9ee79	fpu: use min/max values from stdint.h for integral overflow Remove some more use of LIT64 while making the meaning more clear. We also avoid the need of casts as the results by definition fit into the return type. Backports commit 2c217da0fc9f1127bda804e2a500b8138b02c581 from qemu	2019-11-18 20:45:40 -05:00
Alex Bennée	0d573763c9	fpu: convert float[16/32/64]_squash_denormal to new modern style This also allows us to remove the extractFloat16exp/frac helpers. We avoid using the floatXX_pack_raw functions as they are slight overkill for masking out all but the top bit of the number. The generated code is almost exactly the same as the pre-conversion code, so this makes no difference. Backports commit e6b405fe00d8e6424a58492b37a1656d1ef0929b from qemu	2019-11-18 20:42:06 -05:00
Lioncash	d6b706a296	qemu/fpu: Synchronize with Qemu Resolves a few formatting discrepancies	2019-03-09 18:27:31 -05:00
Richard Henderson	11679ff3cf	softfloat: Support float_round_to_odd more places Previously this was only supported for roundAndPackFloat64. New support in round_canonical, round_to_int, float128_round_to_int, roundAndPackFloat32, roundAndPackInt32, roundAndPackInt64, roundAndPackUint64. This does not include any of the floatx80 routines, as we do not have users for that rounding mode there. Backports commit 5d64abb32ffe558e616545819f3e53dd66335994 from qemu	2019-02-28 15:17:38 -05:00
David Hildenbrand	7373819b1a	softfloat: Implement float128_to_uint32 Handling it just like float128_to_uint32_round_to_zero, that hopefully is free of bugs :) Documentation basically copied from float128_to_uint64 Backports commit e45de9922e43c1ce4f4739b62142314a13029d5c from qemu	2019-02-28 15:13:09 -05:00
Emilio G. Cota	8276a4dc66	hardfloat: implement float32/64 comparison Performance results for fp-bench: Host: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz - before: cmp-single: 110.98 MFlops cmp-double: 107.12 MFlops - after: cmp-single: 506.28 MFlops cmp-double: 524.77 MFlops Note that flattening both eq and eq_signaling versions would give us extra performance (695v506, 615v524 Mflops for single/double, respectively) but this would emit two essentially identical functions for each eq/signaling pair, which is a waste. Aggregate performance improvement for the last few patches: [ all charts in png: https://imgur.com/a/4yV8p ] 1. Host: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz [ASCII bar chart omitted: qemu-aarch64 NBench score, higher is better; groups: FOURIER, NEURAL NET, LU DECOMPOSITION, gmean] [ASCII bar chart omitted: qemu-aarch64 SPEC06fp (test set) speedup over QEMU 4c2c1015905; error bars: 95% confidence interval; groups: SPEC06fp benchmarks and geomean] 2. Host: Applied Micro X-Gene, Aarch64 A57 @ 2.4 GHz [ASCII bar chart omitted: qemu-aarch64 NBench score, higher is better; groups: FOURIER, NEURAL NET, LU DECOMPOSITION, gmean]	2018-12-19 10:45:22 -05:00
Emilio G. Cota	f7549fc13e	hardfloat: implement float32/64 square root Performance results for fp-bench: Host: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz - before: sqrt-single: 42.30 MFlops sqrt-double: 22.97 MFlops - after: sqrt-single: 311.42 MFlops sqrt-double: 311.08 MFlops Here USE_FP makes a huge difference for f64's, with throughput going from ~200 MFlops to ~300 MFlops. Backports commit f131bae8a7b7ed1928cc94c69df291db609c316a from qemu	2018-12-19 10:43:23 -05:00
Emilio G. Cota	3cf836ca83	hardfloat: implement float32/64 fused multiply-add Performance results for fp-bench: 1. Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz - before: fma-single: 74.73 MFlops fma-double: 74.54 MFlops - after: fma-single: 203.37 MFlops fma-double: 169.37 MFlops 2. ARM Aarch64 A57 @ 2.4GHz - before: fma-single: 23.24 MFlops fma-double: 23.70 MFlops - after: fma-single: 66.14 MFlops fma-double: 63.10 MFlops 3. IBM POWER8E @ 2.1 GHz - before: fma-single: 37.26 MFlops fma-double: 37.29 MFlops - after: fma-single: 48.90 MFlops fma-double: 59.51 MFlops Here having 3FP64 set to 1 pays off for x86_64: [1] 170.15 vs [0] 153.12 MFlops Backports commit ccf770ba7396c240ca8a1564740083742dd04c08 from qemu	2018-12-19 10:42:00 -05:00
Emilio G. Cota	95781d2bb5	hardfloat: implement float32/64 division Performance results for fp-bench: 1. Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz - before: div-single: 34.84 MFlops div-double: 34.04 MFlops - after: div-single: 275.23 MFlops div-double: 216.38 MFlops 2. ARM Aarch64 A57 @ 2.4GHz - before: div-single: 9.33 MFlops div-double: 9.30 MFlops - after: div-single: 51.55 MFlops div-double: 15.09 MFlops 3. IBM POWER8E @ 2.1 GHz - before: div-single: 25.65 MFlops div-double: 24.91 MFlops - after: div-single: 96.83 MFlops div-double: 31.01 MFlops Here setting 2FP64_USE_FP to 1 pays off for x86_64: [1] 215.97 vs [0] 62.15 MFlops Backports commit 4a6295613f533a6841de5968c50e1ca36748807e from qemu	2018-12-19 10:40:00 -05:00
Emilio G. Cota	93991714fb	hardfloat: implement float32/64 multiplication Performance results for fp-bench: 1. Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz - before: mul-single: 126.91 MFlops mul-double: 118.28 MFlops - after: mul-single: 258.02 MFlops mul-double: 197.96 MFlops 2. ARM Aarch64 A57 @ 2.4GHz - before: mul-single: 37.42 MFlops mul-double: 38.77 MFlops - after: mul-single: 73.41 MFlops mul-double: 76.93 MFlops 3. IBM POWER8E @ 2.1 GHz - before: mul-single: 58.40 MFlops mul-double: 59.33 MFlops - after: mul-single: 60.25 MFlops mul-double: 94.79 MFlops Backports commit 2dfabc86e656e835c67954c60e143ecd33e15817 from qemu	2018-12-19 10:38:33 -05:00
Emilio G. Cota	0862d9c462	hardfloat: implement float32/64 addition and subtraction Performance results (single and double precision) for fp-bench: 1. Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz - before: add-single: 135.07 MFlops add-double: 131.60 MFlops sub-single: 130.04 MFlops sub-double: 133.01 MFlops - after: add-single: 443.04 MFlops add-double: 301.95 MFlops sub-single: 411.36 MFlops sub-double: 293.15 MFlops 2. ARM Aarch64 A57 @ 2.4GHz - before: add-single: 44.79 MFlops add-double: 49.20 MFlops sub-single: 44.55 MFlops sub-double: 49.06 MFlops - after: add-single: 93.28 MFlops add-double: 88.27 MFlops sub-single: 91.47 MFlops sub-double: 88.27 MFlops 3. IBM POWER8E @ 2.1 GHz - before: add-single: 72.59 MFlops add-double: 72.27 MFlops sub-single: 75.33 MFlops sub-double: 70.54 MFlops - after: add-single: 112.95 MFlops add-double: 201.11 MFlops sub-single: 116.80 MFlops sub-double: 188.72 MFlops Note that the IBM and ARM machines benefit from having HARDFLOAT_2F{32,64}_USE_FP set to 0. Otherwise their performance can suffer significantly: - IBM Power8: add-single: [1] 54.94 vs [0] 116.37 MFlops add-double: [1] 58.92 vs [0] 201.44 MFlops - Aarch64 A57: add-single: [1] 80.72 vs [0] 93.24 MFlops add-double: [1] 82.10 vs [0] 88.18 MFlops On the Intel machine, having 2F64 set to 1 pays off, but it doesn't for 2F32: - Intel i7-6700K: add-single: [1] 285.79 vs [0] 426.70 MFlops add-double: [1] 302.15 vs [0] 278.82 MFlops Backports commit 1b615d482094e0123d187f0ad3c676ba8eb9d0a3 from qemu	2018-12-19 10:36:55 -05:00
Emilio G. Cota	bca8e39e3c	fpu: introduce hardfloat The appended paves the way for leveraging the host FPU for a subset of guest FP operations. For most guest workloads (e.g. FP flags aren't ever cleared, inexact occurs often and rounding is set to the default [to nearest]) this will yield sizable performance speedups. The approach followed here avoids checking the FP exception flags register. See the added comment for details. This assumes that QEMU is running on an IEEE754-compliant FPU and that the rounding is set to the default (to nearest). The implementation-dependent specifics of the FPU should not matter; things like tininess detection and snan representation are still dealt with in soft-fp. However, this approach will break on most hosts if we compile QEMU with flags that break IEEE compatibility. There is no way to detect all of these flags at compilation time, but at least we check for -ffast-math (which defines __FAST_MATH__) and disable hardfloat (plus emit a #warning) when it is set. This patch just adds common code. Some operations will be migrated to hardfloat in subsequent patches to ease bisection. Note: some architectures (at least PPC, there might be others) clear the status flags passed to softfloat before most FP operations. This precludes the use of hardfloat, so to avoid introducing a performance regression for those targets, we add a flag to disable hardfloat. In the long run though it would be good to fix the targets so that at least the inexact flag passed to softfloat is indeed sticky. Backports commit a94b783952cc493cb241aabb1da8c7a830385baa from qemu	2018-12-19 10:32:32 -05:00
Emilio G. Cota	a9d9005399	softfloat: rename canonicalize to sf_canonicalize glibc >= 2.25 defines canonicalize in commit eaf5ad0 (Add canonicalize, canonicalizef, canonicalizel., 2016-10-26). Given that we'll be including <math.h> soon, prepare for this by prefixing our canonicalize() with sf_ to avoid clashing with the libc's canonicalize(). Backports commit f9943c7f766678af36d31076b78e466256f4871b from qemu	2018-12-19 10:30:38 -05:00
Richard Henderson	90fdf9b598	softfloat: Fix division The __udiv_qrnnd primitive that we nicked from gmp requires its inputs to be normalized. We were not doing that. Because the inputs are nearly normalized already, finishing that is trivial. Replace div128to64 with a "proper" udiv_qrnnd, so that this remains a reusable primitive. Fixes: cf07323d494 Fixes: https://bugs.launchpad.net/qemu/+bug/1793119 Backports commit 5dfbc9e4903c0121140f2945f05df48cea72dd82 from qemu	2018-10-08 14:15:15 -04:00
Thomas Huth	bd116582e8	softfloat: Replace countLeadingZeros32/64 with clz32/64 Our minimum required compiler for compiling QEMU is GCC 4.1 these days, so we can drop the support for compilers which do not provide the __builtin_clz*() functions yet. Since the countLeadingZeros32/64 are then identical to the clz32/64 functions, and we do not have to sync the softfloat 2 codebase with upstream anymore (softfloat 3 is a complete rewrite) we can simply replace the functions with our QEMU versions. Backports commit 0019d5c3a18c31604fb55f9cec3ceb13999c4866 from qemu	2018-10-08 14:15:15 -04:00
Emilio G. Cota	9f08fb35bc	softfloat: remove float64_trunc_to_int It has not had users since f83311e476 ("target-m68k: use floatx80 internally", 2017-06-21). Note that no other bit-width has floatX_trunc_to_int. Backports commit c953da8f0be5e026d1c9128660736d72294feb3e from qemu	2018-10-08 14:15:11 -04:00
Richard Henderson	0d69492b64	softfloat: Add scaling float-to-int routines Backports commit 2f6c74be593ec5219e54d7b4abd4e5a98d7f3efc from qemu	2018-08-25 04:05:45 -04:00
Richard Henderson	ee0d8aff72	softfloat: Add scaling int-to-float routines Backports commit 2abdfe24402907e7e8c103bdd4166f26b74200c2 from qemu	2018-08-25 03:47:12 -04:00
Richard Henderson	a456920fd7	softfloat: Fix missing inexact for floating-point add For 0x1.0000000000003p+0 + 0x1.ffffffep+14 = 0x1.0001fffp+15 we dropped the sticky bit and so failed to raise inexact. Backports commit 64d450a0eaad5f02f9d6bba1dd451446297bb4dc from qemu	2018-08-17 14:06:55 -04:00
Richard Henderson	31b3a98233	fpu/softfloat: Define floatN_silence_nan in terms of parts_silence_nan Isolate the target-specific choice to 3 functions instead of 6. The code in floatx80_default_nan tried to be over-general. There are only two targets that support this format: x86 and m68k. Thus there is no point in inventing a mechanism for snan_bit_is_one. Move routines that no longer have ifdefs out of softfloat-specialize.h. Backports commit 377ed92679a2a5f838bc0a095112ea5020720fff from qemu	2018-05-20 00:13:42 -04:00
Richard Henderson	df3436b518	fpu/softfloat: Define floatN_default_nan in terms of parts_default_nan Isolate the target-specific choice to 2 functions instead of 6. The code in float16_default_nan was only correct for ARM, MIPS, and X86. Though float16 support is rare among our targets. The code in float128_default_nan was arguably wrong for Sparc. While QEMU supports the Sparc 128-bit insns, no real cpu enables it. The code in floatx80_default_nan tried to be over-general. There are only two targets that support this format: x86 and m68k. Thus there is no point in inventing a value for snan_bit_is_one. Move routines that no longer have ifdefs out of softfloat-specialize.h. Backports commit 0218a16e540ad416683e19dfbd52f75092507b27 from qemu	2018-05-20 00:04:20 -04:00
Richard Henderson	77bee9e8a5	fpu/softfloat: Pass FloatClass to pickNaNMulAdd For each operand, pass a single enumeration instead of a pair of booleans. The commit also merges multiple different ifdef-selected implementations of pickNaNMulAdd into a single function whose body is ifdef-selected. Backports commit 3bd2dec1a1e8fadb49e3ff2e2633f79e01a25c41 from qemu	2018-05-19 23:59:01 -04:00
Richard Henderson	d03c570049	fpu/softfloat: Pass FloatClass to pickNaN For each operand, pass a single enumeration instead of a pair of booleans. The commit also merges multiple different ifdef-selected implementations of pickNaN into a single function whose body is ifdef-selected. Backports commit 4f251cfd52c7945ebd6ab0d86518b1a9aa51b10c from qemu	2018-05-19 23:49:46 -04:00
Richard Henderson	7995525f68	fpu/softfloat: Make is_nan et al available to softfloat-specialize.h We will need these helpers within softfloat-specialize.h, so move the definitions above the include. After specialization, they will not always be used so mark them to avoid the Werror. Backports commit 247d1f2190c5530fd18fe92a145d0a1985fca4e4 from qemu	2018-05-19 23:41:51 -04:00
Alex Bennée	f1f2521b38	fpu/softfloat: re-factor float to float conversions This allows us to delete a lot of additional boilerplate code which is no longer needed. Backports commit 6fed16b265a4fcc810895bbca4d67e1ae7a89f07 from qemu	2018-05-19 23:16:58 -04:00
Alex Bennée	e65fd25e17	fpu/softfloat: Partial support for ARM Alternative half-precision For float16 ARM supports an alternative half-precision format which sacrifices the ability to represent NaN/Inf in return for a higher dynamic range. The new FloatFmt flag, arm_althp, is then used to modify the behaviour of canonicalize and round_canonical with respect to representation and exception raising. Usage of this new flag waits until we re-factor float-to-float conversions. Backports commit ca3a3d5a3141d44aa717dc11e4d33a834a85e1f6 from qemu	2018-05-19 23:02:38 -04:00
Richard Henderson	e26a389129	fpu/softfloat: Replace float_class_msnan with parts_silence_nan With a canonical representation of NaNs, we can silence an SNaN immediately rather than delay until the final format is known. Backports commit 0bcfbcbea548656ff930394f296589728c2a0c5d from qemu	2018-05-19 22:28:27 -04:00
Richard Henderson	baf1d57071	fpu/softfloat: Replace float_class_dnan with parts_default_nan With a canonical representation of NaNs, we can return the default nan directly rather than delay the expansion until the final format is known. Note one case where we uselessly assigned to a.sign, which was overwritten/ignored later when expanding float_class_dnan. Backports commit f7e598e264b94d0982e647ac303108781d5eb4fa from qemu	2018-05-19 22:25:21 -04:00
Richard Henderson	97152bc427	fpu/softfloat: Introduce parts_is_snan_frac Backports commit 298b468e4389587ab2e8599dd33eff3fbc698011 from qemu	2018-05-19 22:20:03 -04:00
Richard Henderson	f89c1fc034	fpu/softfloat: Canonicalize NaN fraction Shift the NaN fraction to a canonical position, much like we do for the fraction of normal numbers. This will facilitate manipulation of NaNs within the shared code paths. Backports commit 94933df0e5c34d1a50fc950553f9c9649cae5320 from qemu	2018-05-19 22:17:44 -04:00
Richard Henderson	3873bfe133	fpu/softfloat: Move softfloat-specialize.h below FloatParts definition We want to be able to specialize on the canonical representation. Backports commit 0664335a6eb65d684918dd3f1a7f0a3d5b92cbe1 from qemu	2018-05-19 22:07:12 -04:00
Petr Tesarik	d1d09f384e	fpu/softfloat: Fix conversion from uint64 to float128 The significand is passed to normalizeRoundAndPackFloat128() as high first, low second. The current code passes the integer first, so the result is incorrectly shifted left by 64 bits. This bug affects the emulation of s390x instruction CXLGBR (convert from logical 64-bit binary-integer operand to extended BFP result). Backports commit 6603d50648901e8b9e6d66ec1142accf0b1df1e6 from qemu	2018-05-19 21:55:59 -04:00
Peter Maydell	e7656cbd7a	fpu/softfloat: Don't set Invalid for float-to-int(MAXINT) In float-to-integer conversion, if the floating point input converts exactly to the largest or smallest integer that fits in to the result type, this is not an overflow. In this situation we were producing the correct result value, but were incorrectly setting the Invalid flag. For example for Arm A64, "FCVTAS w0, d0" on an input of 0x41dfffffffc00000 should produce 0x7fffffff and set no flags. Fix the boundary case to take the right half of the if() statements. This fixes a regression from 2.11 introduced by the softfloat refactoring. Backports commit 333583757c5e910b040bef793974773635ce1918 from qemu	2018-05-15 21:50:41 -04:00
Alex Bennée	99de568b7d	fpu/softfloat: int_to_float ensure r fully initialised Reported by Coverity (CID1390635). We ensure this for uint_to_float later on so we might as well mirror that. Backports commit a5a5f5e2e437db6c19164b734f838a7bf9e0c5ec from qemu	2018-05-15 21:49:34 -04:00
Peter Maydell	150125de3f	softfloat: Handle default NaN mode after pickNaNMulAdd, not before It is implementation defined whether a multiply-add of (0,inf,qnan) or (inf,0,qnan) raises InvalidOperation or not, so we let the target-specific pickNaNMulAdd function handle this. This means that we must do the "return the default NaN in default NaN mode" check after the call, not before. Correct the ordering, and restore the comment from the old propagateFloat64MulAddNaN() that warned about this corner case. This fixes a regression from 2.11 for Arm guests where we would incorrectly fail to set the Invalid flag for these cases. Backports commit 1839189bbf89889076aadf0c793c1b57977b28d7 from qemu	2018-05-14 07:24:40 -04:00
Richard Henderson	76e343ef55	fpu: Bound increment for scalbn Without bounding the increment, we can overflow exp either here in scalbn_decomposed or when adding the bias in round_canonical. This can result in e.g. underflowing to 0 instead of overflowing to infinity. The old softfloat code did bound the increment. Backports commit ce8d4082054519f2eaac39958edde502860a7fc6 from qemu	2018-04-18 09:21:03 -04:00
Alex Bennée	af6a0b7c14	fpu/softfloat: check for Inf / x or 0 / x before /0 The re-factoring of div_floats changed the order of checking meaning an operation like -inf/0 erroneously raises the divbyzero flag. IEEE-754 (2008) specifies this should only occur for operations on finite operands. We fix this by moving the check on the dividend being Inf/0 to before the divisor is zero check. Backports commit 9cb4e398c2f95c1e837fe9c570e124a55259f725 from qemu	2018-04-18 09:19:28 -04:00
Alex Bennée	9517a002da	fpu/softfloat: raise float_invalid for NaN/Inf in round_to_int_and_pack The re-factor broke the raising of INVALID when NaN/Inf is passed to the float_to_int conversion functions. round_to_uint_and_pack got this right for NaN but also missed out the Inf handling. Fixes https://bugs.launchpad.net/qemu/+bug/1759264 Backports commit 801bc56336a127d9b351b3a2cc0336e4d0cb2686 from qemu	2018-04-16 13:48:17 -04:00
Emilio G. Cota	74b7fe484c	softfloat: fix {min, max}nummag for same-abs-value inputs Before 8936006 ("fpu/softfloat: re-factor minmax", 2018-02-21), we used to return +Zero for maxnummag(-Zero,+Zero); after that commit, we return -Zero. Fix it by making {min,max}nummag consistent with {min,max}num, deferring to the latter when the absolute value of the operands is the same. With this fix we now pass fp-test. Backports commit 6245327a367292b354489c54e965646823023919 from qemu	2018-04-16 13:46:29 -04:00
Richard Henderson	bc8e85a1c5	fpu: Fix rounding mode for floatN_to_uintM_round_to_zero We incorrectly passed in the current rounding mode instead of float_round_to_zero. Backports commit bd49e6027cbc207c87633c7add3ebd7d3474cd35 from qemu	2018-04-11 19:36:12 -04:00
Lioncash	ba874ef639	softfloat: Perform comparison pass with qemu Ensure that the code and formatting are similar.	2018-03-21 13:26:09 -04:00
Stef O'Rear	d25e1a6216	softfloat: fix crash on int conversion of SNaN Backports commit cffad426f520d39c19db64485d8a429fc95b8aca from qemu	2018-03-09 11:40:17 -05:00
Laurent Vivier	5fa3a97549	softfloat: use floatx80_infinity in softfloat Since f3218a8 ("softfloat: add floatx80 constants") floatx80_infinity is defined but never used. This patch updates floatx80 functions to use this definition. This allows to define a different default Infinity value on m68k: the m68k FPU defines infinity with all bits set to zero in the mantissa. Backports commit 0f605c889ca3fe9744166ad4149d0dff6dacb696 from qemu	2018-03-09 01:34:45 -05:00
Laurent Vivier	b42fcb5496	softfloat: export some functions Move fpu/softfloat-macros.h to include/fpu/ Export floatx80 functions to be used by target floatx80 specific implementations. Exports: propagateFloatx80NaN(), extractFloatx80Frac(), extractFloatx80Exp(), extractFloatx80Sign(), normalizeFloatx80Subnormal(), packFloatx80(), roundAndPackFloatx80(), normalizeRoundAndPackFloatx80() Also exports packFloat32() that will be used to implement m68k fsinh, fcos, fsin, ftan operations. Backports commit 88857aca93f6ec8f372fb9c8201394b0e5582034 from qemu	2018-03-09 01:22:00 -05:00
Alex Bennée	283abedc68	fpu/softfloat: re-factor sqrt This is a little bit of a departure from softfloat's original approach as we skip the estimate step in favour of a straight iteration. There is a minor optimisation to avoid calculating more bits of precision than we need however this still brings a performance drop, especially for float64 operations. Backports commit c13bb2da9eedfbc5886c8048df1bc1114b285fb0 from qemu	2018-03-08 12:23:54 -05:00
Alex Bennée	e2fb4b40c3	fpu/softfloat: re-factor compare The compare function was already expanded from a macro. I keep the macro expansion but move most of the logic into a compare_decomposed. Backports commit 0c4c90929143a530730e2879204a55a30bf63758 from qemu	2018-03-08 12:21:20 -05:00
Alex Bennée	c38b64f8a9	fpu/softfloat: re-factor minmax Let's do the same re-factor treatment for minmax functions. I still use the MACRO trick to expand but now all the checking code is common. Backports commit 89360067071b1844bf745682e18db7dde74cdb8d from qemu	2018-03-08 12:18:35 -05:00
Alex Bennée	9b296329f6	fpu/softfloat: re-factor scalbn This is one of the simpler manipulations you could make to a floating point number. Backports commit 0bfc9f195209593e91a98cf2233753f56a2e5c02 from qemu	2018-03-08 12:16:19 -05:00
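
Note on af6a0b7c14 (check for Inf / x or 0 / x before /0): the message describes a check-ordering fix, which the following minimal C sketch tries to illustrate. It is not the code from the commit. The enum names, flag values, and the function div_exception_flags are hypothetical, NaN operands are out of scope, and only the invalid and divide-by-zero flags are modelled.

```c
#include <assert.h>

/* Hypothetical operand classes and flag bits, for this sketch only. */
enum fp_class { CLS_ZERO, CLS_NORMAL, CLS_INF, CLS_NAN };
enum { FLAG_INVALID = 1, FLAG_DIVBYZERO = 2 };

/*
 * Return the invalid/divbyzero flags raised by a / b, given only the
 * operand classes. Other flags (inexact, overflow, underflow) are ignored.
 */
static int div_exception_flags(enum fp_class a, enum fp_class b)
{
    /* NaN propagation is outside the scope of this sketch. */
    assert(a != CLS_NAN && b != CLS_NAN);

    /* inf / inf and 0 / 0 are invalid operations. */
    if (a == b && (a == CLS_INF || a == CLS_ZERO)) {
        return FLAG_INVALID;
    }
    /*
     * inf / x and 0 / x give an exact inf or zero. Testing the dividend
     * here, before the divisor-is-zero test, keeps inf / 0 flag-free.
     */
    if (a == CLS_INF || a == CLS_ZERO) {
        return 0;
    }
    /* Only a finite, non-zero dividend divided by zero raises divbyzero. */
    if (b == CLS_ZERO) {
        return FLAG_DIVBYZERO;
    }
    return 0;
}
```

With this ordering, div_exception_flags(CLS_INF, CLS_ZERO) reports no flags, while div_exception_flags(CLS_NORMAL, CLS_ZERO) reports FLAG_DIVBYZERO. Swapping the last two tests would reproduce the erroneous divbyzero on -inf/0 that the commit message describes.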

91 Commits
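
Note on e7656cbd7a (Don't set Invalid for float-to-int(MAXINT)): the boundary case in that message can be checked with a small C sketch. It is an illustration under stated assumptions, not the softfloat implementation. The helper names double_from_bits and double_to_int32_rtz are hypothetical, the conversion truncates toward zero (the FCVTAS instruction in the message rounds to nearest, but the example input is already integral, so the rounding mode does not affect it), and NaN inputs are out of scope.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Reinterpret a 64-bit pattern as an IEEE 754 double (hypothetical helper). */
static double double_from_bits(uint64_t bits)
{
    double d;
    memcpy(&d, &bits, sizeof d);
    return d;
}

/*
 * Saturating double -> int32 conversion, truncating toward zero.
 * *invalid is set only when the truncated value does not fit, so an input
 * that converts exactly to INT32_MAX or INT32_MIN is not an overflow.
 */
static int32_t double_to_int32_rtz(double d, bool *invalid)
{
    /* NaN handling is target specific and left out of this sketch. */
    assert(d == d);

    *invalid = false;
    if (d >= 2147483648.0) {        /* 2^31 and above, including +inf */
        *invalid = true;
        return INT32_MAX;
    }
    if (d <= -2147483649.0) {       /* below -2^31, including -inf */
        *invalid = true;
        return INT32_MIN;
    }
    return (int32_t)d;              /* in range, so the cast is well defined */
}
```

The bit pattern 0x41dfffffffc00000 from the commit message decodes to exactly 2147483647.0, so double_to_int32_rtz returns 0x7fffffff with *invalid left false. An input of 2147483648.0 saturates to the same value but sets *invalid.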