calling convention notice
This commit is contained in:
parent
44b53e4099
commit
f2020f7463
@ -37,7 +37,8 @@ pub fn RectSIMD(comptime T: type) type {
|
|||||||
### Assembly ###
|
### Assembly ###
|
||||||
This is produced by godbolt, which apparently has AVX512 extensions, so, it's extremely compact.
|
This is produced by godbolt, which apparently has AVX512 extensions, so, it's extremely compact.
|
||||||
|
|
||||||
note: SysV prelude and outro are omitted, with inlining you can expect it looking similarly.
|
Note: Calling prelude and outro are omitted, with inlining you can expect it looking similarly.
|
||||||
|
Zig calling convention is used, which is roughly equal to C's static marked procedure.
|
||||||
|
|
||||||
For 32bit floating point:
|
For 32bit floating point:
|
||||||
```asm
|
```asm
|
||||||
@ -148,9 +149,8 @@ For 32bit signed integers it fares amazing too:
|
|||||||
|
|
||||||
AVX512 makes it so that there's no big penalty for double precision types, which is nice.
|
AVX512 makes it so that there's no big penalty for double precision types, which is nice.
|
||||||
|
|
||||||
Note that permutation masks are also supplied along side code which increase binary size.
|
|
||||||
With inlining it could be quite substantial if per object %rip relative addressing is used.
|
|
||||||
|
|
||||||
### Edits ###
|
### Edits ###
|
||||||
- Reordered to use packed vectors without swizzling when possible.
|
- Reordered to use packed vectors without swizzling when possible.
|
||||||
- Eliminated redundant computations.
|
- Eliminated redundant computations.
|
||||||
|
- Calling convention notice.
|
||||||
|
|
||||||
|
Loading…
Reference in New Issue
Block a user