Concatenative topics
Concatenative meta
Other languages
Meta
This is a quick reference for Intel's Streaming SIMD Extensions. Feel free to make additions or corrections!
The vector types here are named with the same convention as in Factor's SIMD library. It should be obvious what they mean:
The number next to each instruction is the SSE version:
There are many more instructions that do not fit in this grid, but these are the most important ones to know.
| char-16 | uchar-16 | short-8 | ushort-8 | int-4 | uint-4 | float-4 | double-2 | |
| move | MOVQ 2 | MOVQ 2 | MOVQ 2 | MOVQ 2 | MOVQ 2 | MOVQ 2 | MOVPS 1 | MOVPD 2 | 
| add | PADDB 2 | PADDB 2 | PADDW 2 | PADDW 2 | PADDD 2 | PADDD 2 | ADDPS 1 | ADDPD 2 | 
| subtract | PSUBB 2 | PSUBB 2 | PSUBW 2 | PSUBW 2 | PSUBD 2 | PSUBD 2 | SUBPS 1 | SUBPD 2 | 
| add with saturation | PADDSB 2 | PADDUSB 2 | PADDSW 2 | PADDUSW 2 | ||||
| subtract with saturation | PSUBSB 2 | PSUBUSB 2 | PSUBSW 2 | PSUBUSW 2 | ||||
| add-subtract | ADDSUBPS 3 | ADDSUBPD 3 | ||||||
| horizontal add | PHADDW 3.3 | PHADDW 3.3 | PHADDD 3.3 | PHADDW 3.3 | HADDPS 3 | HADDPS 3 | ||
| multiply | PMULLW 2 | PMULLW 2 | PMULLD 2 | PMULLD 2 | MULPS 1 | MULPD 2 | ||
| divide | DIVPS 1 | DIVPD 2 | ||||||
| absolute value | PABSB 3.3 | PABSW 3.3 | PABSD 3.3 | |||||
| minimum | PMINUB 2 | PMINSW 2 | MINPS 1 | MINPD 2 | ||||
| maximum | PMAXUB 2 | PMAXSW 2 | MAXPS 1 | MAXPD 2 | ||||
| approximate reciprocal | RCPSS 1 | RCPSD 2 | ||||||
| square root | SQRTSS 1 | SQRTSD 2 | 
For full details, consult Intel's or AMD's instruction set reference documentation.
This revision created on Wed, 23 Sep 2009 00:25:31 by slava