XOP instruction set

The XOP instruction set, announced by AMD on May 1, 2009, is an extension to the 128-bit SSE core instructions in the x86 and AMD64 instruction set for the Bulldozer processor core, which was released on October 12, 2011. However AMD removed support for XOP from Zen onward.
The XOP instruction set contains several different types of vector instructions since it was originally intended as a major upgrade to SSE. Most of the instructions are integer instructions, but it also contains floating point permutation and floating point fraction extraction instructions. See the index for a list of instruction types.

History

XOP is a revised subset of what was originally intended as SSE5. It was changed to be similar but not overlapping with AVX, parts that overlapped with AVX were removed or moved to separate standards such as FMA4 and CVT16.
All SSE5 instructions that were equivalent or similar to instructions in the AVX and FMA4 instruction sets announced by Intel have been changed to use the coding proposed by Intel. Integer instructions without equivalents in AVX were classified as the XOP extension. The XOP instructions have an opcode byte 8F, but otherwise almost identical coding scheme as AVX with the 3-byte VEX prefix.
Commentators have seen this as evidence that Intel has not allowed AMD to use any part of the large VEX coding space. AMD has been forced to use different codes in order to avoid using any code combination that Intel might possibly be using in its development pipeline for something else. The XOP coding scheme is as close to the VEX scheme as technically possible without risking that the AMD codes overlap with future Intel codes. This inference is speculative, since no public information is available about negotiations between the two companies on this issue.
The use of the 8F byte requires that the m-bits have a value larger than or equal to 8 in order to avoid overlap with existing instructions. The C4 byte used in the VEX scheme has no such restriction. This may prevent the use of the m-bits for other purposes in the future in the XOP scheme, but not in the VEX scheme. Another possible problem is that the pp bits have the value 00 in the XOP scheme, while they have the value 01 in the VEX scheme for instructions that have no legacy equivalent. This may complicate the use of the pp bits for other purposes in the future.
A similar compatibility issue is the difference between the FMA3 and FMA4 instruction sets. Intel initially proposed FMA4 in AVX/FMA specification version 3 to supersede the 3-operand FMA proposed by AMD in SSE5. After AMD adopted FMA4, Intel canceled FMA4 support and reverted to FMA3 in the AVX/FMA specification version 5.
In March 2015, AMD explicitly revealed in the description of the patch for the GNU Binutils package that Zen, its third-generation x86-64 architecture in its first iteration, will not support TBM, FMA4, XOP and LWP instructions developed specifically for the "Bulldozer" family of micro-architectures.

Integer vector multiply–accumulate instructions

These are integer version of the FMA instruction set. These are all four operand instructions similar to FMA4 and they all operate on signed integers.

Instruction	Description	Operation
`VPMACSWW` `VPMACSSWW`	Multiply Accumulate Word to Word	2x8 words + 8 words → 8 words ,,..
`VPMACSWD` `VPMACSSWD`	Multiply Accumulate Low Word to Doubleword	2x8 words + 4 doublewords → 4 doublewords ,,.
`VPMACSDD` `VPMACSSDD`	Multiply Accumulate Doubleword to Doubleword	2x4 doublewords + 4 doublewords → 4 doublewords ,,..
`VPMACSDQL` `VPMACSSDQL`	Multiply Accumulate Low Doubleword to Quadword	2x4 doublewords + 2 quadwords → 2 quadwords ,
`VPMACSDQH` `VPMACSSDQH`	Multiply Accumulate High Doubleword to Quadword	2x4 doublewords + 2 quadwords → 2 quadwords ,
`VPMADCSWD` `VPMADCSSWD`	Multiply Add Accumulate Word to Doubleword	2x8 words + 4 doublewords → 4 doublewords , +,..

Integer vector horizontal addition

Horizontal addition instructions adds adjacent values in the input vector to each other. The output size in the instructions below describes how wide the horizontal addition performed is. For instance horizontal byte to word adds two bytes at a time and returns the result as vector of words, but byte to quadword adds eight bytes together at a time and returns the result as vector of quadwords. Six additional horizontal addition and subtraction instructions can be found in SSSE3, but they operate on two input vectors and only does two and two operations.

Instruction	Description	Operation
`VPHADDBW` `VPHADDUBW`	Horizontal add two signed/unsigned bytes to word	16 bytes → 8 words ,,,...
`VPHADDBD` `VPHADDUBD`	Horizontal add four signed/unsigned bytes to doubleword	16 bytes → 4 doublewords ,,...
`VPHADDBQ` `VPHADDUBQ`	Horizontal add eight signed/unsigned bytes to quadword	16 bytes → 2 quadwords ,...
`VPHADDWD` `VPHADDUWD`	Horizontal add two signed/unsigned words to doubleword	8 words → 4 doublewords ,,,...
`VPHADDWQ` `VPHADDUWQ`	Horizontal add four signed/unsigned words to quadword	8 words → 2 quadwords ,
`VPHADDDQ` `VPHADDUDQ`	Horizontal add two signed/unsigned doublewords to quadword	4 doublewords → 2 quadwords ,
`VPHSUBBW`	Horizontal subtract two signed bytes to word	16 bytes → 8 words ,,,...
`VPHSUBWD`	Horizontal subtract two signed words to doubleword	8 words → 4 doublewords ,,,...
`VPHSUBDQ`	Horizontal subtract two signed doublewords to quadword	4 doublewords → 2 quadwords ,

Integer vector compare

This set of vector compare instructions all take an immediate as an extra argument. The immediate controls what kind of comparison is performed. There are eight comparison possible for each instruction. The vectors are compared and all comparisons that evaluate to true set all corresponding bits in the destination to 1, and false comparisons sets all the same bits to 0. This result can be used directly in VPCMOV instruction for a vectorized conditional move.

Instruction	Description	Immediate	Comparison
`VPCOMB`	Compare Vector Signed Bytes	000	Less Than
`VPCOMW`	Compare Vector Signed Words	001	Less Than or Equal
`VPCOMD`	Compare Vector Signed Doublewords	010	Greater Than
`VPCOMQ`	Compare Vector Signed Quadwords	011	Greater Than or Equal
`VPCOMUB`	Compare Vector Unsigned Bytes	100	Equal
`VPCOMUW`	Compare Vector Unsigned Words	101	Not Equal
`VPCOMUD`	Compare Vector Unsigned Doublewords	110	False
`VPCOMUQ`	Compare Vector Unsigned Quadwords	111	True

Vector conditional move

VPCMOV works as bitwise variant of the blend instructions in SSE4. For each bit in the selector 1 selects the same bit in the first source, and 0 selects the same in the second source. When used together with the XOP vector comparison instructions above this can be used to implement a vectorized ternary move, or if the second input is the same as the destination, a conditional move.

Instruction	Description
`VPCMOV`	Vector Conditional Move

Integer vector shift and rotate instructions

The shift instructions here differ from those in SSE2 in that they can shift each unit with a different amount using a vector register interpreted as packed signed integers. The sign indicates the direction of shift or rotate, with positive values causing left shift and negative right shift
Intel has specified a different incompatible set of variable vector shift instructions in AVX2.

Instruction	Description
`VPROTB`	Packed Rotate Bytes
`VPROTW`	Packed Rotate Words
`VPROTD`	Packed Rotate Doublewords
`VPROTQ`	Packed Rotate Quadwords
`VPSHAB`	Packed Shift Arithmetic Bytes
`VPSHAW`	Packed Shift Arithmetic Words
`VPSHAD`	Packed Shift Arithmetic Doublewords
`VPSHAQ`	Packed Shift Arithmetic Quadwords
`VPSHLB`	Packed Shift Logical Bytes
`VPSHLW`	Packed Shift Logical Words
`VPSHLD`	Packed Shift Logical Doublewords
`VPSHLQ`	Packed Shift Logical Quadwords

Vector permute

VPPERM is a single instruction that combines the SSSE3 instruction PALIGNR and PSHUFB and adds more to both. Some compare it the Altivec instruction VPERM. It takes three registers as input, the first two are source registers and the third the selector register. Each byte in the selector selects one of the bytes in one of the two input registers for the output. The selector can also apply effects on the selected bytes such as setting it to 0, reverse the bit order, and repeating most signicating bit. All of the effects or the input can in addition be inverted.
The VPERMIL2PD and VPERMIL2PS instructions are two source versions of the VPERMILPD and VPERMILPS instructions in AVX which means like VPPERM they can select output from any of the fields in the two inputs.

Instruction	Description
`VPPERM`	Packed Permute Byte
`VPERMIL2PD`	Permute Two-Source Double-Precision Floating-Point
`VPERMIL2PS`	Permute Two-Source Single-Precision Floating-Point

Floating-point fraction extraction

These instructions extracts the fractional part of floating point, that is the part that would be lost in conversion to integer.

Instruction	Description
`VFRCZPD`	Extract Fraction Packed Double-Precision Floating-Point
`VFRCZPS`	Extract Fraction Packed Single-Precision Floating-Point
`VFRCZSD`	Extract Fraction Scalar Double-Precision Floating-Point
`VFRCZSS`	Extract Fraction Scalar Single-Precision Floating Point

CPUs with XOP

AMD:
* "Heavy Equipment" processors
** Bulldozer-based processors, Q4 2011
** Piledriver-based processors, Q4 2012
** Steamroller-based processors, Q1 2014
** Excavator-based processors, 2015

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...