Context Navigation

Changes between Version 34 and Version 35 of GcnInstrsVop3

Timestamp:: 11/26/17 10:00:26 (6 years ago)
Author:: trac
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

GcnInstrsVop3

-                      v34
+                      v35
 <tr>
 <th>Opcode</th>
+<th>GCN 1.2</th>
+<th>GCN 1.4</th>
+<th>Mnemonic</th>
+<th>Mnemonic (GCN 1.4)</th>
+<th>Mnemonic (GCN 1.4)</th>
 </tr>
 </thead>
 …
 <tr>
 <td>448 (0x1c0)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_LEGACY_F32</td>
+<td>V_MAD_LEGACY_F32</td>
 </tr>
 <tr>
 <td>449 (0x1c1)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_F32</td>
+<td>V_MAD_F32</td>
 </tr>
 <tr>
 <td>450 (0x1c2)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_I32_I24</td>
+<td>V_MAD_I32_I24</td>
 </tr>
 <tr>
 <td>451 (0x1c3)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_U32_U24</td>
+<td>V_MAD_U32_U24</td>
 </tr>
 <tr>
 <td>452 (0x1c4)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CUBEID_F32</td>
+<td>V_CUBEID_F32</td>
 </tr>
 <tr>
 <td>453 (0x1c5)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CUBESC_F32</td>
+<td>V_CUBESC_F32</td>
 </tr>
 <tr>
 <td>454 (0x1c6)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CUBETC_F32</td>
+<td>V_CUBETC_F32</td>
 </tr>
 <tr>
 <td>455 (0x1c7)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CUBEMA_F32</td>
+<td>V_CUBEMA_F32</td>
 </tr>
 <tr>
 <td>456 (0x1c8)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_BFE_U32</td>
+<td>V_BFE_U32</td>
 </tr>
 <tr>
 <td>457 (0x1c9)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_BFE_I32</td>
+<td>V_BFE_I32</td>
 </tr>
 <tr>
 <td>458 (0x1ca)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_BFI_B32</td>
+<td>V_BFI_B32</td>
 </tr>
 <tr>
 <td>459 (0x1cb)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_FMA_F32</td>
+<td>V_FMA_F32</td>
 </tr>
 <tr>
 <td>460 (0x1cc)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_FMA_F64</td>
+<td>V_FMA_F64</td>
 </tr>
 <tr>
 <td>461 (0x1cd)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_LERP_U8</td>
+<td>V_LERP_U8</td>
 </tr>
 <tr>
 <td>462 (0x1ce)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_ALIGNBIT_B32</td>
+<td>V_ALIGNBIT_B32</td>
 </tr>
 <tr>
 <td>463 (0x1cf)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_ALIGNBYTE_B32</td>
+<td>V_ALIGNBYTE_B32</td>
 </tr>
 <tr>
 <td>464 (0x1d0)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MIN3_F32</td>
+<td>V_MIN3_F32</td>
 </tr>
 <tr>
 <td>465 (0x1d1)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MIN3_I32</td>
+<td>V_MIN3_I32</td>
 </tr>
 <tr>
 <td>466 (0x1d2)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MIN3_U32</td>
+<td>V_MIN3_U32</td>
 </tr>
 <tr>
 <td>467 (0x1d3)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAX3_F32</td>
+<td>V_MAX3_F32</td>
 </tr>
 <tr>
 <td>468 (0x1d4)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAX3_I32</td>
+<td>V_MAX3_I32</td>
 </tr>
 <tr>
 <td>469 (0x1d5)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAX3_U32</td>
+<td>V_MAX3_U32</td>
 </tr>
 <tr>
 <td>470 (0x1d6)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MED3_F32</td>
+<td>V_MED3_F32</td>
 </tr>
 <tr>
 <td>471 (0x1d7)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MED3_I32</td>
+<td>V_MED3_I32</td>
 </tr>
 <tr>
 <td>472 (0x1d8)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MED3_U32</td>
+<td>V_MED3_U32</td>
 </tr>
 <tr>
 <td>473 (0x1d9)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_SAD_U8</td>
+<td>V_SAD_U8</td>
 </tr>
 <tr>
 <td>474 (0x1da)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_SAD_HI_U8</td>
+<td>V_SAD_HI_U8</td>
 </tr>
 <tr>
 <td>475 (0x1db)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_SAD_U16</td>
+<td>V_SAD_U16</td>
 </tr>
 <tr>
 <td>476 (0x1dc)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_SAD_U32</td>
+<td>V_SAD_U32</td>
 </tr>
 <tr>
 <td>477 (0x1dd)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CVT_PK_U8_F32</td>
+<td>V_CVT_PK_U8_F32</td>
 </tr>
 <tr>
 <td>478 (0x1de)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_DIV_FIXUP_F32</td>
+<td>V_DIV_FIXUP_F32</td>
 </tr>
 <tr>
 <td>479 (0x1df)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_DIV_FIXUP_F64</td>
+<td>V_DIV_FIXUP_F64</td>
 </tr>
 <tr>
 <td>480 (0x1e0)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_DIV_SCALE_F32 (VOP3B)</td>
+<td>V_DIV_SCALE_F32 (VOP3B)</td>
 </tr>
 <tr>
 <td>481 (0x1e1)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_DIV_SCALE_F64 (VOP3B)</td>
+<td>V_DIV_SCALE_F64 (VOP3B)</td>
 </tr>
 <tr>
 <td>482 (0x1e2)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_DIV_FMAS_F32</td>
+<td>V_DIV_FMAS_F32</td>
 </tr>
 <tr>
 <td>483 (0x1e3)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_DIV_FMAS_F64</td>
+<td>V_DIV_FMAS_F64</td>
 </tr>
 <tr>
 <td>484 (0x1e4)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MSAD_U8</td>
+<td>V_MSAD_U8</td>
 </tr>
 <tr>
 <td>485 (0x1e5)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_QSAD_PK_U16_U8</td>
+<td>V_QSAD_PK_U16_U8</td>
 </tr>
 <tr>
 <td>486 (0x1e6)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MQSAD_PK_U16_U8</td>
+<td>V_MQSAD_PK_U16_U8</td>
 </tr>
 <tr>
 <td>487 (0x1e7)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MQSAD_U32_U8</td>
+<td>V_MQSAD_U32_U8</td>
 </tr>
 <tr>
 <td>488 (0x1e8)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_U64_U32 (VOP3B)</td>
+<td>V_MAD_U64_U32 (VOP3B)</td>
 </tr>
 <tr>
 <td>489 (0x1e9)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_I64_I32 (VOP3B)</td>
+<td>V_MAD_I64_I32 (VOP3B)</td>
 </tr>
 <tr>
 <td>490 (0x1ea)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_F16</td>
+<td>V_MAD_LEGACY_F16</td>
 </tr>
 <tr>
 <td>491 (0x1eb)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_U16</td>
+<td>V_MAD_LEGACY_U16</td>
 </tr>
 <tr>
 <td>492 (0x1ec)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAD_I16</td>
+<td>V_MAD_LEGACY_I16</td>
 </tr>
 <tr>
 <td>493 (0x1ed)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_PERM_B32</td>
+<td>V_PERM_B32</td>
 </tr>
 <tr>
 <td>494 (0x1ee)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_FMA_F16</td>
+<td>V_FMA_LEGACY_F16</td>
 </tr>
 <tr>
 <td>495 (0x1ef)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_DIV_FIXUP_F16</td>
+<td>V_DIV_FIXUP_LEGACY_F16</td>
 </tr>
 <tr>
 <td>496 (0x1f0)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CVT_PKACCUM_U8_F32</td>
+<td>V_CVT_PKACCUM_U8_F32</td>
 </tr>
 <tr>
 <td>497 (0x1f1)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MAD_U32_U16</td>
 </tr>
 <tr>
 <td>498 (0x1f2)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MAD_I32_I16</td>
 </tr>
 <tr>
 <td>499 (0x1f3)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_XAD_U32</td>
 </tr>
 <tr>
 <td>500 (0x1f4)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MIN3_F16</td>
 </tr>
 <tr>
 <td>501 (0x1f5)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MIN3_I16</td>
 </tr>
 <tr>
 <td>502 (0x1f6)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MIN3_U16</td>
 </tr>
 <tr>
 <td>503 (0x1f7)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MAX3_F16</td>
 </tr>
 <tr>
 <td>504 (0x1f8)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MAX3_I16</td>
 </tr>
 <tr>
 <td>505 (0x1f9)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MAX3_U16</td>
 </tr>
 <tr>
 <td>506 (0x1fa)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MED3_F16</td>
 </tr>
 <tr>
 <td>507 (0x1fb)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MED3_I16</td>
 </tr>
 <tr>
 <td>508 (0x1fc)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_MED3_U16</td>
 </tr>
 <tr>
 <td>509 (0x1fd)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_LSHL_ADD_U32</td>
 </tr>
 <tr>
 <td>510 (0x1fe)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_ADD_LSHL_U32</td>
 </tr>
 <tr>
 <td>511 (0x1ff)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_ADD3_U32</td>
 </tr>
 <tr>
+<td>512 (0x200)</td>
+<td>--</td>
+<td>V_LSHL_OR_B32</td>
+</tr>
+<tr>
+<td>513 (0x201)</td>
+<td>--</td>
+<td>V_AND_OR_B32</td>
+</tr>
+<tr>
+<td>514 (0x202)</td>
+<td>--</td>
+<td>V_OR3_B32</td>
+</tr>
+<tr>
+<td>515 (0x203)</td>
+<td>--</td>
+<td>V_MAD_F16</td>
+</tr>
+<tr>
+<td>516 (0x204)</td>
+<td>--</td>
+<td>V_MAD_U16</td>
+</tr>
+<tr>
+<td>517 (0x205)</td>
+<td>--</td>
+<td>V_MAD_I16</td>
+</tr>
+<tr>
+<td>518 (0x206)</td>
+<td>--</td>
+<td>V_FMA_F16</td>
+</tr>
+<tr>
+<td>519 (0x207)</td>
+<td>--</td>
+<td>V_DIV_FIXUP_F16</td>
+</tr>
+<tr>
 <td>624 (0x270)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_INTERP_P1_F32 (VINTRP)</td>
+<td>V_INTERP_P1_F32 (VINTRP)</td>
 </tr>
 <tr>
 <td>625 (0x271)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_INTERP_P2_F32 (VINTRP)</td>
+<td>V_INTERP_P2_F32 (VINTRP)</td>
 </tr>
 <tr>
 <td>626 (0x272)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_INTERP_MOV_F32 (VINTRP)</td>
+<td>V_INTERP_MOV_F32 (VINTRP)</td>
 </tr>
 <tr>
 <td>627 (0x273)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_INTERP_P1LL_F16 (VINTRP)</td>
+<td>V_INTERP_P1LL_F16 (VINTRP)</td>
 </tr>
 <tr>
 <td>628 (0x274)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_INTERP_P1LV_F16 (VINTRP)</td>
+<td>V_INTERP_P1LV_F16 (VINTRP)</td>
 </tr>
 <tr>
 <td>629 (0x275)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_INTERP_P2_F16 (VINTRP)</td>
+<td>V_INTERP_P2_F16 (VINTRP)</td>
 </tr>
 <tr>
 <td>640 (0x280)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_ADD_F64</td>
+<td>V_ADD_F64</td>
 </tr>
 <tr>
 <td>641 (0x281)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MUL_F64</td>
+<td>V_MUL_F64</td>
 </tr>
 <tr>
 <td>642 (0x282)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MIN_F64</td>
+<td>V_MIN_F64</td>
 </tr>
 <tr>
 <td>643 (0x283)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAX_F64</td>
+<td>V_MAX_F64</td>
 </tr>
 <tr>
 <td>644 (0x284)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_LDEXP_F64</td>
+<td>V_LDEXP_F64</td>
 </tr>
 <tr>
 <td>645 (0x285)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MUL_LO_U32</td>
+<td>V_MUL_LO_U32</td>
 </tr>
 <tr>
 <td>646 (0x286)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MUL_HI_U32</td>
+<td>V_MUL_HI_U32</td>
 </tr>
 <tr>
 <td>647 (0x287)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MUL_HI_I32</td>
+<td>V_MUL_HI_I32</td>
 </tr>
 <tr>
 <td>648 (0x288)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_LDEXP_F32</td>
+<td>V_LDEXP_F32</td>
 </tr>
 <tr>
 <td>649 (0x289)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_READLANE_B32</td>
+<td>V_READLANE_B32</td>
 </tr>
 <tr>
 <td>650 (0x28a)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_WRITELANE_B32</td>
+<td>V_WRITELANE_B32</td>
 </tr>
 <tr>
 <td>651 (0x28b)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_BCNT_U32_B32</td>
+<td>V_BCNT_U32_B32</td>
 </tr>
 <tr>
 <td>652 (0x28c)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MBCNT_LO_U32_B32</td>
+<td>V_MBCNT_LO_U32_B32</td>
 </tr>
 <tr>
 <td>653 (0x28d)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MBCNT_HI_U32_B32</td>
+<td>V_MBCNT_HI_U32_B32</td>
 </tr>
 <tr>
 <td>654 (0x28e)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_MAC_LEGACY_F32</td>
+<td>V_MAC_LEGACY_F32</td>
 </tr>
 <tr>
 <td>655 (0x28f)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_LSHLREV_B64</td>
+<td>V_LSHLREV_B64</td>
 </tr>
 <tr>
 <td>656 (0x290)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_LSHRREV_B64</td>
+<td>V_LSHRREV_B64</td>
 </tr>
 <tr>
 <td>657 (0x291)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_ASHRREV_I64</td>
+<td>V_ASHRREV_I64</td>
 </tr>
 <tr>
 <td>658 (0x292)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_TRIG_PREOP_F64</td>
+<td>V_TRIG_PREOP_F64</td>
 </tr>
 <tr>
 <td>659 (0x293)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_BFM_B32</td>
+<td>V_BFM_B32</td>
 </tr>
 <tr>
 <td>660 (0x294)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CVT_PKNORM_I16_F32</td>
+<td>V_CVT_PKNORM_I16_F32</td>
 </tr>
 <tr>
 <td>661 (0x295)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CVT_PKNORM_U16_F32</td>
+<td>V_CVT_PKNORM_U16_F32</td>
 </tr>
 <tr>
 <td>662 (0x296)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CVT_PKRTZ_F16_F32</td>
+<td>V_CVT_PKRTZ_F16_F32</td>
 </tr>
 <tr>
 <td>663 (0x297)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CVT_PK_U16_U32</td>
+<td>V_CVT_PK_U16_U32</td>
 </tr>
 <tr>
 <td>664 (0x298)</td>
-<td>✓</td>
-<td>✓</td>
 <td>V_CVT_PK_I16_I32</td>
+<td>V_CVT_PK_I16_I32</td>
 </tr>
 <tr>
 <td>665 (0x299)</td>
-<td></td>
-<td>✓</td>
 <td>V_CVT_PKNORM_I16_F16</td>
+<td>V_CVT_PKNORM_I16_F16</td>
 </tr>
 <tr>
 <td>666 (0x29a)</td>
-<td></td>
-<td>✓</td>
 <td>V_CVT_PKNORM_U16_F16</td>
+<td>V_CVT_PKNORM_U16_F16</td>
 </tr>
 <tr>
 <td>667 (0x29b)</td>
-<td></td>
-<td>✓</td>
 <td>V_READLANE_REGRD_B32</td>
+<td>V_READLANE_REGRD_B32</td>
 </tr>
 <tr>
 <td>668 (0x29c)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_ADD_I32</td>
 </tr>
 <tr>
 <td>669 (0x29d)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_SUB_I32</td>
 </tr>
 <tr>
 <td>670 (0x29e)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_ADD_I16</td>
 </tr>
 <tr>
 <td>671 (0x29f)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_SUB_I16</td>
 </tr>
 <tr>
 <td>672 (0x2a0)</td>
+<td></td>
+<td>✓</td>
+<td>--</td>
 <td>V_PACK_B32_F16</td>
 </tr>
 …
 <p>Alphabetically sorted instruction list:</p>
 <h4>V_ADD_F64</h4>
 <p>Opcode: 356 (0x164) for GCN 1.0/1.1; 640 (0x280) for GCN 1.2<br />
+<p>Opcode: 356 (0x164) for GCN 1.0/1.1; 640 (0x280) for GCN 1.2/1.4<br />
 Syntax: V_ADD_F64 VDST(2), SRC0(2), SRC1(2)<br />
 Description: Add two double FP value from SRC0 and SRC1 and store result to VDST.<br />
 …
 <code>VDST = (SRC0 + SRC1) &lt;&lt; (SRC2&amp;31)</code></p>
 <h4>V_ALIGNBIT_B32</h4>
 <p>Opcode: 334 (0x14e) for GCN 1.0/1.1; 462 (0x1ce) for GCN 1.2<br />
+<p>Opcode: 334 (0x14e) for GCN 1.0/1.1; 462 (0x1ce) for GCN 1.2/1.4<br />
 Syntax: V_ALIGNBIT_B32 VDST, SRC0, SRC1, SRC2<br />
 Description: Align bit. Shift right bits in 64-bit stored in SRC1 (low part) and
 …
 <code>VDST = (((UINT64)SRC0)&lt;&lt;32) | SRC1) &gt;&gt; (SRC2&amp;31)</code></p>
 <h4>V_ALIGNBYTE_B32</h4>
 <p>Opcode: 335 (0x14f) for GCN 1.0/1.1; 463 (0x1cf) for GCN 1.2<br />
+<p>Opcode: 335 (0x14f) for GCN 1.0/1.1; 463 (0x1cf) for GCN 1.2/1.4<br />
 Syntax: V_ALIGNBYTE_B32 VDST, SRC0, SRC1, SRC2<br />
 Description: Align bit. Shift right bits in 64-bit stored in SRC1 (low part) and
 …
 Operation:<br />
 <code>VDST = (((UINT64)SRC0)&lt;&lt;32) | SRC1) &gt;&gt; ((SRC2&amp;3)*8)</code></p>
+<h4>V_AND_OR_B32</h4>
+<p>Opcode: 513 (0x201) for GCN 1.4<br />
+Syntax: V_AND_OR_B32 VDST, SRC0, SRC1, SRC2<br />
+Description: Make btwise AND with SRC0 and SRC1, make bitwise OR with result and SRC2
+and store result to VDST.<br />
+Operation:<br />
+<code>VDST = (SRC0 &amp; SRC1) | SRC2</code></p>
 <h4>V_ASHR_I64</h4>
 <p>Opcode: 355 (0x163) for GCN 1.0/1.1<br />
 …
 <code>VDST = (INT64)SRC0 &gt;&gt; (SRC1&amp;63)</code></p>
 <h4>V_ASHRREV_I64</h4>
 <p>Opcode: 657 (0x291) for GCN 1.2<br />
+<p>Opcode: 657 (0x291) for GCN 1.2/1.4<br />
 Syntax: V_ASHRREV_I32 VDST(2), SRC0, SRC1(2)<br />
 Description: Arithmetic shift right SRC1 by (SRC0&amp;63) bits and store result into VDST.<br />
 …
 <code>VDST = (INT64)SRC0 &gt;&gt; (SRC0&amp;63)</code></p>
 <h4>V_BCNT_U32_B32</h4>
 <p>Opcode: 651 (0x28b) for GCN 1.2<br />
+<p>Opcode: 651 (0x28b) for GCN 1.2/1.4<br />
 Syntax: V_BCNT_U32_B32 VDST, SRC0, SRC1<br />
 Description: Count bits in SRC0, adds SRC1, and store result to VDST.<br />
 …
 <code>VDST = SRC1 + BITCOUNT(SRC0)</code></p>
 <h4>V_BFE_I32</h4>
 <p>Opcode: 329 (0x149) for GCN 1.0/1.1; 457 (0x1c9) for GCN 1.2<br />
+<p>Opcode: 329 (0x149) for GCN 1.0/1.1; 457 (0x1c9) for GCN 1.2/1.4<br />
 Syntax: V_BFE_I32 VDST, SRC0, SRC1, SRC2<br />
 Description: Extracts bits in SRC0 from range (SRC1&amp;31) with length (SRC2&amp;31)
 …
     VDST = (INT32)SRC0 &gt;&gt; shift</code></p>
 <h4>V_BFE_U32</h4>
 <p>Opcode: 328 (0x148) for GCN 1.0/1.1; 456 (0x1c8) for GCN 1.2<br />
+<p>Opcode: 328 (0x148) for GCN 1.0/1.1; 456 (0x1c8) for GCN 1.2/1.4<br />
 Syntax: V_BFE_U32 VDST, SRC0, SRC1, SRC2<br />
 Description: Extracts bits in SRC0 from range SRC1&amp;31 with length SRC2&amp;31, and
 …
     VDST = SRC0 &gt;&gt; shift</code></p>
 <h4>V_BFI_B32</h4>
 <p>Opcode: 330 (0x14a) for GCN 1.0/1.1; 458 (0x1ca) for GCN 1.2<br />
+<p>Opcode: 330 (0x14a) for GCN 1.0/1.1; 458 (0x1ca) for GCN 1.2/1.4<br />
 Syntax: V_BFI_B32 VDST, SRC0, SRC1, SRC2<br />
 Description: Replace bits in SRC2 by bits from SRC1 marked by bits in SRC0, and store result
 …
 <code>VDST = (SRC0 &amp; SRC1) | (~SRC0 &amp; SRC2)</code></p>
 <h4>V_BFM_B32</h4>
 <p>Opcode: 659 (0x293) for GCN 1.2<br />
+<p>Opcode: 659 (0x293) for GCN 1.2/1.4<br />
 Syntax: V_BFM_B32 VDST, SRC0, SRC1<br />
 Description: Make 32-bit bitmask from (SRC1 &amp; 31) bit that have length (SRC0 &amp; 31) and
 …
 <code>VDST = ((1U &lt;&lt; (SRC0&amp;31))-1) &lt;&lt; (SRC1&amp;31)</code></p>
 <h4>V_CUBEID_F32</h4>
 <p>Opcode: 324 (0x144) for GCN 1.0/1.1; 452 (0x1c4) for GCN 1.2<br />
+<p>Opcode: 324 (0x144) for GCN 1.0/1.1; 452 (0x1c4) for GCN 1.2/1.4<br />
 Syntax: V_CUBEID_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Cubemap face identification. Determine face by comparing three single FP
 …
 VDST = OUT</code></p>
 <h4>V_CUBEMA_F32</h4>
 <p>Opcode: 327 (0x147) for GCN 1.0/1.1; 455 (0x1c7) for GCN 1.2<br />
+<p>Opcode: 327 (0x147) for GCN 1.0/1.1; 455 (0x1c7) for GCN 1.2/1.4<br />
 Syntax: V_CUBEMA_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Cubemap Major Axis. Choose highest absolute value from all three FP values
 …
 VDST = OUT</code></p>
 <h4>V_CUBESC_F32</h4>
 <p>Opcode: 325 (0x145) for GCN 1.0/1.1; 453 (0x1c5) for GCN 1.2<br />
+<p>Opcode: 325 (0x145) for GCN 1.0/1.1; 453 (0x1c5) for GCN 1.2/1.4<br />
 Syntax: V_CUBESC_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Cubemap S coordination. Algorithm below.<br />
 …
 VDST = OUT</code></p>
 <h4>V_CUBETC_F32</h4>
 <p>Opcode: 326 (0x146) for GCN 1.0/1.1; 454 (0x1c6) for GCN 1.2<br />
+<p>Opcode: 326 (0x146) for GCN 1.0/1.1; 454 (0x1c6) for GCN 1.2/1.4<br />
 Syntax: V_CUBETC_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Cubemap T coordination. Algorithm below.<br />
 …
 VDST = OUT</code></p>
 <h4>V_CVT_PK_I16_I32</h4>
 <p>Opcode: 664 (0x298) for GCN 1.2<br />
+<p>Opcode: 664 (0x298) for GCN 1.2/1.4<br />
 Syntax: V_CVT_PK_I16_I32 VDST, SRC0, SRC1<br />
 Description: Convert signed value from SRC0 and SRC1 to signed 16-bit values with
 …
 VDST = D0 | (((UINT32)D1) &lt;&lt; 16)</code></p>
 <h4>V_CVT_PK_U16_U32</h4>
 <p>Opcode: 663 (0x297) for GCN 1.2<br />
+<p>Opcode: 663 (0x297) for GCN 1.2/1.4<br />
 Syntax: V_CVT_PK_U16_U32 VDST, SRC0, SRC1<br />
 Description: Convert unsigned value from SRC0 and SRC1 to unsigned 16-bit values with
 …
 VDST = D0 | (((UINT32)D1) &lt;&lt; 16)</code></p>
 <h4>V_CVT_PK_U8_F32</h4>
 <p>Opcode: 350 (0x15e) for GCN 1.0/1.1; 477 (0x1dd) for GCN 1.2<br />
+<p>Opcode: 350 (0x15e) for GCN 1.0/1.1; 477 (0x1dd) for GCN 1.2/1.4<br />
 Syntax: V_CVT_PK_U8_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Convert floating point value from SRC0 to unsigned byte value with
 …
 VDST = (SRC2&amp;~mask) | (((UINT32)VAL8) &lt;&lt; shift)</code></p>
 <h4>V_CVT_PKACCUM_U8_F32</h4>
 <p>Opcode: 496 (0x1f0) for GCN 1.2<br />
+<p>Opcode: 496 (0x1f0) for GCN 1.2/1.4<br />
 Syntax: V_CVT_PKACCUM_U8_F32 VDST, SRC0, SRC1<br />
 Description: Convert floating point value from SRC0 to unsigned byte value with
 …
 VDST = roundNorm(ASHALF(SRC0)) | ((UINT32)roundNorm(ASHALF(SRC1)) &lt;&lt; 16)</code></p>
 <h4>V_CVT_PKNORM_I16_F32</h4>
 <p>Opcode: 660 (0x294) for GCN 1.2<br />
+<p>Opcode: 660 (0x294) for GCN 1.2/1.4<br />
 Syntax: V_CVT_PKNORM_I16_F32 VDST, SRC0, SRC1<br />
 Description: Convert normalized FP value from SRC0 and SRC1 to signed 16-bit integers with
 …
 VDST = roundNorm(ASHALF(SRC0)) | ((UINT32)roundNorm(ASHALF(SRC1)) &lt;&lt; 16)</code></p>
 <h4>V_CVT_PKNORM_U16_F32</h4>
 <p>Opcode: 661 (0x295) for GCN 1.2<br />
+<p>Opcode: 661 (0x295) for GCN 1.2/1.4<br />
 Syntax: V_CVT_PKNORM_U16_F32 VDST, SRC0, SRC1<br />
 Description: Convert normalized FP value from SRC0 and SRC1 to unsigned 16-bit integers with
 …
 VDST = roundNorm(ASFLOAT(SRC0)) | ((UINT32)roundNorm(ASFLOAT(SRC1)) &lt;&lt; 16)</code></p>
 <h4>V_CVT_PKRTZ_F16_F32</h4>
 <p>Opcode: 662 (0x296) for GCN 1.2<br />
+<p>Opcode: 662 (0x296) for GCN 1.2/1.4<br />
 Syntax: V_CVT_PKRTZ_F16_F32 VDST, SRC0, SRC1<br />
 Description: Convert normalized FP value from SRC0 and SRC1 to half floating points with
 …
 VDST = D0 | (((UINT32)D1) &lt;&lt; 16)</code></p>
 <h4>V_DIV_FIXUP_F16</h4>
 <p>Opcode: 495 (0x1ef) for GCN 1.2<br />
+<p>Opcode: 495 (0x1ef) for GCN 1.2; 519 (0x207) for GCN 1.4<br />
 Syntax: V_DIV_FIXUP_F16 VDST, SRC0, SRC1, SRC2<br />
 Description: Handle all exceptions requires for half floating point division.
 …
     VDST = SF0</code></p>
 <h4>V_DIV_FIXUP_F32</h4>
 <p>Opcode: 351 (0x15f) for GCN 1.0/1.1; 478 (0x1de) for GCN 1.2<br />
+<p>Opcode: 351 (0x15f) for GCN 1.0/1.1; 478 (0x1de) for GCN 1.2/1.4<br />
 Syntax: V_DIV_FIXUP_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Handle all exceptions requires for single floating point division.
 …
     VDST = SF0</code></p>
 <h4>V_DIV_FIXUP_F64</h4>
 <p>Opcode: 352 (0x160) for GCN 1.0/1.1; 479 (0x1df) for GCN 1.2<br />
+<p>Opcode: 352 (0x160) for GCN 1.0/1.1; 479 (0x1df) for GCN 1.2/1.4<br />
 Syntax: V_DIV_FIXUP_F64 VDST(2), SRC0(2), SRC1(2), SRC2(2)<br />
 Description: Handle all exceptions requires for double floating point division.
 …
 else
     VDST = SF0</code></p>
+<h4>V_DIV_FIXUP_LEGACY_F16</h4>
+<p>Opcode: 495 (0x1ef) for GCN 1.4<br />
+Syntax: V_DIV_FIXUP_LEGACY_F16 VDST, SRC0, SRC1, SRC2<br />
+Description: Handle all exceptions requires for half floating point division.
+SRC0 is quotient, SRC1 is denominator, SRC2 is nominator. Correct result stored to VDST.<br />
+Operation:<br />
+<code>HALF SF0 = ASHALF(SRC0)
+HALF SF1 = ASHALF(SRC1)
+HALF SF2 = ASHALF(SRC2)
+if (ISNAN(SF1) &amp;&amp; !ISNAN(SF2))
+    VDST = QUIETNAN(SF1)
+else if (ISNAN(SF2))
+    VDST = QUIETNAN(SF2)
+else if (SF1 == 0.0 &amp;&amp; SF2 == 0.0)
+    VDST = NAN_H
+else if (ABS(SF1)==INF &amp;&amp; ABS(SF2)==INF)
+    VDST = -NAN_H
+else if (SF1 == 0.0)
+    VDST = INF_H*SIGN(SF1)*SIGN(SF2)
+else if (ABS(SF1) == INF)
+    VDST = SIGN(SF1)*SIGN(SF2) &gt;=0 ? 0.0 : -0.0
+else if (ISNAN(SF0))
+    VDST = SIGN(SF1)*SIGN(SF2)*INF_H
+else
+    VDST = SF0</code></p>
 <h4>V_DIV_FMAS_F32</h4>
 <p>Opcode: 367 (0x16f) for GCN 1.0/1.1; 482 (0x1e2) for GCN 1.2<br />
+<p>Opcode: 367 (0x16f) for GCN 1.0/1.1; 482 (0x1e2) for GCN 1.2/1.4<br />
 Syntax: V_DIV_FMAS_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Special case divide FMA with scale and flags.
 …
     VDST = ASFLOAT(VDST)*POW(-2.0,64)</code></p>
 <h4>V_DIV_FMAS_F64</h4>
 <p>Opcode: 368 (0x170) for GCN 1.0/1.1; 483 (0x1e3) for GCN 1.2<br />
+<p>Opcode: 368 (0x170) for GCN 1.0/1.1; 483 (0x1e3) for GCN 1.2/1.4<br />
 Syntax: V_DIV_FMAS_F64 VDST(2), SRC0(2), SRC1(2), SRC2(2)<br />
 Description: Special case divide FMA with scale and flags.
 …
     VDST = ASDOUBLE(VDST)*POW(-2.0,128)</code></p>
 <h4>V_DIV_SCALE_F32</h4>
 <p>Opcode (VOP3B): 365 (0x16d) for GCN 1.0/1.1; 480 (0x1e0) for GCN 1.2<br />
+<p>Opcode (VOP3B): 365 (0x16d) for GCN 1.0/1.1; 480 (0x1e0) for GCN 1.2/1.4<br />
 Syntax: V_DIV_SCALE_F32 VDST, SDST(2), SRC0, SRC1, SRC2<br />
 Description: Special case divide preop and flags. SRC0 is quotient, SRC1 is denominator,
 …
 }</code></p>
 <h4>V_DIV_SCALE_F64</h4>
 <p>Opcode (VOP3B): 366 (0x16e) for GCN 1.0/1.1; 481 (0x1e1) for GCN 1.2<br />
+<p>Opcode (VOP3B): 366 (0x16e) for GCN 1.0/1.1; 481 (0x1e1) for GCN 1.2/1.4<br />
 Syntax: V_DIV_SCALE_F64 VDST(2), SDST(2), SRC0(2), SRC1(2), SRC2(2)<br />
 Description: Special case divide preop and flags. SRC0 is quotient, SRC1 is denominator,
 …
 }</code></p>
 <h4>V_FMA_F16</h4>
 <p>Opcode: 494 (0x1ee) for GCN 1.2<br />
+<p>Opcode: 494 (0x1ee) for GCN 1.2; 518 (0x206) for GCN 1.4<br />
 Syntax: V_FMA_F16 VDST, SRC0, SRC1, SRC2<br />
 Description: Fused multiply addition on half floating point values from
 …
 VDST = FMA(ASHALF(SRC0), ASHALF(SRC1), ASHALF(SRC2))</code></p>
 <h4>V_FMA_F32</h4>
 <p>Opcode: 331 (0x14b) for GCN 1.0/1.1; 459 (0x1cb) for GCN 1.2<br />
+<p>Opcode: 331 (0x14b) for GCN 1.0/1.1; 459 (0x1cb) for GCN 1.2/1.4<br />
 Syntax: V_FMA_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Fused multiply addition on single floating point values from
 …
 VDST = FMA(ASFLOAT(SRC0), ASFLOAT(SRC1), ASFLOAT(SRC2))</code></p>
 <h4>V_FMA_F64</h4>
 <p>Opcode: 332 (0x14c) for GCN 1.0/1.1; 460 (0x1cc) for GCN 1.2<br />
+<p>Opcode: 332 (0x14c) for GCN 1.0/1.1; 460 (0x1cc) for GCN 1.2/1.4<br />
 Syntax: V_FMA_F64 VDST(2), SRC0(2), SRC1(2), SRC2(2)<br />
 Description: Fused multiply addition on double floating point values from
 …
 <code>// SRC0*SRC1+SRC2
 VDST = FMA(ASDOUBLE(SRC0), ASDOUBLE(SRC1), ASDOUBLE(SRC2))</code></p>
+<h4>V_FMA_LEGACY_F16</h4>
+<p>Opcode: 494 (0x1ee) for GCN 1.4<br />
+Syntax: V_FMA_LEGACY_F16 VDST, SRC0, SRC1, SRC2<br />
+Description: Fused multiply addition on half floating point values from
+SRC0, SRC1 and SRC2. Result stored in VDST.<br />
+Operation:<br />
+<code>// SRC0*SRC1+SRC2
+VDST = FMA(ASHALF(SRC0), ASHALF(SRC1), ASHALF(SRC2))</code></p>
 <h4>V_LDEXP_F32</h4>
 <p>Opcode: 648 (0x288) for GCN 1.2<br />
+<p>Opcode: 648 (0x288) for GCN 1.2/1.4<br />
 Syntax: V_LDEXP_F32 VDST, SRC0, SRC1<br />
 Description: Do ldexp operation on SRC0 and SRC1 (multiply SRC0 by 2**(SRC1)).
 …
 <code>VDST = ASFLOAT(SRC0) * POW(2.0, (INT32)SRC1)</code></p>
 <h4>V_LDEXP_F64</h4>
 <p>Opcode: 360 (0x168) for GCN 1.0/1.1; 644 (0x284) for GCN 1.2<br />
+<p>Opcode: 360 (0x168) for GCN 1.0/1.1; 644 (0x284) for GCN 1.2/1.4<br />
 Syntax: V_LDEXP_F64 VDST(2), SRC0(2), SRC1<br />
 Description: Do ldexp operation on SRC0 and SRC1 (multiply SRC0 by 2**(SRC1)).
 …
 <code>VDST = ASDOUBLE(SRC0) * POW(2.0, (INT32)SRC1)</code></p>
 <h4>V_LERP_U8</h4>
 <p>Opcode: 333 (0x14d) for GCN 1.0/1.1; 461 (0x1cd) for GCN 1.2<br />
+<p>Opcode: 333 (0x14d) for GCN 1.0/1.1; 461 (0x1cd) for GCN 1.2/1.4<br />
 Syntax: V_LERP_U8 VDST, SRC0, SRC1, SRC2<br />
 Description: For each byte of dword, calculate average from SRC0 byte and SRC1 byte with
 …
 Operation:<br />
 <code>VDST = SRC0 &lt;&lt; (SRC1&amp;63)</code></p>
+<h4>V_LSHL_OR_B32</h4>
+<p>Opcode: 512 (0x200) for GCN 1.4<br />
+Syntax: V_LSHL_OR_B32 VDST, SRC0, SRC1, SRC2<br />
+Description: Shift left SRC0 by (SRC1&amp;31) bits and make bitwise OR with SRC2
+and store result to VDST.<br />
+Operation:<br />
+<code>VDST = (SRC0 &lt;&lt; (SRC1&amp;31)) | SRC2</code></p>
 <h4>V_LSHLREV_B64</h4>
 <p>Opcode: 655 (0x28f) for GCN 1.2<br />
+<p>Opcode: 655 (0x28f) for GCN 1.2/1.4<br />
 Syntax: V_LSHLREV_B64 VDST(2), SRC0, SRC1(2)<br />
 Description: Shift left SRC1 by (SRC0&amp;63) bits and store result into VDST.<br />
 …
 <code>VDST = SRC0 &gt;&gt; (SRC1&amp;63)</code></p>
 <h4>V_LSHRREV_B64</h4>
 <p>Opcode: 656 (0x290) for GCN 1.2<br />
+<p>Opcode: 656 (0x290) for GCN 1.2/1.4<br />
 Syntax: V_LSHRREV_B64 VDST(2), SRC0, SRC1(2)<br />
 Description: Shift right SRC1 by (SRC0&amp;63) bits and store result into VDST.<br />
 …
 <code>VDST = SRC1 &gt;&gt; (SRC0&amp;63)</code></p>
 <h4>V_MAC_LEGACY_F32</h4>
 <p>Opcode: 654 (0x28e) for GCN 1.2<br />
+<p>Opcode: 654 (0x28e) for GCN 1.2/1.4<br />
 Syntax: V_MAC_LEGACY_F32 VDST, SRC0, SRC1<br />
 Description: Multiply FP value from SRC0 by FP value from SRC1 and add result to VDST.
 …
     VDST = ASFLOAT(SRC0) * ASFLOAT(SRC1) + ASFLOAT(VDST)</code></p>
 <h4>V_MAD_F16</h4>
 <p>Opcode: 490 (0x1ea) for GCN 1.2<br />
+<p>Opcode: 490 (0x1ea) for GCN 1.2; 515 (0x203) for GCN 1.4<br />
 Syntax: V_MAD_F16 VDST, SRC0, SRC1, SRC2<br />
 Description: Multiply half FP value from SRC0 by half FP value from
 …
 <code>VDST = ASHALF(SRC0) * ASHALF(SRC1) + ASHALF(SRC2)</code></p>
 <h4>V_MAD_F32</h4>
 <p>Opcode: 321 (0x141) for GCN 1.0/1.1; 449 (0x1c1) for GCN 1.2<br />
+<p>Opcode: 321 (0x141) for GCN 1.0/1.1; 449 (0x1c1) for GCN 1.2/1.4<br />
 Syntax: V_MAD_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Multiply FP value from SRC0 by FP value from SRC1 and add SRC2, and store
 …
 <code>VDST = ASFLOAT(SRC0) * ASFLOAT(SRC1) + ASFLOAT(SRC2)</code></p>
 <h4>V_MAD_I16</h4>
 <p>Opcode: 492 (0x1ec) for GCN 1.2<br />
+<p>Opcode: 492 (0x1ec) for GCN 1.2; 517 (0x205) for GCN 1.4<br />
 Syntax: V_MAD_I16 VDST, SRC0, SRC1, SRC2<br />
 Description: Multiply 16-bit signed value from SRC0 by 16-bit signed value from
+SRC1 and add 16-bit signed value from SRC2, and store 16-bit signed result to VDST.<br />
+Operation:<br />
+<code>VDST = (INT16)((INT16)SRC0*(INT16)SRC1 + (INT16)SRC2)</code></p>
+SRC1 and add 16-bit signed value from SRC2, and store 16-bit signed result to VDST.
+If CLAMP modifier supplied, then result is saturated to 16-bit signed value.<br />
+Operation:<br />
+<code>UINT32 temp = (SEXT32((INT16)SRC0)*(INT16)SRC1 + (INT16)SRC2)
+VDST = CLAMP ? MIN(MAX(temp), -32768), 32767) : temp&amp;0xffff</code></p>
 <h4>V_MAD_I32_I16</h4>
 <p>Opcode: 498 (0x1f2) for GCN 1.4<br />
 …
 <code>VDST = (UINT32)(SEXT32((INT16)SRC0)*(INT16)SRC1) + SRC2</code></p>
 <h4>V_MAD_I32_I24</h4>
 <p>Opcode: 322 (0x142) for GCN 1.0/1.1; 450 (0x1c2) for GCN 1.2<br />
+<p>Opcode: 322 (0x142) for GCN 1.0/1.1; 450 (0x1c2) for GCN 1.2/1.4<br />
 Syntax: V_MAD_I32_I24 VDST, SRC0, SRC1, SRC2<br />
 Description: Multiply 24-bit signed integer value from SRC0 by 24-bit signed value from
 …
 VDST = V0 * V1 + SRC2</code></p>
 <h4>V_MAD_I64_I32</h4>
 <p>Opcode (VOP3B): 375 (0x177) for GCN 1.1; 489 (0x1e9) for GCN 1.2<br />
+<p>Opcode (VOP3B): 375 (0x177) for GCN 1.1; 489 (0x1e9) for GCN 1.2/1.4<br />
 Syntax: V_MAD_I64_I32 VDST(2), SDST(2), SRC0, SRC1, SRC2(2)<br />
 Description: Multiply 32-bit signed integer value from SRC0 by 32-bit signed value
 …
 UINT64 mask = (1ULL&lt;&lt;LANEID)
 //SDST = (SDST&amp;~mask) | ((?????) ? mask : 0)</code></p>
+<h4>V_MAD_LEGACY_F16</h4>
+<p>Opcode: 490 (0x1ea) for GCN 1.4<br />
+Syntax: V_MAD_LEGACY_F16 VDST, SRC0, SRC1, SRC2<br />
+Description: Multiply half FP value from SRC0 by half FP value from
+SRC1 and add SRC2, and store result to VDST.
+It applies OMOD modifier to result and it flush denormals.<br />
+Operation:<br />
+<code>VDST = ASHALF(SRC0) * ASHALF(SRC1) + ASHALF(SRC2)</code></p>
 <h4>V_MAD_LEGACY_F32</h4>
 <p>Opcode: 320 (0x140) for GCN 1.0/1.1; 448 (0x1c0) for GCN 1.2<br />
+<p>Opcode: 320 (0x140) for GCN 1.0/1.1; 448 (0x1c0) for GCN 1.2/1.4<br />
 Syntax: V_MAD_LEGACY_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Multiply FP value from SRC0 by FP value from SRC1 and add result to SRC2, and
 …
 <code>if (ASFLOAT(SRC0)!=0.0 &amp;&amp; ASFLOAT(SRC1)!=0.0)
     VDST = ASFLOAT(SRC0) * ASFLOAT(SRC1) + ASFLOAT(SRC2)</code></p>
+<h4>V_MAD_LEGACY_I16</h4>
+<p>Opcode: 492 (0x1ec) for GCN 1.4<br />
+Syntax: V_MAD_LEGACY_I16 VDST, SRC0, SRC1, SRC2<br />
+Description: Multiply 16-bit signed value from SRC0 by 16-bit signed value from
+SRC1 and add 16-bit signed value from SRC2, and store 16-bit signed result to VDST.
+If CLAMP modifier supplied, then result is saturated to 16-bit signed value.<br />
+Operation:<br />
+<code>UINT32 temp = (SEXT32((INT16)SRC0)*(INT16)SRC1 + (INT16)SRC2)
+VDST = CLAMP ? MIN(MAX(temp), -32768), 32767) : temp&amp;0xffff</code></p>
+<h4>V_MAD_LEGACY_U16</h4>
+<p>Opcode: 491 (0x1eb) for GCN 1.4<br />
+Syntax: V_MAD_LEGACY_U16 VDST, SRC0, SRC1, SRC2<br />
+Description: Multiply 16-bit unsigned value from SRC0 by 16-bit unsigned value from
+SRC1 and add 16-bit unsigned value from SRC2, and store 16-bit unsigned result to VDST.
+If CLAMP modifier supplied, then result is saturated to 16-bit unsigned value.<br />
+Operation:<br />
+<code>UINT32 temp = ((UINT16)SRC0*(UINT16)SRC1 + (UINT16)SRC2) &amp; 0xffff
+VDST = CLAMP ? MIN(temp, 0xffff) : (temp&amp;0xffff)</code></p>
 <h4>V_MAD_U16</h4>
 <p>Opcode: 491 (0x1eb) for GCN 1.2<br />
+<p>Opcode: 491 (0x1eb) for GCN 1.2; 516 (0x204) for GCN 1.4<br />
 Syntax: V_MAD_U16 VDST, SRC0, SRC1, SRC2<br />
 Description: Multiply 16-bit unsigned value from SRC0 by 16-bit unsigned value from
+SRC1 and add 16-bit unsigned value from SRC2, and store 16-bit unsigned result to VDST.<br />
+Operation:<br />
+<code>VDST = ((UINT16)SRC0*(UINT16)SRC1 + (UINT16)SRC2) &amp; 0xffff</code></p>
+SRC1 and add 16-bit unsigned value from SRC2, and store 16-bit unsigned result to VDST.
+If CLAMP modifier supplied, then result is saturated to 16-bit unsigned value.<br />
+Operation:<br />
+<code>UINT32 temp = ((UINT16)SRC0*(UINT16)SRC1 + (UINT16)SRC2) &amp; 0xffff
+VDST = CLAMP ? MIN(temp, 0xffff) : (temp&amp;0xffff)</code></p>
 <h4>V_MAD_U32_U16</h4>
 <p>Opcode: 497 (0x1f1) for GCN 1.4<br />
 …
 <code>VDST = (UINT32)((SRC0&amp;0xffff)*(SRC1&amp;0xffff)) + SRC2</code></p>
 <h4>V_MAD_U32_U24</h4>
 <p>Opcode: 323 (0x143) for GCN 1.0/1.1; 451 (0x1c3) for GCN 1.2<br />
+<p>Opcode: 323 (0x143) for GCN 1.0/1.1; 451 (0x1c3) for GCN 1.2/1.4<br />
 Syntax: V_MAD_U32_U24 VDST, SRC0, SRC1, SRC2<br />
 Description: Multiply 24-bit unsigned integer value from SRC0 by 24-bit unsigned value
 …
 <code>VDST = (UINT32)(SRC0&amp;0xffffff) * (UINT32)(SRC1&amp;0xffffff) + SRC2</code></p>
 <h4>V_MAD_U64_U32</h4>
 <p>Opcode (VOP3B): 374 (0x176) for GCN 1.1; 488 (0x1e8) for GCN 1.2<br />
+<p>Opcode (VOP3B): 374 (0x176) for GCN 1.1; 488 (0x1e8) for GCN 1.2/1.4<br />
 Syntax: V_MAD_U64_U32 VDST(2), SDST(2), SRC0, SRC1, SRC2(2)<br />
 Description: Multiply 32-bit unsigned integer value from SRC0 by 32-bit unsigned value
 …
 SDST = (SDST&amp;~mask) | ((VDST &lt; PROD) ? mask : 0)</code></p>
 <h4>V_MAX_F64</h4>
 <p>Opcode: 359 (0x167) for GCN 1.0/1.1; 643 (0x283) for GCN 1.2<br />
+<p>Opcode: 359 (0x167) for GCN 1.0/1.1; 643 (0x283) for GCN 1.2/1.4<br />
 Syntax: V_MAX_F64 VDST(2), SRC0(2), SRC1(2)<br />
 Description: Choose largest double FP value from SRC0 and SRC1, and store result to VDST.<br />
 …
     VDST = MAX(SF1, SF0)</code></p>
 <h4>V_MAX3_F32</h4>
 <p>Opcode: 340 (0x154) for GCN 1.0/1.1; 467 (0x1d3) for GCN 1.2<br />
+<p>Opcode: 340 (0x154) for GCN 1.0/1.1; 467 (0x1d3) for GCN 1.2/1.4<br />
 Syntax: V_MAX3_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose largest value from FP values SRC0, SRC1, SRC2, and store it to VDST.<br />
 …
     VDST = (UINT16)MAX((INT16)SRC1, (INT16)SRC0)</code></p>
 <h4>V_MAX3_I32</h4>
 <p>Opcode: 341 (0x155) for GCN 1.0/1.1; 468 (0x1d4) for GCN 1.2<br />
+<p>Opcode: 341 (0x155) for GCN 1.0/1.1; 468 (0x1d4) for GCN 1.2/1.4<br />
 Syntax: V_MAX3_I32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose largest value from signed integer values SRC0, SRC1, SRC2,
 …
     VDST = MAX((UINT16)SRC1, (UINT16)SRC0)</code></p>
 <h4>V_MAX3_U32</h4>
 <p>Opcode: 342 (0x156) for GCN 1.0/1.1; 469 (0x1d5) for GCN 1.2<br />
+<p>Opcode: 342 (0x156) for GCN 1.0/1.1; 469 (0x1d5) for GCN 1.2/1.4<br />
 Syntax: V_MAX3_U32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose largest value from unsigned integer values SRC0, SRC1, SRC2,
 …
     VDST = MAX(SRC1, SRC0)</code></p>
 <h4>V_MBCNT_HI_U32_B32</h4>
 <p>Opcode: 653 (0x28d) for GCN 1.2<br />
+<p>Opcode: 653 (0x28d) for GCN 1.2/1.4<br />
 Syntax: V_MBCNT_HI_U32_B32 VDST, SRC0, SRC1<br />
 Description: Make mask for all lanes ending at current lane,
 …
 VDST = SRC1 + BITCOUNT(MASK)</code></p>
 <h4>V_MBCNT_LO_U32_B32</h4>
 <p>Opcode: 652 (0x28c) for GCN 1.2<br />
+<p>Opcode: 652 (0x28c) for GCN 1.2/1.4<br />
 Syntax: V_MBCNT_LO_U32_B32 VDST, SRC0, SRC1<br />
 Description: Make mask for all lanes ending at current lane,
 …
     VDST = SF0</code></p>
 <h4>V_MED3_F32</h4>
 <p>Opcode: 343 (0x157) for GCN 1.0/1.1; 470 (0x1d6) for GCN 1.2<br />
+<p>Opcode: 343 (0x157) for GCN 1.0/1.1; 470 (0x1d6) for GCN 1.2/1.4<br />
 Syntax: V_MED3_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose medium value from FP values SRC0, SRC1, SRC2, and store it to VDST.<br />
 …
     VDST = (UINT16)S0</code></p>
 <h4>V_MED3_I32</h4>
 <p>Opcode: 344 (0x158) for GCN 1.0/1.1; 471 (0x1d7) for GCN 1.2<br />
+<p>Opcode: 344 (0x158) for GCN 1.0/1.1; 471 (0x1d7) for GCN 1.2/1.4<br />
 Syntax: V_MED3_I32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose medium value from signed integer values SRC0, SRC1, SRC2,
 …
     VDST = S0</code></p>
 <h4>V_MED3_U32</h4>
 <p>Opcode: 345 (0x159) for GCN 1.0/1.1; 472 (0x1d8) for GCN 1.2<br />
+<p>Opcode: 345 (0x159) for GCN 1.0/1.1; 472 (0x1d8) for GCN 1.2/1.4<br />
 Syntax: V_MED3_U32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose medium value from unsigned integer values SRC0, SRC1, SRC2,
 …
     VDST = SRC0</code></p>
 <h4>V_MIN_F64</h4>
 <p>Opcode: 358 (0x166) for GCN 1.0/1.1; 642 (0x282) for GCN 1.2<br />
+<p>Opcode: 358 (0x166) for GCN 1.0/1.1; 642 (0x282) for GCN 1.2/1.4<br />
 Syntax: V_MIN_F64 VDST(2), SRC0(2), SRC1(2)<br />
 Description: Choose smallest double FP value from SRC0 and SRC1, and store result to VDST.<br />
 …
     VDST = MIN(SF1, SF0)</code></p>
 <h4>V_MIN3_F32</h4>
 <p>Opcode: 337 (0x151) for GCN 1.0/1.1; 464 (0x1d0) for GCN 1.2<br />
+<p>Opcode: 337 (0x151) for GCN 1.0/1.1; 464 (0x1d0) for GCN 1.2/1.4<br />
 Syntax: V_MIN3_F32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose smallest value from FP values SRC0, SRC1, SRC2, and store it to VDST.<br />
 …
     VDST = (UINT16)MIN((INT16)SRC1, (INT16)SRC0)</code></p>
 <h4>V_MIN3_I32</h4>
 <p>Opcode: 338 (0x152) for GCN 1.0/1.1; 465 (0x1d1) for GCN 1.2<br />
+<p>Opcode: 338 (0x152) for GCN 1.0/1.1; 465 (0x1d1) for GCN 1.2/1.4<br />
 Syntax: V_MIN3_I32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose smallest value from signed integer values SRC0, SRC1, SRC2,
 …
     VDST = MIN(S(UINT16)RC1, (UINT16)SRC0)</code></p>
 <h4>V_MIN3_U32</h4>
 <p>Opcode: 339 (0x153) for GCN 1.0/1.1; 466 (0x1d2) for GCN 1.2<br />
+<p>Opcode: 339 (0x153) for GCN 1.0/1.1; 466 (0x1d2) for GCN 1.2/1.4<br />
 Syntax: V_MIN3_U32 VDST, SRC0, SRC1, SRC2<br />
 Description: Choose smallest value from unsigned integer values SRC0, SRC1, SRC2,
 …
     VDST = MIN(SRC1, SRC0)</code></p>
 <h4>V_MQSAD_U32_U8</h4>
 <p>Opcode: 373 (0x175) for GCN 1.1; 487 (0x1e7) for GCN 1.2<br />
+<p>Opcode: 373 (0x175) for GCN 1.1; 487 (0x1e7) for GCN 1.2/1.4<br />
 Syntax: V_MQSAD_U32_U8 VDST(4), SRC0(2), SRC1, SRC2(4)<br />
 Description: Compute four masked sum of absolute differences with accumulation.
 …
 VDST |= (MSADU8((UINT32)(SRC0&gt;&gt;24), SRC1, SRC2&gt;&gt;96)&lt;&lt;96</code></p>
 <h4>V_MQSAD_U8, V_MQSAD_PK_U16_U8</h4>
 <p>Opcode: 371 (0x173) for GCN 1.0/1.1; 486 (0x1e6) for GCN 1.2<br />
+<p>Opcode: 371 (0x173) for GCN 1.0/1.1; 486 (0x1e6) for GCN 1.2/1.4<br />
 Syntax (GCN 1.0): V_MQSAD_U8 VDST(2), SRC0(2), SRC1, SRC2(2)<br />
 Syntax (GCN 1.1/1.2): V_MQSAD_PK_U16_U8 VDST(2), SRC0(2), SRC1, SRC2(2)<br />
 …
 VDST |= (MSADU8((UINT32)(SRC0&gt;&gt;24), SRC1, (SRC2&gt;&gt;48) &amp; 0xffff)&lt;&lt;48</code></p>
 <h4>V_MSAD_U8</h4>
 <p>Opcode: 369 (0x171) for GCN 1.0/1.1; 484 (0x1e4) for GCN 1.2<br />
+<p>Opcode: 369 (0x171) for GCN 1.0/1.1; 484 (0x1e4) for GCN 1.2/1.4<br />
 Syntax: V_MSAD_U8 VDST, SRC0, SRC1, SRC2<br />
 Description: Calculate sum of absolute differences in SRC0 and SRC1 for bytes that have
 …
         VDST += ABS(((SRC0 &gt;&gt; (i*8)) &amp; 0xff) - ((SRC1 &gt;&gt; (i*8)) &amp; 0xff))</code></p>
 <h4>V_MUL_F64</h4>
 <p>Opcode: 357 (0x165) for GCN 1.0/1.1; 641 (0x281) for GCN 1.2<br />
+<p>Opcode: 357 (0x165) for GCN 1.0/1.1; 641 (0x281) for GCN 1.2/1.4<br />
 Syntax: V_MUL_F64 VDST(2), SRC0(2), SRC1(2)<br />
 Description: Multiply two double FP values from SRC0 and SRC1 and store result to VDST.<br />
 …
 <code>VDST = ASDOUBLE(SRC0) * ASDOUBLE(SRC1)</code></p>
 <h4>V_MUL_HI_I32</h4>
 <p>Opcode: 364 (0x16c) for GCN 1.0/1.1; 647 (0x287) for GCN 1.2<br />
+<p>Opcode: 364 (0x16c) for GCN 1.0/1.1; 647 (0x287) for GCN 1.2/1.4<br />
 Syntax: V_MUL_HI_I32 VDST, SRC0, SRC1<br />
 Description: Multiply 32-bit signed value SRC0 and SRC1, and store higher part of
 …
 <code>VDST = ((INT64)SRC0 * (INT32)SRC1) &gt;&gt; 32</code></p>
 <h4>V_MUL_HI_U32</h4>
 <p>Opcode: 362 (0x16a) for GCN 1.0/1.1; 646 (0x286) for GCN 1.2<br />
+<p>Opcode: 362 (0x16a) for GCN 1.0/1.1; 646 (0x286) for GCN 1.2/1.4<br />
 Syntax: V_MUL_HI_U32 VDST, SRC0, SRC1<br />
 Description: Multiply 32-bit unsigned value SRC0 and SRC1, and store higher part of
 …
 <code>VDST = (INT32)SRC0 * (INT32)SRC1</code></p>
 <h4>V_MUL_LO_U32</h4>
 <p>Opcode: 361 (0x169) for GCN 1.0/1.1; 645 (0x285) for GCN 1.2<br />
+<p>Opcode: 361 (0x169) for GCN 1.0/1.1; 645 (0x285) for GCN 1.2/1.4<br />
 Syntax: V_MUL_LO_U32 VDST, SRC0, SRC1<br />
 Description: Multiply 32-bit unsigned value SRC0 and SRC1, and store lower part of
 …
         VDST = ASFLOAT(SRC0) * ASFLOAT(SRC1)
 }</code></p>
+<h4>V_OR3_B32</h4>
+<p>Opcode: 514 (0x202) for GCN 1.4<br />
+Syntax: V_OR3_B32 VDST, SRC0, SRC1, SRC2<br />
+Description: Make bitwise OR with SRC0, SRC1 and SRC2 and store result to VDST.<br />
+Operation:<br />
+<code>VDST = SRC0 | SRC1 | SRC2</code></p>
 <h4>V_PACK_B32_F16</h4>
 <p>Opcode: 672 (0x2a0) for GCN 1.4<br />
 …
 <code>VDST = (SRC0&amp;0xffff) | (SRC1&lt;&lt;16)</code></p>
 <h4>V_PERM_B32</h4>
 <p>Opcode: 493 (0x1ed) for GCN 1.2<br />
+<p>Opcode: 493 (0x1ed) for GCN 1.2/1.4<br />
 Syntax: V_PERM_B32 VDST, SRC0, SRC1, SRC2<br />
 Description: Permute bytes. Choose for every byte in dword, specified value. Bytes in
 …
 }</code></p>
 <h4>V_QSAD_U8, V_QSAD_PK_U16_U8</h4>
 <p>Opcode: 370 (0x172) for GCN 1.0/1.1; 485 (0x1e5) for GCN 1.2<br />
+<p>Opcode: 370 (0x172) for GCN 1.0/1.1; 485 (0x1e5) for GCN 1.2/1.4<br />
 Syntax (GCN 1.0): V_QSAD_U8 VDST(2), SRC0(2), SRC1, SRC2(2)<br />
 Syntax (GCN 1.1/1.2): V_QSAD_PK_U16_U8 VDST(2), SRC0(2), SRC1, SRC2(2)<br />
 …
 VDST |= (SADU8((UINT32)(SRC0&gt;&gt;24), SRC1, (SRC2&gt;&gt;48) &amp; 0xffff)&lt;&lt;48</code></p>
 <h4>V_READLANE_B32</h4>
 <p>Opcode: 649 (0x289) for GCN 1.2<br />
+<p>Opcode: 649 (0x289) for GCN 1.2/1.4<br />
 Syntax: V_READLANE_B32 SDST, VSRC0, SSRC1<br />
 Description: Copy one VSRC0 lane value to one SDST. Lane (thread id) choosen from SSRC1&amp;63.
 …
 <code>SDST = VSRC0[SSRC1 &amp; 63]</code></p>
 <h4>V_SAD_HI_U8</h4>
 <p>Opcode: 347 (0x15b) for GCN 1.0/1.1; 474 (0x1da) for GCN 1.2<br />
+<p>Opcode: 347 (0x15b) for GCN 1.0/1.1; 474 (0x1da) for GCN 1.2/1.4<br />
 Syntax: V_SAD_HI_U8 VDST, SRC0, SRC1, SRC2<br />
 Description: Calculate sum of absolute differences for all four bytes in SRC0 and SRC1,
 …
     VDST += (ABS(((SRC0 &gt;&gt; (i*8)) &amp; 0xff) - ((SRC1 &gt;&gt; (i*8)) &amp; 0xff)))&lt;&lt;16</code></p>
 <h4>V_SAD_U16</h4>
 <p>Opcode: 348 (0x15c) for GCN 1.0/1.1; 475 (0x1db) for GCN 1.2<br />
+<p>Opcode: 348 (0x15c) for GCN 1.0/1.1; 475 (0x1db) for GCN 1.2/1.4<br />
 Syntax: V_SAD_U16 VDST, SRC0, SRC1, SRC2<br />
 Description: Calculate sum of absolute differences for two 16-bit words in SRC0 and SRC1,
 …
 VDST += ABS((SRC0 &gt;&gt; 16) - (SRC1 &gt;&gt; 16))</code></p>
 <h4>V_SAD_U32</h4>
 <p>Opcode: 349 (0x15d) for GCN 1.0/1.1; 476 (0x1dc) for GCN 1.2<br />
+<p>Opcode: 349 (0x15d) for GCN 1.0/1.1; 476 (0x1dc) for GCN 1.2/1.4<br />
 Syntax: V_SAD_U32 VDST, SRC0, SRC1, SRC2<br />
 Description: Calculate sum of absolute difference for SRC0 and SRC1, add
 …
 <code>VDST = SRC2 + ABS(SRC0 - SRC1)</code></p>
 <h4>V_SAD_U8</h4>
 <p>Opcode: 346 (0x15a) for GCN 1.0/1.1; 473 (0x1d9) for GCN 1.2<br />
+<p>Opcode: 346 (0x15a) for GCN 1.0/1.1; 473 (0x1d9) for GCN 1.2/1.4<br />
 Syntax: V_SAD_U8 VDST, SRC0, SRC1, SRC2<br />
 Description: Calculate sum of absolute differences for all four bytes in SRC0 and SRC1, add
 …
 }</code></p>
 <h4>V_TRIG_PREOP_F64</h4>
 <p>Opcode: 372 (0x174) for GCN 1.0/1.1; 658 (0x292) for GCN 1.2<br />
+<p>Opcode: 372 (0x174) for GCN 1.0/1.1; 658 (0x292) for GCN 1.2/1.4<br />
 Syntax: V_TRIG_PREOP_F64 VDST(2), SRC0(2), SRC1<br />
 Description:  D.d = Look Up 2/PI (S0.d) with segment select S1.u[4:0].
 …
 VDST = (DOUBLE)(TWOPERPI[BIT:BIT+52]) * POW(2.0, -BIT-53)</code></p>
 <h4>V_WRITELANE_B32</h4>
 <p>Opcode: 650 (0x28a) for GCN 1.2<br />
+<p>Opcode: 650 (0x28a) for GCN 1.2/1.4<br />
 Syntax: V_WRITELANE_B32 VDST, VSRC0, SSRC1<br />
 Description: Copy SGPR to one lane of VDST. Lane choosen (thread id) from SSRC1&amp;63.