Context Navigation

Changes between Version 5 and Version 6 of GcnInstrsVop1

Timestamp:: 11/28/15 15:00:15 (8 years ago)
Author:: trac
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

GcnInstrsVop1

-                      v5
+                      v6
 <p>Alphabetically sorted instruction list:</p>
 <h4>V_CVT_F16_F32</h4>
 <p>Opcode VOP2: 10 (0xa)<br />
+<p>Opcode VOP1: 10 (0xa)<br />
 Opcode VOP3A: 394 (0x18a) for GCN 1.0/1.1; 330 (0x14a) for GCN 1.2<br />
 Syntax: V_CVT_F16_F32 VDST, SRC0<br />
 …
 If absolute value is too high, then store -/+infinity to VDST.<br />
 Operation:<br />
 <code>VDST = RNDHALF(ASFLOAT(SRC0))</code></p>
+<code>VDST = CVTHALF(ASFLOAT(SRC0))</code></p>
 <h4>V_CVT_F32_F16</h4>
 <p>Opcode VOP2: 11 (0xb)<br />
+<p>Opcode VOP1: 11 (0xb)<br />
 Opcode VOP3A: 395 (0x18b) for GCN 1.0/1.1; 331 (0x14b) for GCN 1.2<br />
 Syntax: V_CVT_F32_F16 VDST, SRC0<br />
 …
 Operation:<br />
 <code>VDST = (FLOAT)(ASHALF(SRC0))</code></p>
+<h4>V_CVT_F32_F64</h4>
+<p>Opcode VOP1: 15 (0xf)<br />
+Opcode VOP3A: 399 (0x18f) for GCN 1.0/1.1; 335 (0x14f) for GCN 1.2<br />
+Syntax: V_CVT_F32_F64 VDST, SRC0(2)<br />
+Description: Convert double FP value to single floating point value with rounding from
+MODE register (single FP rounding mode), and store result to VDST.
+If absolute value is too high, then store -/+infinity to VDST.<br />
+Operation:<br />
+<code>VDST = CVTHALF(ASDOUBLE(SRC0))</code></p>
 <h4>V_CVT_F32_I32</h4>
 <p>Opcode VOP2: 5 (0x5)<br />
+<p>Opcode VOP1: 5 (0x5)<br />
 Opcode VOP3A: 389 (0x185) for GCN 1.0/1.1; 325 (0x145) for GCN 1.2<br />
 Syntax: V_CVT_F32_I32 VDST, SRC0<br />
 …
 <code>VDST = (FLOAT)(INT32)SRC0</code></p>
 <h4>V_CVT_F32_U32</h4>
 <p>Opcode VOP2: 6 (0x6)<br />
+<p>Opcode VOP1: 6 (0x6)<br />
 Opcode VOP3A: 390 (0x186) for GCN 1.0/1.1; 326 (0x146) for GCN 1.2<br />
 Syntax: V_CVT_F32_U32 VDST, SRC0<br />
 …
 Operation:<br />
 <code>VDST = (FLOAT)SRC0</code></p>
+<h4>V_CVT_F32_UBYTE0</h4>
+<p>Opcode VOP1: 17 (0x11)<br />
+Opcode VOP3A: 401 (0x191) for GCN 1.0/1.1; 337 (0x151) for GCN 1.2<br />
+Syntax: V_CVT_F32_UBYTE0 VDST, SRC0<br />
+Description: Convert the first unsigned 8-bit byte from SRC0 to single FP value,
+and store it to VDST.<br />
+Operation:<br />
+<code>VDST = (FLOAT)(SRC0 &amp; 0xff)</code></p>
+<h4>V_CVT_F32_UBYTE1</h4>
+<p>Opcode VOP1: 18 (0x12)<br />
+Opcode VOP3A: 402 (0x192) for GCN 1.0/1.1; 338 (0x152) for GCN 1.2<br />
+Syntax: V_CVT_F32_UBYTE1 VDST, SRC0<br />
+Description: Convert the second unsigned 8-bit byte from SRC0 to single FP value,
+and store it to VDST.<br />
+Operation:<br />
+<code>VDST = (FLOAT)((SRC0&gt;&gt;8) &amp; 0xff)</code></p>
+<h4>V_CVT_F32_UBYTE2</h4>
+<p>Opcode VOP1: 19 (0x13)<br />
+Opcode VOP3A: 403 (0x193) for GCN 1.0/1.1; 339 (0x153) for GCN 1.2<br />
+Syntax: V_CVT_F32_UBYTE2 VDST, SRC0<br />
+Description: Convert the third unsigned 8-bit byte from SRC0 to single FP value,
+and store it to VDST.<br />
+Operation:<br />
+<code>VDST = (FLOAT)((SRC0&gt;&gt;16) &amp; 0xff)</code></p>
+<h4>V_CVT_F32_UBYTE3</h4>
+<p>Opcode VOP1: 20 (0x14)<br />
+Opcode VOP3A: 404 (0x194) for GCN 1.0/1.1; 340 (0x154) for GCN 1.2<br />
+Syntax: V_CVT_F32_UBYTE3 VDST, SRC0<br />
+Description: Convert the fourth unsigned 8-bit byte from SRC0 to single FP value,
+and store it to VDST.<br />
+Operation:<br />
+<code>VDST = (FLOAT)(SRC0&gt;&gt;24)</code></p>
+<h4>V_CVT_F64_F32</h4>
+<p>Opcode VOP1: 16 (0x10)<br />
+Opcode VOP3A: 400 (0x190) for GCN 1.0/1.1; 336 (0x150) for GCN 1.2<br />
+Syntax: V_CVT_F64_F32 VDST(2), SRC0<br />
+Description: Convert single FP value to double FP value, and store result to VDST.<br />
+Operation:<br />
+<code>VDST = (DOUBLE)(ASFLOAT(SRC0))</code></p>
 <h4>V_CVT_F64_I32</h4>
 <p>Opcode VOP2: 4 (0x4)<br />
+<p>Opcode VOP1: 4 (0x4)<br />
 Opcode VOP3A: 388 (0x184) for GCN 1.0/1.1; 324 (0x144) for GCN 1.2<br />
 Syntax: V_CVT_F64_I32 VDST(2), SRC0<br />
 …
 <code>VDST = (DOUBLE)(INT32)SRC0</code></p>
 <h4>V_CVT_FLR_I32_F32</h4>
 <p>Opcode VOP2: 13 (0xd)<br />
+<p>Opcode VOP1: 13 (0xd)<br />
 Opcode VOP3A: 397 (0x18d) for GCN 1.0/1.1; 333 (0x14d) for GCN 1.2<br />
 Syntax: V_CVT_FLR_I32_F32 VDST, SRC0<br />
 …
     VDST = (INT32)SRC0&gt;=0 ? 2147483647 : -2147483648</code></p>
 <h4>V_CVT_I32_F32</h4>
 <p>Opcode VOP2: 8 (0x8)<br />
+<p>Opcode VOP1: 8 (0x8)<br />
 Opcode VOP3A: 392 (0x188) for GCN 1.0/1.1; 328 (0x148) for GCN 1.2<br />
 Syntax: V_CVT_I32_F32 VDST, SRC0<br />
 …
     VDST = (INT32)MAX(MIN(RNDTZINT(ASFLOAT(SRC0)), 2147483647.0), -2147483648.0)</code></p>
 <h4>V_CVT_I32_F64</h4>
 <p>Opcode VOP2: 3 (0x3)<br />
+<p>Opcode VOP1: 3 (0x3)<br />
 Opcode VOP3A: 387 (0x183) for GCN 1.0/1.1; 323 (0x143) for GCN 1.2<br />
 Syntax: V_CVT_I32_F64 VDST, SRC0(2)<br />
 …
 if (SRC0!=NAN)
     VDST = (INT32)MAX(MIN(RNDTZINT(ASDOUBLE(SRC0)), 2147483647.0), -2147483648.0)</code></p>
+<h4>V_CVT_OFF_F32_I4</h4>
+<p>Opcode VOP1: 14 (0xe)<br />
+Opcode VOP3A: 398 (0x18e) for GCN 1.0/1.1; 334 (0x14e) for GCN 1.2<br />
+Syntax: V_CVT_OFF_F32_I4 VDST, SRC0<br />
+Description: Convert 4-bit signed value from SRC0 to floating point value, normalize that
+value to range -0.5:0.4375 and store result to VDST.<br />
+Operation:<br />
+<code>VDST = (FLOAT)((SRC0 &amp; 0xf) ^ 8) / 16.0 - 0.5</code></p>
 <h4>V_CVT_RPI_I32_F32</h4>
 <p>Opcode VOP2: 12 (0xc)<br />
+<p>Opcode VOP1: 12 (0xc)<br />
 Opcode VOP3A: 396 (0x18c) for GCN 1.0/1.1; 332 (0x14c) for GCN 1.2<br />
 Syntax: V_CVT_RPI_I32_F32 VDST, SRC0<br />
 …
     VDST = (INT32)SRC0&gt;=0 ? 2147483647 : -2147483648</code></p>
 <h4>V_CVT_U32_F32</h4>
 <p>Opcode VOP2: 7 (0x7)<br />
+<p>Opcode VOP1: 7 (0x7)<br />
 Opcode VOP3A: 391 (0x187) for GCN 1.0/1.1; 327 (0x147) for GCN 1.2<br />
 Syntax: V_CVT_U32_F32 VDST, SRC0<br />
 …
     VDST = (UINT32)MIN(RNDTZINT(ASFLOAT(SRC0)), 4294967295.0)</code></p>
 <h4>V_MOV_FED_B32</h4>
 <p>Opcode VOP2: 9 (0x9)<br />
+<p>Opcode VOP1: 9 (0x9)<br />
 Opcode VOP3A: 393 (0x189) for GCN 1.0/1.1; 329 (0x149) for GCN 1.2<br />
 Syntax: V_MOV_FED_B32 VDST, SRC0<br />
 …
 (???).</p>
 <h4>V_MOV_B32</h4>
 <p>Opcode VOP2: 1 (0x1)<br />
+<p>Opcode VOP1: 1 (0x1)<br />
 Opcode VOP3A: 385 (0x181) for GCN 1.0/1.1; 321 (0x141) for GCN 1.2<br />
 Syntax: V_MOV_B32 VDST, SRC0<br />
 …
 <code>VDST = SRC0</code></p>
 <h4>V_NOP</h4>
 <p>Opcode VOP2: 0 (0x0)<br />
+<p>Opcode VOP1: 0 (0x0)<br />
 Opcode VOP3A: 384 (0x180) for GCN 1.0/1.1; 320 (0x140) for GCN 1.2<br />
 Syntax: V_NOP<br />
 Description: Do nothing.</p>
 <h4>V_READFIRSTLANE_B32</h4>
 <p>Opcode VOP2: 2 (0x2)<br />
+<p>Opcode VOP1: 2 (0x2)<br />
 Opcode VOP3A: 386 (0x182) for GCN 1.0/1.1; 322 (0x142) for GCN 1.2<br />
 Syntax: V_READFIRSTLANE_B32 SDST, VSRC0<br />