Changes between Version 12 and Version 13 of GcnInstrsVop1

Timestamp: 11/29/15 21:00:17
Author: trac
Comment: --

GcnInstrsVop1

-                      v12
+                      v13
 (ceiling), and store result to VDST. Implemented by flooring.
 If SRC0 is infinity or NaN then copy SRC0 to VDST.<br />
 Operation:
+Operation:<br />
 <code>FLOAT F = FLOOR(ASFLOAT(SRC0))
 if (ASFLOAT(SRC0) &gt; 0.0 &amp;&amp; ASFLOAT(SRC0) != F)
 …
     if (((1U&lt;&lt;i) &amp; SRC0) != 0)
     { VDST = 31-i; break; }</code></p>
+<h4>V_FFBH_I32</h4>
+<p>Opcode VOP1: 59 (0x3b) for GCN 1.0/1.1; 47 (0x2f) for GCN 1.2<br />
+Opcode VOP3A: 443 (0x1bb) for GCN 1.0/1.1; 367 (0x16f) for GCN 1.2<br />
+Syntax: V_FFBH_I32 VDST, SRC0<br />
+Description: Find the highest bit in SRC0 that differs from the sign bit. If found, store the
+number of skipped bits to VDST, otherwise set VDST to -1.<br />
+Operation:<br />
+<code>VDST = -1
+UINT32 bitval = (INT32)SRC0&gt;=0 ? 1 : 0
+for (INT8 i = 31; i &gt;= 0; i--)
+    if (((1U&lt;&lt;i) &amp; SRC0) == (bitval&lt;&lt;i))
+    { VDST = 31-i; break; }</code></p>
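To make the V_FFBH_I32 pseudocode above concrete, here is a minimal Python sketch of the same scan. The function name `v_ffbh_i32` and the convention of passing SRC0 as an unsigned 32-bit integer are assumptions made for illustration; they are not part of the documented instruction set.

```python
def v_ffbh_i32(src0: int) -> int:
    """Sketch of V_FFBH_I32: scan from bit 31 down for a bit unlike the sign."""
    src0 &= 0xFFFFFFFF                  # treat the input as a 32-bit register
    sign = (src0 >> 31) & 1             # 1 when (INT32)SRC0 is negative
    bitval = 0 if sign else 1           # the bit value opposite to the sign
    for i in range(31, -1, -1):
        if ((src0 >> i) & 1) == bitval:
            return 31 - i               # number of bits skipped from the top
    return -1                           # all bits equal the sign (0 or -1)


if __name__ == "__main__":
    for value in (0x00000000, 0xFFFFFFFF, 0x00000001, 0x40000000, 0xFFFFFFFE):
        print(hex(value), v_ffbh_i32(value))
```

Under these assumptions, inputs of 0 and 0xFFFFFFFF both yield -1, because no bit differs from the sign bit.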
 <h4>V_FFBL_B32</h4>
 <p>Opcode VOP1: 58 (0x3a) for GCN 1.0/1.1; 46 (0x2e) for GCN 1.2<br />
 …
     if (((1U&lt;&lt;i) &amp; SRC0) != 0)
     { VDST = i; break; }</code></p>
-<h4>V_FFBH_I32</h4>
-<p>Opcode VOP1: 59 (0x3b) for GCN 1.0/1.1; 47 (0x2f) for GCN 1.2<br />
-Opcode VOP3A: 443 (0x1bb) for GCN 1.0/1.1; 367 (0x16f) for GCN 1.2<br />
-Syntax: V_FFBH_I32 VDST, SRC0<br />
-Description: Find last opposite bit to sign in SRC0. If found, store number of skipped bits
-to VDST, otherwise set VDST to -1.<br />
-Operation:<br />
-<code>VDST = -1
-UINT32 bitval = (INT32)SRC0&gt;=0 ? 1 : 0
-for (INT8 i = 31; i &gt;= 0; i--)
-    if ((1U&lt;&lt;i) &amp; SRC0) == (bitval&lt;&lt;i))
-    { VDST = 31-i; break; }</code></p>
 <h4>V_FLOOR_F32</h4>
 <p>Opcode VOP1: 36 (0x24) for GCN 1.0/1.1; 31 (0x1f) for GCN 1.2<br />
 Opcode VOP3A: 420 (0x1a4) for GCN 1.0/1.1; 351 (0x15f) for GCN 1.2<br />
 Syntax: V_FLOOR_F32 VDST, SRC0<br />
 Description: Truncate floating point valu from SRC0 with rounding to positive infinity
+Description: Truncate floating point value SRC0 with rounding to negative infinity
 (flooring), and store result to VDST. If SRC0 is infinity or NaN then copy SRC0 to VDST.<br />
 Operation:
+Operation:<br />
 <code>VDST = FLOOR(ASFLOAT(SRC0))</code></p>
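The V_FLOOR_F32 operation above works on the raw 32-bit register contents, so a sketch has to reinterpret bits as a float and back. The following Python version is only an illustration: the helper names `as_float`, `as_bits` and `v_floor_f32` are hypothetical, and FLOOR is taken in its usual sense of rounding toward negative infinity.

```python
import math
import struct


def as_float(bits: int) -> float:
    """Reinterpret a 32-bit pattern as an IEEE 754 single (ASFLOAT)."""
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFFFFFF))[0]


def as_bits(value: float) -> int:
    """Reinterpret a single-precision value as its 32-bit pattern."""
    return struct.unpack("<I", struct.pack("<f", value))[0]


def v_floor_f32(src0: int) -> int:
    """Sketch of V_FLOOR_F32 on raw register bits."""
    f = as_float(src0)
    if math.isinf(f) or math.isnan(f):
        return src0 & 0xFFFFFFFF        # infinity and NaN are copied unchanged
    floored = math.floor(f)
    if floored == f:
        return src0 & 0xFFFFFFFF        # already integral; keeps the sign of -0.0
    return as_bits(float(floored))


if __name__ == "__main__":
    print(as_float(v_floor_f32(as_bits(1.5))))
    print(as_float(v_floor_f32(as_bits(-1.5))))
```

Returning the input bits for already-integral values is a design choice of this sketch so that -0.0 is not turned into +0.0 by Python's integer `math.floor` result.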
 <h4>V_FRACT_F32</h4>
 …
     VDST = NAN * SIGN(SF)</code></p>
 <h4>V_FRACT_F64</h4>
 <p>Opcode VOP1: 62 (0x3e) for GCN 1.0/1.1; 51 (0x33) for GCN 1.2<br />
 Opcode VOP3A: 446 (0x1be) for GCN 1.0/1.1; 371 (0x173) for GCN 1.2<br />
+<p>Opcode VOP1: 62 (0x3e) for GCN 1.0/1.1; 50 (0x32) for GCN 1.2<br />
+Opcode VOP3A: 446 (0x1be) for GCN 1.0/1.1; 370 (0x172) for GCN 1.2<br />
 Syntax: V_FRACT_F64 VDST(2), SRC0(2)<br />
 Description: Get fractional from double floating point value SRC0 and store it to VDST.
 …
 Opcode VOP3A: 447 (0x1bf) for GCN 1.0/1.1; 371 (0x173) for GCN 1.2<br />
 Syntax: V_FREXP_EXP_I32_F32 VDST, SRC0<br />
 Description: Get exponent minus 1 from single FP value SRC0, and store that exponent to VDST.
+Description: Get exponent plus 1 from single FP value SRC0, and store that exponent to VDST.
 This instruction realizes frexp function.
 If SRC0 is infinity or NAN then store -1 to VDST.<br />
 …
 Opcode VOP3A: 444 (0x1bc) for GCN 1.0/1.1; 368 (0x170) for GCN 1.2<br />
 Syntax: V_FREXP_EXP_I32_F64 VDST, SRC0(2)<br />
 Description: Get exponent minus 1 from double FP value SRC0, and store that exponent to VDST.
+Description: Get exponent plus 1 from double FP value SRC0, and store that exponent to VDST.
 This instruction realizes frexp function.
 If SRC0 is infinity or NAN then store -1 to VDST.<br />
 …
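The "exponent plus 1" wording in the V_FREXP_EXP_I32_F32 and V_FREXP_EXP_I32_F64 descriptions matches the convention of the C `frexp` function, where the mantissa is normalized to the range [0.5, 1). A small Python sketch of that convention follows; the function name `frexp_exp` is hypothetical, and the handling of zero simply follows Python's `math.frexp`, which may differ from the Operation text elided here.

```python
import math


def frexp_exp(value: float) -> int:
    """Sketch of the frexp exponent: value == m * 2**e with 0.5 <= |m| < 1."""
    if math.isinf(value) or math.isnan(value):
        return -1                       # per the description for infinity and NaN
    return math.frexp(value)[1]         # unbiased exponent plus 1 for normal values


if __name__ == "__main__":
    for value in (1.0, 8.0, 0.75, float("inf")):
        print(value, frexp_exp(value))
```

For example, 8.0 is 1.0 * 2**3 in IEEE form but 0.5 * 2**4 in frexp form, so the stored exponent is 4 rather than 3.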
 <h4>V_LOG_F32</h4>
 <p>Opcode VOP1: 39 (0x27) for GCN 1.0/1.1; 33 (0x21) for GCN 1.2<br />
 Opcode VOP3A: 422 (0x1a6) for GCN 1.0/1.1; 353 (0x161) for GCN 1.2<br />
+Opcode VOP3A: 423 (0x1a7) for GCN 1.0/1.1; 353 (0x161) for GCN 1.2<br />
 Syntax: V_LOG_F32 VDST, SRC0<br />
 Description: Approximate logarithm of base 2 from floating point value SRC0, and store result
 …
 else
     VDST = APPROX_LOG2(F)</code></p>
+<h4>V_MOV_B32</h4>
+<p>Opcode VOP1: 1 (0x1)<br />
+Opcode VOP3A: 385 (0x181) for GCN 1.0/1.1; 321 (0x141) for GCN 1.2<br />
+Syntax: V_MOV_B32 VDST, SRC0<br />
+Description: Move SRC0 into VDST.<br />
+Operation:<br />
+<code>VDST = SRC0</code></p>
 <h4>V_MOV_FED_B32</h4>
 <p>Opcode VOP1: 9 (0x9)<br />
 …
 Description: Introduce edc double error upon write to dest vgpr without causing an exception
 (???).</p>
-<h4>V_MOV_B32</h4>
-<p>Opcode VOP1: 1 (0x1)<br />
-Opcode VOP3A: 385 (0x181) for GCN 1.0/1.1; 321 (0x141) for GCN 1.2<br />
-Syntax: V_MOV_B32 VDST, SRC0<br />
-Description: Move SRC0 into VDST.<br />
-Operation:<br />
-<code>VDST = SRC0</code></p>
 <h4>V_MOVRELD_B32</h4>
 <p>Opcode VOP1: 66 (0x42) for GCN 1.0/1.1; 54 (0x35) for GCN 1.2<br />
 Opcode VOP3A: 450 (0x1c2) for GCN 1.0/1.1; 374 (0x175) for GCN 1.2<br />
+<p>Opcode VOP1: 66 (0x42) for GCN 1.0/1.1; 54 (0x36) for GCN 1.2<br />
+Opcode VOP3A: 450 (0x1c2) for GCN 1.0/1.1; 374 (0x176) for GCN 1.2<br />
 Syntax: V_MOVRELD VDST, VSRC0<br />
 Description: Move SRC0 to VGPR[VDST_NUMBER+M0].<br />
 …
 <code>VGPR[VDST_NUMBER+M0] = SRC0</code></p>
 <h4>V_MOVRELS_B32</h4>
 <p>Opcode VOP1: 67 (0x43) for GCN 1.0/1.1; 55 (0x36) for GCN 1.2<br />
 Opcode VOP3A: 451 (0x1c3) for GCN 1.0/1.1; 375 (0x176) for GCN 1.2<br />
+<p>Opcode VOP1: 67 (0x43) for GCN 1.0/1.1; 55 (0x37) for GCN 1.2<br />
+Opcode VOP3A: 451 (0x1c3) for GCN 1.0/1.1; 375 (0x177) for GCN 1.2<br />
 Syntax: V_MOVRELS VDST, VSRC0<br />
 Description: Move SRC0[SRC0_NUMBER+M0] to VDST.<br />
 …
 <code>VDST = VGPR[SRC0_NUMBER+M0]</code></p>
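The two relative moves above differ only in which side of the assignment is offset by M0. The sketch below models the VGPR file of one lane as a plain Python list to show that difference; the function names and the list-based register file are assumptions for illustration, and no bounds checking or hardware limits are modeled.

```python
def v_movreld_b32(vgpr: list, m0: int, vdst_number: int, src0: int) -> None:
    """Sketch of V_MOVRELD_B32: the destination index is offset by M0."""
    vgpr[vdst_number + m0] = src0 & 0xFFFFFFFF


def v_movrels_b32(vgpr: list, m0: int, vdst_number: int, src0_number: int) -> None:
    """Sketch of V_MOVRELS_B32: the source index is offset by M0."""
    vgpr[vdst_number] = vgpr[src0_number + m0]


if __name__ == "__main__":
    regs = [0] * 8                      # hypothetical 8-entry register file
    v_movreld_b32(regs, m0=2, vdst_number=1, src0=0xAB)   # writes regs[3]
    v_movrels_b32(regs, m0=2, vdst_number=0, src0_number=1)  # reads regs[3]
    print(regs)
```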
 <h4>V_MOVRELSD_B32</h4>
 <p>Opcode VOP1: 67 (0x43) for GCN 1.0/1.1; 55 (0x36) for GCN 1.2<br />
 Opcode VOP3A: 451 (0x1c3) for GCN 1.0/1.1; 375 (0x176) for GCN 1.2<br />
+<p>Opcode VOP1: 68 (0x44) for GCN 1.0/1.1; 56 (0x38) for GCN 1.2<br />
+Opcode VOP3A: 452 (0x1c4) for GCN 1.0/1.1; 376 (0x178) for GCN 1.2<br />
 Syntax: V_MOVRELSD VDST, VSRC0<br />
 Description: Move SRC0[SRC0_NUMBER+M0] to VGPR[VDST_NUMBER+M0].<br />
 …
 Opcode VOP3A: 439 (0x1b7) for GCN 1.0/1.1; 363 (0x16b) for GCN 1.2<br />
 Syntax: V_NOT_B32 VDST, SRC0<br />
 Description: Do bitwise negation on 32-bit SRC0, and store result to VDST.
+Description: Do bitwise negation on 32-bit SRC0, and store result to VDST.<br />
 Operation:<br />
 <code>VDST = ~SRC0</code></p>
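When emulating V_NOT_B32 in a language with unbounded integers, the result of `~` has to be masked back to 32 bits. A minimal Python sketch, with the hypothetical function name `v_not_b32`:

```python
def v_not_b32(src0: int) -> int:
    """Sketch of V_NOT_B32: bitwise negation limited to 32 bits."""
    # Python's ~ yields a negative integer, so mask to the register width.
    return ~src0 & 0xFFFFFFFF


if __name__ == "__main__":
    print(hex(v_not_b32(0xFFFF0000)))
```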