Changes between Version 9 and Version 10 of GcnInstrsVop1

Timestamp: 11/29/15 14:00:16
Author: trac
Comment: --

GcnInstrsVop1

-                      v9
+                      v10
     F += 1.0
 VDST = F</code></p>
+<h4>V_COS_F32</h4>
+<p>Opcode VOP1: 54 (0x36) for GCN 1.0/1.1; 42 (0x2a) for GCN 1.2<br />
+Opcode VOP3A: 438 (0x1b6) for GCN 1.0/1.1; 362 (0x16a) for GCN 1.2<br />
+Syntax: V_COS_F32 VDST, SRC0<br />
+Description: Compute cosine of FP value from SRC0. Input value must be normalized to the
+range -1.0 to 1.0 (-360 degrees to 360 degrees). If SRC0 value is out of range then store 1.0 to VDST.
+If SRC0 value is infinity, store -NAN to VDST.<br />
+Operation:<br />
+<code>FLOAT SF = ASFLOAT(SRC0)
+VDST = 1.0
+if (SF &gt;= -1.0 &amp;&amp; SF &lt;= 1.0)
+    VDST = APPROX_COS(SF)
+else if (ABS(SF)==INF)
+    VDST = -NAN
+else if (ABS(SF)==NAN)
+    VDST = SRC0</code></p>
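The V_COS_F32 operation listing above can be sketched in Python as follows. This is a minimal illustration, not the hardware behaviour: the function name `v_cos_f32` is hypothetical, it takes an already-decoded float instead of raw register bits, it does not model float32 rounding or the approximation error of APPROX_COS, and it assumes that an input of 1.0 corresponds to one full turn (360 degrees), as the description suggests.

```python
import math

def v_cos_f32(sf: float) -> float:
    """Hypothetical emulation of the V_COS_F32 operation listing."""
    if math.isnan(sf):
        return sf                            # NaN input is passed through (VDST = SRC0)
    if math.isinf(sf):
        return math.copysign(math.nan, -1)   # infinity yields -NAN
    if -1.0 <= sf <= 1.0:
        # Assumption: 1.0 corresponds to one full turn (360 degrees).
        return math.cos(2.0 * math.pi * sf)
    return 1.0                               # finite out-of-range input yields 1.0
```

Under these assumptions, `v_cos_f32(0.5)` evaluates the cosine of 180 degrees, while `v_cos_f32(3.0)` falls outside the accepted range and returns 1.0.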
 <h4>V_CVT_F16_F32</h4>
 <p>Opcode VOP1: 10 (0xa)<br />
 …
 If value is higher/lower than maximal/minimal integer then store MAX_INT32/MIN_INT32 to VDST.
 If input value is NaN/-NaN then store MAX_INT32/MIN_INT32 to VDST.<br />
-Description:<br />
+Operation:<br />
 <code>FLOAT SF = ASFLOAT(SRC0)
 if (ABS(SF)!=NAN)
 …
 Description: Approximate reciprocal from floating point value SRC0 and store it to VDST.
 Guaranteed error below 1 ULP. Result is clamped to MAX_FLOAT, preserving the sign of the result.<br />
-Description:<br />
+Operation:<br />
 <code>VDST = APPROX_RCP(ASFLOAT(SRC0))
 if (ABS(ASFLOAT(VDST))==INF)
     VDST = SIGN(ASFLOAT(VDST)) * MAX_FLOAT</code></p>
+<h4>V_RCP_CLAMP_F64</h4>
+<p>Opcode VOP1: 48 (0x30) for GCN 1.0/1.1<br />
+Opcode VOP3A: 432 (0x1b0) for GCN 1.0/1.1<br />
+Syntax: V_RCP_CLAMP_F64 VDST(2), SRC0(2)<br />
+Description: Approximate reciprocal from double FP value SRC0 and store it to VDST.
+Relative error of approximation is ~1e-8.
+Result is clamped to MAX_DOUBLE value including sign of a result.<br />
+Operation:<br />
+<code>VDST = APPROX_RCP(ASDOUBLE(SRC0))
+if (ABS(ASDOUBLE(VDST))==INF)
+    VDST = SIGN(ASDOUBLE(VDST)) * MAX_DOUBLE</code></p>
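The clamping step of V_RCP_CLAMP_F64 can be sketched in Python as below. This is an illustration under stated assumptions: `rcp_clamp_f64` is a hypothetical name, the exact division stands in for APPROX_RCP (the ~1e-8 relative error is not modelled), and a zero input is assumed to produce an infinity of the same sign before clamping, as IEEE 754 division would.

```python
import math
import sys

MAX_DOUBLE = sys.float_info.max  # largest finite double

def rcp_clamp_f64(x: float) -> float:
    """Hypothetical emulation of the V_RCP_CLAMP_F64 operation listing."""
    if x == 0.0:
        # Python raises on division by zero, so build the signed
        # infinity explicitly (assumed IEEE 754 behaviour).
        r = math.copysign(math.inf, x)
    else:
        r = 1.0 / x  # stand-in for APPROX_RCP; NaN stays NaN
    if math.isinf(r):
        # Clamp to the largest finite value, keeping the sign.
        r = math.copysign(MAX_DOUBLE, r)
    return r
```

With this sketch, `rcp_clamp_f64(-0.0)` gives `-MAX_DOUBLE` instead of negative infinity, which is the only difference from the unclamped V_RCP_F64 listing.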
 <h4>V_RCP_F32</h4>
 <p>Opcode VOP1: 42 (0x2a) for GCN 1.0/1.1; 34 (0x22) for GCN 1.2<br />
 …
 Description: Approximate reciprocal from floating point value SRC0 and store it to VDST.
 Guaranteed error below 1 ULP.<br />
-Description:<br />
+Operation:<br />
 <code>VDST = APPROX_RCP(ASFLOAT(SRC0))</code></p>
 <h4>V_RCP_F64</h4>
 …
 Description: Approximate reciprocal from double FP value SRC0 and store it to VDST.
 Relative error of approximation is ~1e-8.<br />
-Description:<br />
+Operation:<br />
 <code>VDST = APPROX_RCP(ASDOUBLE(SRC0))</code></p>
-<h4>V_RCP_CLAMP_F64</h4>
-<p>Opcode VOP1: 48 (0x30) for GCN 1.0/1.1<br />
-Opcode VOP3A: 432 (0x1b0) for GCN 1.0/1.1<br />
-Syntax: V_RCP_CLAMP_F64 VDST(2), SRC0(2)<br />
-Description: Approximate reciprocal from double FP value SRC0 and store it to VDST.
-Relative error of approximation is ~1e-8.
-Result is clamped to MAX_DOUBLE value including sign of a result.<br />
-Description:<br />
-<code>VDST = APPROX_RCP(ASDOUBLE(SRC0))
-if (ABS(ASDOUBLE(VDST))==INF)
-    VDST = SIGN(ASDOUBLE(VDST)) * MAX_DOUBLE</code></p>
 <h4>V_RCP_IFLAG_F32</h4>
 <p>Opcode VOP1: 43 (0x2b) for GCN 1.0/1.1; 35 (0x23) for GCN 1.2<br />
 …
 Guaranteed error below 1 ULP. This instruction signals integer division by zero instead
 of any floating point exception when an error occurs.<br />
-Description:<br />
+Operation:<br />
 <code>VDST = APPROX_RCP_IFLAG(ASFLOAT(SRC0))</code></p>
 <h4>V_RCP_LEGACY_F32</h4>
 …
 <p>Opcode VOP1: 44 (0x2c) for GCN 1.0/1.1<br />
 Opcode VOP3A: 428 (0x1ac) for GCN 1.0/1.1<br />
-Syntax: V_RCP_CLAMP_F32 VDST, SRC0<br />
+Syntax: V_RSQ_CLAMP_F32 VDST, SRC0<br />
 Description: Approximate reciprocal square root from floating point value SRC0 with
 clamping to MAX_FLOAT, and store result to VDST.
 If SRC0 is negative value, store -NAN to VDST.
 This instruction doesn't handle denormalized values regardless of the FLOAT MODE register setup.<br />
-Description:<br />
+Operation:<br />
 <code>VDST = APPROX_RSQRT(ASFLOAT(SRC0))
 if (ASFLOAT(VDST)==INF)
     VDST = MAX_FLOAT</code></p>
+<h4>V_RSQ_CLAMP_F64</h4>
+<p>Opcode VOP1: 50 (0x32) for GCN 1.0/1.1<br />
+Opcode VOP3A: 434 (0x1b2) for GCN 1.0/1.1<br />
+Syntax: V_RSQ_CLAMP_F64 VDST(2), SRC0(2)<br />
+Description: Approximate reciprocal square root from double floating point value SRC0
+with clamping to MAX_DOUBLE, and store it to VDST. If SRC0 is negative value,
+store -NAN to VDST.<br />
+Operation:<br />
+<code>VDST = APPROX_RSQRT(ASDOUBLE(SRC0))
+if (ASDOUBLE(VDST)==INF)
+    VDST = MAX_DOUBLE</code></p>
 <h4>V_RSQ_F32</h4>
 <p>Opcode VOP1: 46 (0x2e) for GCN 1.0/1.1; 36 (0x24) for GCN 1.2<br />
 Opcode VOP3A: 430 (0x1ae) for GCN 1.0/1.1; 356 (0x164) for GCN 1.2<br />
-Syntax: V_RCP_F32 VDST, SRC0<br />
+Syntax: V_RSQ_F32 VDST, SRC0<br />
 Description: Approximate reciprocal square root from floating point value SRC0 and
 store it to VDST. If SRC0 is negative value, store -NAN to VDST.
 This instruction doesn't handle denormalized values regardless of the FLOAT MODE register setup.<br />
-Description:<br />
+Operation:<br />
 <code>VDST = APPROX_RSQRT(ASFLOAT(SRC0))</code></p>
+<h4>V_RSQ_F64</h4>
+<p>Opcode VOP1: 49 (0x31) for GCN 1.0/1.1; 38 (0x26) for GCN 1.2<br />
+Opcode VOP3A: 433 (0x1b1) for GCN 1.0/1.1; 358 (0x166) for GCN 1.2<br />
+Syntax: V_RSQ_F64 VDST(2), SRC0(2)<br />
+Description: Approximate reciprocal square root from double floating point value SRC0 and
+store it to VDST. If SRC0 is negative value, store -NAN to VDST.<br />
+Operation:<br />
+<code>VDST = APPROX_RSQRT(ASDOUBLE(SRC0))</code></p>
 <h4>V_RSQ_LEGACY_F32</h4>
 <p>Opcode VOP1: 45 (0x2d) for GCN 1.0/1.1<br />
 …
 If result is zero then store 0.0 to VDST.
 This instruction doesn't handle denormalized values regardless of the FLOAT MODE register setup.<br />
-Description:<br />
+Operation:<br />
 <code>VDST = APPROX_RSQRT(ASFLOAT(SRC0))
 if (ASFLOAT(VDST)==INF)
     VDST = 0.0</code></p>
+<h4>V_SIN_F32</h4>
+<p>Opcode VOP1: 53 (0x35) for GCN 1.0/1.1; 41 (0x29) for GCN 1.2<br />
+Opcode VOP3A: 437 (0x1b5) for GCN 1.0/1.1; 361 (0x169) for GCN 1.2<br />
+Syntax: V_SIN_F32 VDST, SRC0<br />
+Description: Compute sine of FP value from SRC0. Input value must be normalized to the
+range -1.0 to 1.0 (-360 degrees to 360 degrees). If SRC0 value is out of range then store 0.0 to VDST.
+If SRC0 value is infinity, store -NAN to VDST.<br />
+Operation:<br />
+<code>FLOAT SF = ASFLOAT(SRC0)
+VDST = 0.0
+if (SF &gt;= -1.0 &amp;&amp; SF &lt;= 1.0)
+    VDST = APPROX_SIN(SF)
+else if (ABS(SF)==INF)
+    VDST = -NAN
+else if (ABS(SF)==NAN)
+    VDST = SRC0</code></p>
+<h4>V_SQRT_F32</h4>
+<p>Opcode VOP1: 51 (0x33) for GCN 1.0/1.1; 39 (0x27) for GCN 1.2<br />
+Opcode VOP3A: 435 (0x1b3) for GCN 1.0/1.1; 359 (0x167) for GCN 1.2<br />
+Syntax: V_SQRT_F32 VDST, SRC0<br />
+Description: Compute square root of floating point value SRC0, and store result to VDST.
+If SRC0 is negative value then store -NaN to VDST.<br />
+Operation:<br />
+<code>if (ASFLOAT(SRC0)&gt;=0.0)
+    VDST = APPROX_SQRT(ASFLOAT(SRC0))
+else
+    VDST = -NAN</code></p>
+<h4>V_SQRT_F64</h4>
+<p>Opcode VOP1: 52 (0x34) for GCN 1.0/1.1; 40 (0x28) for GCN 1.2<br />
+Opcode VOP3A: 436 (0x1b4) for GCN 1.0/1.1; 360 (0x168) for GCN 1.2<br />
+Syntax: V_SQRT_F64 VDST(2), SRC0(2)<br />
+Description: Compute square root of double floating point value SRC0, and store result
+to VDST. Relative error of approximation is ~1e-8.
+If SRC0 is negative value then store -NaN to VDST.<br />
+Operation:<br />
+<code>if (ASDOUBLE(SRC0)&gt;=0.0)
+    VDST = APPROX_SQRT(ASDOUBLE(SRC0))
+else
+    VDST = -NAN</code></p>
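The VOP1 and VOP3A opcodes listed for the entries added in this revision appear to differ by a constant per architecture: 384 (0x180) for GCN 1.0/1.1 and 320 (0x140) for GCN 1.2. The Python sketch below only checks that observation against the numbers quoted in those entries; the offsets are inferred from this data, not taken from a stated rule, and the names `vop3a_opcode` and `ADDED_ENTRIES` are hypothetical.

```python
# Offsets inferred from the opcode pairs in the added entries (assumption).
OFFSET_GCN10_11 = 384  # 0x180
OFFSET_GCN12 = 320     # 0x140

# name: ((VOP1, VOP3A) for GCN 1.0/1.1, (VOP1, VOP3A) for GCN 1.2 or None)
ADDED_ENTRIES = {
    "V_COS_F32":       ((54, 438), (42, 362)),
    "V_RCP_CLAMP_F64": ((48, 432), None),
    "V_RSQ_CLAMP_F64": ((50, 434), None),
    "V_RSQ_F64":       ((49, 433), (38, 358)),
    "V_SIN_F32":       ((53, 437), (41, 361)),
    "V_SQRT_F32":      ((51, 435), (39, 359)),
    "V_SQRT_F64":      ((52, 436), (40, 360)),
}

def vop3a_opcode(vop1: int, gcn12: bool) -> int:
    """Derive the VOP3A opcode from the VOP1 opcode using the inferred offset."""
    return vop1 + (OFFSET_GCN12 if gcn12 else OFFSET_GCN10_11)

def check_entries() -> bool:
    """Return True if every listed pair matches the inferred offsets."""
    for old, new in ADDED_ENTRIES.values():
        if vop3a_opcode(old[0], gcn12=False) != old[1]:
            return False
        if new is not None and vop3a_opcode(new[0], gcn12=True) != new[1]:
            return False
    return True
```

A check like this is mainly useful for catching transcription errors when opcode tables are edited by hand.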
 <h4>V_TRUNC_F32</h4>
 <p>Opcode VOP1: 33 (0x21) for GCN 1.0/1.1; 28 (0x1c) for GCN 1.2<br />