Context Navigation

Changes between Version 28 and Version 29 of GcnInstrsVop2

Timestamp:: 06/16/17 19:00:24 (7 years ago)
Author:: trac
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

GcnInstrsVop2

-                      v28
+                      v29
 NOTE: OMOD and CLAMP modifier affects only for instruction that output is
 floating point value.<br />
+NOTE: ABS and negation is applied to source operand for any instruction.<br />
+NOTE: OMOD modifier doesn't work for half precision (FP16) instructions (except V_MAC_F16).</p>
+NOTE: ABS and negation is applied to source operand for any instruction.  </p>
 <p>Negation and absolute value can be combined: <code>-ABS(V0)</code>. Modifiers CLAMP and
 OMOD (MUL:2, MUL:4 and DIV:2) can be given in random order.</p>
 …
 Syntax: V_MAC_F16 VDST, SRC0, SRC1<br />
 Description: Multiply FP16 value from SRC0 by FP16 value from SRC1 and
 add result to VDST. It applies OMOD modifier to result.<br />
+add result to VDST. It applies OMOD modifier to result and it flush denormals.<br />
 Operation:<br />
 <code>VDST = ASHALF(SRC0) * ASHALF(SRC1) + ASHALF(VDST)</code></p>
 …
 Opcode VOP3A: 287 (0x11f) for GCN 1.0/1.1; 278 (0x116) for GCN 1.2<br />
 Syntax: V_MAC_F32 VDST, SRC0, SRC1<br />
+Description: Multiply FP value from SRC0 by FP value from SRC1 and add result to VDST.<br />
+Description: Multiply FP value from SRC0 by FP value from SRC1 and add result to VDST.
+It applies OMOD modifier to result and it flush denormals.<br />
 Operation:<br />
 <code>VDST = ASFLOAT(SRC0) * ASFLOAT(SRC1) + ASFLOAT(VDST)</code></p>
 …
 Syntax: V_MAC_LEGACY_F32 VDST, SRC0, SRC1<br />
 Description: Multiply FP value from SRC0 by FP value from SRC1 and add result to VDST.
+If one of value is 0.0 then always do not change VDST (do not apply IEEE rules for 0.0*x).<br />
+If one of value is 0.0 then always do not change VDST (do not apply IEEE rules for 0.0*x).
+It applies OMOD modifier to result and it flush denormals.<br />
 Operation:<br />
 <code>if (ASFLOAT(SRC0)!=0.0 &amp;&amp; ASFLOAT(SRC1)!=0.0)
     VDST = ASFLOAT(SRC0) * ASFLOAT(SRC1) + ASFLOAT(VDST)</code></p>
+<h4>V_MADAK_F16</h4>
+<p>Opcode: 37 (0x25) for GCN 1.2<br />
+Opcode: 293 (0x125) for GCN 1.2<br />
+Syntax: V_MADAK_F16 VDST, SRC0, SRC1, FLOAT16LIT<br />
+Description: Multiply FP16 value from SRC0 with FP16 value from SRC1 and add
+the constant literal FLOATLIT16; and store result to VDST. Constant literal follows
+after instruction word. It flush denormals.<br />
+Operation:
+<code>VDST = ASHALF(SRC0) * ASHALF(SRC1) + ASHALF(FLOAT16LIT)</code></p>
+<h4>V_MADAK_F32</h4>
+<p>Opcode: VOP2: 33 (0x21) for GCN 1.0/1.1; 24 (0x18) for GCN 1.2<br />
+Opcode: VOP3A: 289 (0x121) for GCN 1.0/1.1; 280 (0x118) for GCN 1.2<br />
+Syntax: V_MADAK_F32 VDST, SRC0, SRC1, FLOATLIT<br />
+Description: Multiply FP value from SRC0 with FP value from SRC1 and add
+the constant literal FLOATLIT; and store result to VDST. Constant literal follows
+after instruction word. It flush denormals.<br />
+Operation:
+<code>VDST = ASFLOAT(SRC0) * ASFLOAT(SRC1) + ASFLOAT(FLOATLIT)</code></p>
 <h4>V_MADMK_F16</h4>
 <p>Opcode: 36 (0x24) for GCN 1.2<br />
 …
 Description: Multiply FP16 value from SRC0 with the constant literal FLOAT16LIT and add
 FP16 value from SRC1; and store result to VDST. Constant literal follows
 after instruction word. Use nearest-even rouding.<br />
+after instruction word. It flush denormals.<br />
 Operation:
 <code>VDST = ASHALF(SRC0) * ASHALF(FLOAT16LIT) + ASHALF(SRC1)</code></p>
 …
 Description: Multiply FP value from SRC0 with the constant literal FLOATLIT and add
 FP value from SRC1; and store result to VDST. Constant literal follows
 after instruction word.<br />
+after instruction word. It flush denormals.<br />
 Operation:
 <code>VDST = ASFLOAT(SRC0) * ASFLOAT(FLOATLIT) + ASFLOAT(SRC1)</code></p>
-<h4>V_MADAK_F16</h4>
-<p>Opcode: 37 (0x25) for GCN 1.2<br />
-Opcode: 293 (0x125) for GCN 1.2<br />
-Syntax: V_MADAK_F16 VDST, SRC0, SRC1, FLOAT16LIT<br />
-Description: Multiply FP16 value from SRC0 with FP16 value from SRC1 and add
-the constant literal FLOATLIT16; and store result to VDST. Constant literal follows
-after instruction word.<br />
-Operation:
-<code>VDST = ASHALF(SRC0) * ASHALF(SRC1) + ASHALF(FLOAT16LIT)</code></p>
-<h4>V_MADAK_F32</h4>
-<p>Opcode: VOP2: 33 (0x21) for GCN 1.0/1.1; 24 (0x18) for GCN 1.2<br />
-Opcode: VOP3A: 289 (0x121) for GCN 1.0/1.1; 280 (0x118) for GCN 1.2<br />
-Syntax: V_MADAK_F32 VDST, SRC0, SRC1, FLOATLIT<br />
-Description: Multiply FP value from SRC0 with FP value from SRC1 and add
-the constant literal FLOATLIT; and store result to VDST. Constant literal follows
-after instruction word.<br />
-Operation:
-<code>VDST = ASFLOAT(SRC0) * ASFLOAT(SRC1) + ASFLOAT(FLOATLIT)</code></p>
 <h4>V_MAX_F16</h4>
 <p>Opcode VOP2: 45 (0x2d) for GCN 1.2<br />