Context Navigation

← Previous Change
Wiki History
Next Change →

Changes between Initial Version and Version 1 of GcnInstrsVopc

Timestamp:: 12/04/15 19:00:16 (8 years ago)
Author:: trac
Comment:: --

Legend:

: Unmodified
: Added
: Removed
: Modified

GcnInstrsVopc

                       v1
+[wiki:ClrxToc Back to Table of content]
+{{{
+#!html
+<h2>GCN ISA VOPC/VOP3 instructions</h2>
+<p>VOPC instructions can be encoded in the VOPC encoding and the VOP3A/VOP3B encoding.
+List of fields for VOPC encoding:</p>
+<table>
+<thead>
+<tr>
+<th>Bits</th>
+<th>Name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0-8</td>
+<td>SRC0</td>
+<td>First (scalar or vector) source operand</td>
+</tr>
+<tr>
+<td>9-16</td>
+<td>VSRC1</td>
+<td>Second (scalar or vector) source operand</td>
+</tr>
+<tr>
+<td>17-24</td>
+<td>OPCODE</td>
+<td>Operation code</td>
+</tr>
+<tr>
+<td>25-31</td>
+<td>ENCODING</td>
+<td>Encoding type. Must be 0b0111110</td>
+</tr>
+</tbody>
+</table>
+<p>Syntax: INSTRUCTION VCC, SRC0, VSRC1</p>
+<p>List of fields for VOP3A/VOP3B encoding (GCN 1.0/1.1):</p>
+<table>
+<thead>
+<tr>
+<th>Bits</th>
+<th>Name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0-7</td>
+<td>SDST</td>
+<td>Scalar destination operand</td>
+</tr>
+<tr>
+<td>8-10</td>
+<td>ABS</td>
+<td>Absolute modifiers for source operands (VOP3A)</td>
+</tr>
+<tr>
+<td>8-14</td>
+<td>SDST</td>
+<td>Scalar destination operand (VOP3B)</td>
+</tr>
+<tr>
+<td>11</td>
+<td>CLAMP</td>
+<td>CLAMP modifier (VOP3A)</td>
+</tr>
+<tr>
+<td>15</td>
+<td>CLAMP</td>
+<td>CLAMP modifier (VOP3B)</td>
+</tr>
+<tr>
+<td>17-25</td>
+<td>OPCODE</td>
+<td>Operation code</td>
+</tr>
+<tr>
+<td>26-31</td>
+<td>ENCODING</td>
+<td>Encoding type. Must be 0b110100</td>
+</tr>
+<tr>
+<td>32-40</td>
+<td>SRC0</td>
+<td>First (scalar or vector) source operand</td>
+</tr>
+<tr>
+<td>41-49</td>
+<td>SRC1</td>
+<td>Second (scalar or vector) source operand</td>
+</tr>
+<tr>
+<td>50-58</td>
+<td>SRC2</td>
+<td>Third (scalar or vector) source operand</td>
+</tr>
+<tr>
+<td>59-60</td>
+<td>OMOD</td>
+<td>OMOD modifier. Multiplication modifier</td>
+</tr>
+<tr>
+<td>61-63</td>
+<td>NEG</td>
+<td>Negation modifier for source operands</td>
+</tr>
+</tbody>
+</table>
+<p>List of fields for VOP3A/VOP3B encoding (GCN 1.2):</p>
+<table>
+<thead>
+<tr>
+<th>Bits</th>
+<th>Name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0-7</td>
+<td>SDST</td>
+<td>Scalar destination operand</td>
+</tr>
+<tr>
+<td>8-10</td>
+<td>ABS</td>
+<td>Absolute modifiers for source operands (VOP3A)</td>
+</tr>
+<tr>
+<td>8-14</td>
+<td>SDST</td>
+<td>Scalar destination operand (VOP3B)</td>
+</tr>
+<tr>
+<td>15</td>
+<td>CLAMP</td>
+<td>CLAMP modifier</td>
+</tr>
+<tr>
+<td>16-25</td>
+<td>OPCODE</td>
+<td>Operation code</td>
+</tr>
+<tr>
+<td>26-31</td>
+<td>ENCODING</td>
+<td>Encoding type. Must be 0b110100</td>
+</tr>
+<tr>
+<td>32-40</td>
+<td>SRC0</td>
+<td>First (scalar or vector) source operand</td>
+</tr>
+<tr>
+<td>41-49</td>
+<td>SRC1</td>
+<td>Second (scalar or vector) source operand</td>
+</tr>
+<tr>
+<td>50-58</td>
+<td>SRC2</td>
+<td>Third (scalar or vector) source operand</td>
+</tr>
+<tr>
+<td>59-60</td>
+<td>OMOD</td>
+<td>OMOD modifier. Multiplication modifier</td>
+</tr>
+<tr>
+<td>61-63</td>
+<td>NEG</td>
+<td>Negation modifier for source operands</td>
+</tr>
+</tbody>
+</table>
+<p>Syntax: INSTRUCTION SDST(2), SRC0, SRC1 [MODIFIERS]</p>
+<p>Modifiers:</p>
+<ul>
+<li>-SRC - negate floating point value from source operand. Applied after ABS modifier.</li>
+<li>ABS(SRC) - apply absolute value to source operand</li>
+</ul>
+<p>NOTE: ABS and negation is applied to source operand for any instruction.</p>
+<p>Negation and absolute value can be combined: <code>-ABS(V0)</code>.</p>
+<p>Limitations for operands:</p>
+<ul>
+<li>only one SGPR can be read by instruction. Multiple occurrences of this same
+SGPR is allowed</li>
+<li>only one literal constant can be used, and only when a SGPR or M0 is not used in
+source operands</li>
+<li>only SRC0 can holds LDS_DIRECT</li>
+</ul>
+<p>VOPC opcodes (0-255) and VOP3 opcodes are same.</p>
+<h3>Tables of opcodes and their descriptions (GCN 1.0/1.1)</h3>
+<p>Table of floating point comparison instructions by opcode:</p>
+<table>
+<thead>
+<tr>
+<th>Opcode range</th>
+<th>Instruction</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0-15 (0x00-0x0f)</td>
+<td>V_CMP_{OP16}_F32</td>
+<td>Signal on sNAN input only. Single FP values.</td>
+</tr>
+<tr>
+<td>16-31 (0x10-0x1f)</td>
+<td>V_CMPX_{OP16}_F32</td>
+<td>Signal on sNAN input only. Write result to EXEC. Single FP values.</td>
+</tr>
+<tr>
+<td>32-47 (0x20-0x2f)</td>
+<td>V_CMP_{OP16}_F64</td>
+<td>Signal on sNAN input only. Double FP values.</td>
+</tr>
+<tr>
+<td>48-63 (0x30-0x3f)</td>
+<td>V_CMPX_{OP16}_F64</td>
+<td>Signal on sNAN input only. Write result to EXEC. Double FP values.</td>
+</tr>
+<tr>
+<td>64-79 (0x40-0x4f)</td>
+<td>V_CMP_{OP16}_F32</td>
+<td>Signal on any sNAN. Single FP values.</td>
+</tr>
+<tr>
+<td>80-95 (0x50-0x5f)</td>
+<td>V_CMPX_{OP16}_F32</td>
+<td>Signal on any sNAN. Write result to EXEC. Single FP values.</td>
+</tr>
+<tr>
+<td>96-111 (0x60-0x6f)</td>
+<td>V_CMP_{OP16}_F64</td>
+<td>Signal on any sNAN. Double FP values.</td>
+</tr>
+<tr>
+<td>112-127 (0x70-0x7f)</td>
+<td>V_CMPX_{OP16}_F64</td>
+<td>Signal on any sNAN. Write result to EXEC. Double FP values.</td>
+</tr>
+</tbody>
+</table>
+<p>Table of OP16 (compare operations) for floating point values comparisons:</p>
+<table>
+<thead>
+<tr>
+<th>Opcode offset</th>
+<th>OP16 name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0 (0x0)</td>
+<td>F</td>
+<td>SDST(LANEID) = 0</td>
+</tr>
+<tr>
+<td>1 (0x1)</td>
+<td>LT</td>
+<td>SDST(LANEID) = (TYPE)SRC0 &lt; (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>2 (0x2)</td>
+<td>EQ</td>
+<td>SDST(LANEID) = (TYPE)SRC0 == (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>3 (0x3)</td>
+<td>LE</td>
+<td>SDST(LANEID) = (TYPE)SRC0 &lt;= (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>4 (0x4)</td>
+<td>GT</td>
+<td>SDST(LANEID) = (TYPE)SRC0 &gt; (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>5 (0x5)</td>
+<td>LG</td>
+<td>SDST(LANEID) = (TYPE)SRC0 != (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>6 (0x6)</td>
+<td>GE</td>
+<td>SDST(LANEID) = (TYPE)SRC0 &gt;= (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>7 (0x7)</td>
+<td>O</td>
+<td>SDST(LANEID) = ((TYPE)SRC0!=NAN &amp;&amp; (TYPE)SRC1!=NAN)</td>
+</tr>
+<tr>
+<td>8 (0x8)</td>
+<td>U</td>
+<td>SDST(LANEID) = ((TYPE)SRC0!=NAN</td>
+</tr>
+<tr>
+<td>9 (0x9)</td>
+<td>NGE</td>
+<td>SDST(LANEID) = !((TYPE)SRC0 &gt;= (TYPE)SRC1)</td>
+</tr>
+<tr>
+<td>10 (0xa)</td>
+<td>NLG</td>
+<td>SDST(LANEID) = !((TYPE)SRC0 != (TYPE)SRC1)</td>
+</tr>
+<tr>
+<td>11 (0xb)</td>
+<td>NGT</td>
+<td>SDST(LANEID) = !((TYPE)SRC0 &gt; (TYPE)SRC1)</td>
+</tr>
+<tr>
+<td>12 (0xc)</td>
+<td>NLE</td>
+<td>SDST(LANEID) = !((TYPE)SRC0 &lt;= (TYPE)SRC1)</td>
+</tr>
+<tr>
+<td>13 (0xd)</td>
+<td>NEG</td>
+<td>SDST(LANEID) = !((TYPE)SRC0 == (TYPE)SRC1)</td>
+</tr>
+<tr>
+<td>14 (0xe)</td>
+<td>NLT</td>
+<td>SDST(LANEID) = !((TYPE)SRC0 &lt; (TYPE)SRC1)</td>
+</tr>
+<tr>
+<td>15 (0xf)</td>
+<td>T</td>
+<td>SDST(LANEID) = 1</td>
+</tr>
+</tbody>
+</table>
+<p>NOTE: Comparison operators (&lt;,&lt;=,!=,==) compares only non NaN values. If any operand is NaN
+then returns false. By contrast, negations of comparisons (NLT, NGT) returns true
+if any operand is NaN value. This feature distinguish for example NGE from LT.  </p>
+<p>LANEID in description is lane id. TYPE is type of compared values (FLOAT for _FP32,
+DOUBLE for _FP64).</p>
+<p>Sample instructions:<br />
+<code>V_CMPX_LT_F32 VCC, V0, V1  # V0&lt;V1
+V_CMPSX_EQ_F32 VCC, V0, V1 # V0==V1, store result to EXEC, signal for any sNaN
+V_CMPX_LT_F64 VCC, V[2:3], V[4:5]  # V[2:3]&lt;V[4:5]</code></p>
+<p>Table of integer comparison instructions by opcode:</p>
+<table>
+<thead>
+<tr>
+<th>Opcode range</th>
+<th>Instruction</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>128-135 (0x80-0x87)</td>
+<td>V_CMP_{OP8}_I32</td>
+<td>Signed 32-bit values.</td>
+</tr>
+<tr>
+<td>144-151 (0x90-0x97)</td>
+<td>V_CMPX_{OP8}_I32</td>
+<td>Write result to EXEC. Signed 32-bit values.</td>
+</tr>
+<tr>
+<td>160-167 (0xa0-0xa7)</td>
+<td>V_CMP_{OP8}_I64</td>
+<td>Signed 64-bit values.</td>
+</tr>
+<tr>
+<td>176-183 (0xb0-0xb7)</td>
+<td>V_CMPX_{OP8}_I64</td>
+<td>Write result to EXEC. Signed 64-bit values.</td>
+</tr>
+<tr>
+<td>192-199 (0xc0-0xc7)</td>
+<td>V_CMP_{OP8}_U32</td>
+<td>Unsigned 32-bit values.</td>
+</tr>
+<tr>
+<td>208-215 (0xd0-0xd7)</td>
+<td>V_CMPX_{OP8}_U32</td>
+<td>Write result to EXEC. Unsigned 32-bit values.</td>
+</tr>
+<tr>
+<td>224-231 (0xe0-0xe7)</td>
+<td>V_CMP_{OP8}_U64</td>
+<td>Unsigned 64-bit values.</td>
+</tr>
+<tr>
+<td>240-247 (0xf0-0xf7)</td>
+<td>V_CMPX_{OP8}_U64</td>
+<td>Write result to EXEC. Unsigned 64-bit values.</td>
+</tr>
+</tbody>
+</table>
+<p>Table of OP16 (compare operations) for integer values comparisons:</p>
+<table>
+<thead>
+<tr>
+<th>Opcode offset</th>
+<th>OP8 name</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0 (0x0)</td>
+<td>F</td>
+<td>SDST(LANEID) = 0</td>
+</tr>
+<tr>
+<td>1 (0x1)</td>
+<td>LT</td>
+<td>SDST(LANEID) = (TYPE)SRC0 &lt; (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>2 (0x2)</td>
+<td>EQ</td>
+<td>SDST(LANEID) = (TYPE)SRC0 == (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>3 (0x3)</td>
+<td>LE</td>
+<td>SDST(LANEID) = (TYPE)SRC0 &lt;= (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>4 (0x4)</td>
+<td>GT</td>
+<td>SDST(LANEID) = (TYPE)SRC0 &gt; (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>5 (0x5)</td>
+<td>LG, NE</td>
+<td>SDST(LANEID) = (TYPE)SRC0 != (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>6 (0x6)</td>
+<td>GE</td>
+<td>SDST(LANEID) = (TYPE)SRC0 &gt;= (TYPE)SRC1</td>
+</tr>
+<tr>
+<td>7 (0x7)</td>
+<td>T</td>
+<td>SDST(LANEID) = 1</td>
+</tr>
+</tbody>
+</table>
+<p>LANEID in description is lane id. TYPE is type of compared values (UINT32 for _U32,
+INT32 for _I32,...).</p>
+<p>Sample instructions:<br />
+<code>V_CMP_LT_U32 VCC, V0, V1  # V0&lt;V1
+V_CMPX_EQ_U32 VCC, V0, V1 # V0==V1, store result to EXEC, signal for any sNaN</code></p>
+<p>Table of class instructions:</p>
+<h3>Tables of opcodes and their descriptions (GCN 1.2)</h3>
+<h4>V_CMP_CLASS_F32</h4>
+<p>Opcode: 0x88<br />
+Syntax VOPC: V_CMP_CLASS_F32 VCC, SRC0, SRC1<br />
+Syntax VOP3: V_CMP_CLASS_F32 SDST, SRC0, SRC1<br />
+Operation: Check whether SSRC0 single floating point value belongs to one of specified class.
+Classes are specified as set bits in SRC1. If that condition is satisfied then store
+to bit of SDST with number of current lane id, otherwise clear that bit.
+No flushing denormalized values for SRC0. List of classes:</p>
+<table>
+<thead>
+<tr>
+<th>Bit</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0</td>
+<td>Signaling NaN</td>
+</tr>
+<tr>
+<td>1</td>
+<td>quiet Nan</td>
+</tr>
+<tr>
+<td>2</td>
+<td>-INF</td>
+</tr>
+<tr>
+<td>3</td>
+<td>negative normalized value</td>
+</tr>
+<tr>
+<td>4</td>
+<td>negative dernormalized value</td>
+</tr>
+<tr>
+<td>5</td>
+<td>negative zero</td>
+</tr>
+<tr>
+<td>6</td>
+<td>positive zero</td>
+</tr>
+<tr>
+<td>7</td>
+<td>positive denormalized value</td>
+</tr>
+<tr>
+<td>8</td>
+<td>positive normalized value</td>
+</tr>
+<tr>
+<td>9</td>
+<td>+INF</td>
+</tr>
+</tbody>
+</table>
+<h4>V_CMPX_CLASS_F32</h4>
+<p>Opcode: 0x98<br />
+Syntax VOPC: V_CMPX_CLASS_F32 VCC, SRC0, SRC1<br />
+Syntax VOP3: V_CMPX_CLASS_F32 SDST, SRC0, SRC1<br />
+Operation: Check whether SSRC0 single floating point value belongs to one of specified class.
+Classes are specified as set bits in SRC1. If that condition is satisfied then store
+to bit of SDST and EXEC with number of current lane id, otherwise clear that bit.
+No flushing denormalized values for SRC0. List of classes:</p>
+<table>
+<thead>
+<tr>
+<th>Bit</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0</td>
+<td>Signaling NaN</td>
+</tr>
+<tr>
+<td>1</td>
+<td>quiet Nan</td>
+</tr>
+<tr>
+<td>2</td>
+<td>-INF</td>
+</tr>
+<tr>
+<td>3</td>
+<td>negative normalized value</td>
+</tr>
+<tr>
+<td>4</td>
+<td>negative dernormalized value</td>
+</tr>
+<tr>
+<td>5</td>
+<td>negative zero</td>
+</tr>
+<tr>
+<td>6</td>
+<td>positive zero</td>
+</tr>
+<tr>
+<td>7</td>
+<td>positive denormalized value</td>
+</tr>
+<tr>
+<td>8</td>
+<td>positive normalized value</td>
+</tr>
+<tr>
+<td>9</td>
+<td>+INF</td>
+</tr>
+</tbody>
+</table>
+<h4>V_CMP_CLASS_F64</h4>
+<p>Opcode: 0xa8<br />
+Syntax VOPC: V_CMP_CLASS_F64 VCC, SRC0, SRC1(2)<br />
+Syntax VOP3: V_CMP_CLASS_F64 SDST, SRC0(2), SRC1(2)<br />
+Operation: Check whether SSRC0 double floating point value belongs to one of specified class.
+Classes are specified as set bits in SRC1. If that condition is satisfied then store
+to bit of SDST with number of current lane id, otherwise clear that bit.
+No flushing denormalized values for SRC0. List of classes:</p>
+<table>
+<thead>
+<tr>
+<th>Bit</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0</td>
+<td>Signaling NaN</td>
+</tr>
+<tr>
+<td>1</td>
+<td>quiet Nan</td>
+</tr>
+<tr>
+<td>2</td>
+<td>-INF</td>
+</tr>
+<tr>
+<td>3</td>
+<td>negative normalized value</td>
+</tr>
+<tr>
+<td>4</td>
+<td>negative dernormalized value</td>
+</tr>
+<tr>
+<td>5</td>
+<td>negative zero</td>
+</tr>
+<tr>
+<td>6</td>
+<td>positive zero</td>
+</tr>
+<tr>
+<td>7</td>
+<td>positive denormalized value</td>
+</tr>
+<tr>
+<td>8</td>
+<td>positive normalized value</td>
+</tr>
+<tr>
+<td>9</td>
+<td>+INF</td>
+</tr>
+</tbody>
+</table>
+<h4>V_CMPX_CLASS_F64</h4>
+<p>Opcode: 0xb8<br />
+Syntax VOPC: V_CMPX_CLASS_F64 VCC, SRC0(2), SRC1(2)<br />
+Syntax VOP3: V_CMPX_CLASS_F64 SDST, SRC0(2), SRC1(2)<br />
+Operation: Check whether SSRC0 double floating point value belongs to one of specified class.
+Classes are specified as set bits in SRC1. If that condition is satisfied then store
+to bit of SDST and EXEC with number of current lane id, otherwise clear that bit.
+No flushing denormalized values for SRC0. List of classes:</p>
+<table>
+<thead>
+<tr>
+<th>Bit</th>
+<th>Description</th>
+</tr>
+</thead>
+<tbody>
+<tr>
+<td>0</td>
+<td>Signaling NaN</td>
+</tr>
+<tr>
+<td>1</td>
+<td>quiet Nan</td>
+</tr>
+<tr>
+<td>2</td>
+<td>-INF</td>
+</tr>
+<tr>
+<td>3</td>
+<td>negative normalized value</td>
+</tr>
+<tr>
+<td>4</td>
+<td>negative dernormalized value</td>
+</tr>
+<tr>
+<td>5</td>
+<td>negative zero</td>
+</tr>
+<tr>
+<td>6</td>
+<td>positive zero</td>
+</tr>
+<tr>
+<td>7</td>
+<td>positive denormalized value</td>
+</tr>
+<tr>
+<td>8</td>
+<td>positive normalized value</td>
+</tr>
+<tr>
+<td>9</td>
+<td>+INF</td>
+</tr>
+</tbody>
+</table>
+}}}