ISA Extension Spec

Instructions

The matrix register model is defined in: src/arch/riscv/regs/mat.hh

The same raw storage is reinterpreted as:

mat.hh also contains the helper functions used by the memory micro-ops:

decoder.isa:

Check and remove ENABLE_QMAT

decode RD: the naming is invalid

Matrix instruction format support lives in: src/arch/riscv/isa/formats/matrix.isa

Includes:

Arithmetic instructions:

mzero clears the destination matrix register
mmaqa_b performs 16-element signed byte dot products per output element
mmada_h performs 8-element signed halfword dot products per output element
mmasa_w performs 4-element signed word dot products per output element
fmmacc_s performs 4-element fp32 dot products per output element
fmmacc_h converts fp16 bit patterns to fp32 and accumulates in fp32
fmmacc_b currently treats each 8-bit lane as an integer value and converts it
to float before accumulation
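
The per-element dot product described above can be sketched for the byte case as follows. This is a minimal illustration, not the code in matrix.isa: the names ByteRow, kRowBytes, and dotAccumulateI8 are hypothetical, the 16-byte row width is inferred from the 16-element figure given for mmaqa_b, and adding the sum to an existing 32-bit accumulator is an assumption.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Assumption: one operand row holds 16 signed bytes (4 x 32-bit words).
constexpr int kRowBytes = 16;
using ByteRow = std::array<int8_t, kRowBytes>;

// Hypothetical helper: computes one output element as
// acc + sum over k of a[k] * b[k], widening each product to 32 bits.
inline int32_t dotAccumulateI8(int32_t acc, const ByteRow &a, const ByteRow &b)
{
    for (int k = 0; k < kRowBytes; ++k)
        acc += static_cast<int32_t>(a[k]) * static_cast<int32_t>(b[k]);
    return acc;
}

// Usage example: 16 products of 1 * 2 added to an initial accumulator of 10.
inline void checkDotExample()
{
    ByteRow a, b;
    a.fill(1);
    b.fill(2);
    assert(dotAccumulateI8(10, a, b) == 42);
}
```

Under the same assumptions, the halfword and word variants would differ only in lane width and element count (8 and 4 per output element respectively).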

Matrix memory instructions are expanded into row-level micro-ops during decoding:

mld_w expands into 4 row-load micro-ops
mst_b, mst_h, mst_w each expand into 4 row-store micro-ops
- mst_w: 4 x 32-bit words per row
- mst_h: each 32-bit word split into low 16 bits then high 16 bits
- mst_b: each 32-bit word split into 4 little-endian bytes
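
The element ordering listed above can be sketched as follows. This is an illustration under stated assumptions rather than the micro-op code itself: Row, splitRowToHalfwords, and splitRowToBytes are hypothetical names, and the sketch only shows the order in which one row's 32-bit words are broken into smaller store elements, not how addresses are generated.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <vector>

// One matrix row as stored for mst_w: 4 x 32-bit words.
using Row = std::array<uint32_t, 4>;

// mst_h-style ordering: for each word, the low 16 bits, then the high 16 bits.
inline std::vector<uint16_t> splitRowToHalfwords(const Row &row)
{
    std::vector<uint16_t> out;
    out.reserve(row.size() * 2);
    for (uint32_t w : row) {
        out.push_back(static_cast<uint16_t>(w & 0xFFFFu));
        out.push_back(static_cast<uint16_t>(w >> 16));
    }
    return out;
}

// mst_b-style ordering: for each word, 4 bytes, least significant first.
inline std::vector<uint8_t> splitRowToBytes(const Row &row)
{
    std::vector<uint8_t> out;
    out.reserve(row.size() * 4);
    for (uint32_t w : row)
        for (int i = 0; i < 4; ++i)
            out.push_back(static_cast<uint8_t>((w >> (8 * i)) & 0xFFu));
    return out;
}

// Usage example: the word 0x11223344 yields halfwords 0x3344, 0x1122
// and bytes 0x44, 0x33, 0x22, 0x11.
inline void checkSplitExample()
{
    const Row row = {0x11223344u, 0u, 0u, 0u};
    const auto h = splitRowToHalfwords(row);
    const auto b = splitRowToBytes(row);
    assert(h[0] == 0x3344 && h[1] == 0x1122);
    assert(b[0] == 0x44 && b[1] == 0x33 && b[2] == 0x22 && b[3] == 0x11);
}
```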

Architectural visibility:
mld_w uses atomic final visibility semantics: