2 \documentclass[slidestop
]{beamer
}
3 \usepackage{beamerthemesplit
}
8 \title{Simple-V RISC-V Extension for Vectorisation and SIMD
}
9 \author{Luke Kenneth Casson Leighton
}
16 \huge{Simple-V RISC-V Extension for Vectors and SIMD
}\\
18 \Large{Flexible Vectorisation
}\\
19 \Large{Chennai
9th Workshop
}\\
25 \frame{\frametitle{Why another Vector Extension?
}
28 \item RVV very heavy-duty (excellent for supercomputing)
\vspace{10pt
}
29 \item Simple-V abstracts parallelism (based on best of RVV)
\vspace{10pt
}
30 \item Graded levels: hardware or software-emulation
\vspace{10pt
}
31 \item Even Compressed instructions become vectorised
\vspace{10pt
}
33 What Simple-V is not:
\vspace{12pt
}
35 \item A full supercomputer-level Vector Proposal
\vspace{12pt
}
36 \item A replacement for RVV (designed to be augmented)
\vspace{12pt
}
40 \frame{\frametitle{Quick refresher on SIMD
}
43 \item SIMD very easy to implement
\vspace{10pt
}
44 \item Parallelism is in the ALU
\vspace{10pt
}
45 \item Negligeable impact for rest of core
\vspace{10pt
}
47 Where SIMD Goes Wrong:
\vspace{12pt
}
49 \item See "Why SIMD considered harmful"
\vspace{12pt
}
50 \item (Corner-cases alone are extremely complex)
\vspace{12pt
}
51 \item O($N^
{6}$) ISA proliferation
\vspace{12pt
}
55 \frame{\frametitle{Quick refresher on RVV
}
58 \item Extremely powerful (extensible to
256 registers)
\vspace{10pt
}
59 \item Supports polymorphism, several datatypes (inc. FP16)
\vspace{10pt
}
60 \item Requires a separate Register File
\vspace{10pt
}
61 \item Can be implemented as a separate pipeline
\vspace{10pt
}
63 However...
\vspace{12pt
}
65 \item 98 percent opcode duplication with rest of RV (CLIP)
\vspace{12pt
}
66 \item Extending RVV requires customisation
\vspace{12pt
}
71 \frame{\frametitle{How is Parallelism abstracted?
}
74 \item Almost all opcodes removed in favour of implicit "typing"
\vspace{10pt
}
75 \item Primarily at the Instruction issue phase (except SIMD)
\vspace{10pt
}
76 \item Standard (and future, and custom) opcodes now parallel
\vspace{10pt
}
78 What Simple-V is not:
\vspace{12pt
}
80 \item A full supercomputer-level Vector Proposal
\vspace{12pt
}
81 \item A replacement for RVV (designed to be augmented)
\vspace{12pt
}
86 \frame{\frametitle{How are SIMD Instructions Vectorised?
}
89 \item SIMD ALU(s) primarily unchanged
\vspace{10pt
}
90 \item Predication is added to each SIMD element
\vspace{10pt
}
91 \item End of Vector implicitly enables predication
\vspace{10pt
}
93 Considerations:
\vspace{12pt
}
95 \item Many SIMD ALUs possible (parallel execution)
\vspace{12pt
}
96 \item Very long SIMD ALUs could waste die area (short vectors)
\vspace{12pt
}
97 \item Implementor free to choose (API remains the same)
\vspace{12pt
}
101 \frame{\frametitle{Including a plot
}
103 % \includegraphics[height=2in]{dental.ps}\\
104 {\bf \red Dental trajectories for
27 children:
}
108 \frame{\frametitle{Creating .pdf slides in WinEdt
}
111 \item LaTeX
[Shift-Control-L
]\vspace{12pt
}
112 \item dvi2pdf
[click the button
]\vspace{24pt
}
114 To print
4 slides per page in acrobat click
\vspace{12pt
}
116 \item File/print/properties
\vspace{12pt
}
117 \item Change ``pages per sheet'' to
4\vspace{12pt
}