FBGEMM and FBGEMM_GPU Documentation Homepage
Welcome to the documentation page for the FBGEMM and FBGEMM_GPU libraries!
FBGEMM (Facebook GEneral Matrix Multiplication) is a low-precision, high-performance library for matrix-matrix multiplication and convolution, intended for server-side inference. It serves as a backend for PyTorch quantized operators on x86 machines.
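To make "low-precision" concrete, the following is a minimal sketch of an 8-bit affine-quantized dot product, the building block of a quantized matrix multiplication. It is plain Python written for illustration and does not use FBGEMM's API or reflect its kernels; the function names `quantize` and `quantized_dot`, the scale of 1/64, and the zero point of 128 are all assumptions chosen so the arithmetic is easy to follow.

```python
# Illustrative sketch only: plain Python, not FBGEMM's API or kernels.
# All names and parameter values here are hypothetical.

def quantize(values, scale, zero_point):
    """Map floats to unsigned 8-bit integers with an affine scheme:
    q = clamp(round(v / scale) + zero_point, 0, 255)."""
    return [max(0, min(255, round(v / scale) + zero_point)) for v in values]

def quantized_dot(a_q, b_q, a_zp, b_zp, a_scale, b_scale):
    """Dot product on integers with a wide accumulator, rescaled to float."""
    acc = sum((x - a_zp) * (y - b_zp) for x, y in zip(a_q, b_q))
    return acc * a_scale * b_scale

scale, zero_point = 1.0 / 64, 128      # assumed quantization parameters
a = [0.5, -1.0, 1.5]
b = [1.0, 0.25, -0.5]

a_q = quantize(a, scale, zero_point)
b_q = quantize(b, scale, zero_point)

approx = quantized_dot(a_q, b_q, zero_point, zero_point, scale, scale)
exact = sum(x * y for x, y in zip(a, b))
```

The inputs here were picked to be exactly representable at the chosen scale, so `approx` matches `exact`. In general the quantized result is an approximation, and the scale and zero point are typically derived from the observed value range of each tensor.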
FBGEMM_GPU (FBGEMM GPU Kernels Library) is a collection of high-performance PyTorch GPU operator libraries for training and inference. It is built on top of FBGEMM and provides efficient table batched embedding bag operators, data layout transformations, and quantization support.
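As a rough picture of what a table batched embedding bag computes, the sketch below sum-pools embedding rows per bag and repeats that over several tables. It is plain Python for illustration only and is not FBGEMM_GPU's API; the helper names `embedding_bag_sum` and `batched_embedding_bag` are hypothetical, and the convention that `offsets` holds the start position of each bag is an assumption borrowed from PyTorch's `EmbeddingBag`.

```python
# Illustrative sketch only: plain Python, not FBGEMM_GPU's API.
# Helper names and the data layout are hypothetical.

def embedding_bag_sum(table, indices, offsets):
    """Sum-pool rows of `table` per bag.
    Bag i covers indices[offsets[i]:offsets[i + 1]]; the last bag runs
    to the end of `indices`."""
    bounds = list(offsets) + [len(indices)]
    dim = len(table[0])
    pooled = []
    for start, end in zip(bounds, bounds[1:]):
        bag = [0.0] * dim
        for idx in indices[start:end]:
            for d in range(dim):
                bag[d] += table[idx][d]
        pooled.append(bag)
    return pooled

def batched_embedding_bag(tables, indices_per_table, offsets_per_table):
    """Apply the pooled lookup to every table in one call."""
    return [
        embedding_bag_sum(table, indices, offsets)
        for table, indices, offsets in zip(
            tables, indices_per_table, offsets_per_table
        )
    ]

table_a = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
table_b = [[10.0], [20.0]]

result = batched_embedding_bag(
    tables=[table_a, table_b],
    indices_per_table=[[0, 2, 1], [1, 1, 0]],
    offsets_per_table=[[0, 2], [0, 1]],
)
```

For `table_a`, the first bag pools rows 0 and 2 and the second bag pools row 1. A GPU implementation would fuse these per-table loops into batched kernels rather than iterating in Python; this sketch only shows the result being computed.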
Table of Contents