FBGEMM and FBGEMM_GPU Documentation Homepage¶

Welcome to the documentation page for the FBGEMM and FBGEMM_GPU libraries!

FBGEMM (Facebook GEneral Matrix Multiplication) is a low-precision, high-performance matrix-matrix multiplications and convolution library for server-side inference. This library is used as a backend of PyTorch quantized operators on x86 machines.

FBGEMM_GPU (FBGEMM GPU Kernels Library) is a collection of high-performance PyTorch GPU operator libraries for training and inference. This library is built on top of FBGEMM and provides efficient table batched embedding bag, data layout transformation, and quantization support.

Table of Contents

General Info

FBGEMM Development

Build Instructions

FBGEMM_GPU Development

FBGEMM_GPU Overview

Jagged Tensor Operators

FBGEMM C++ API

Quantization Utilities

FBGEMM_GPU C++ API

FBGEMM_GPU Python API

FBGEMM and FBGEMM_GPU Documentation Homepage¶

Docs

Tutorials

Resources