Stars
5 stars written in C++
The C-based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Transformer-related optimization, including BERT and GPT
Platform for building access networks and modular network services