Installation

This guide covers how to install Tiny-LLM on your system.

System Requirements

Hardware Requirements

GPU: NVIDIA GPU with Compute Capability 7.0+ (Volta or newer)
Memory: Minimum 8GB GPU memory for inference
Storage: 2GB for build artifacts

Software Requirements

Operating System: Linux (Ubuntu 20.04+ recommended) or Windows 10+
CUDA Toolkit: 11.0 or higher
CMake: 3.18 or higher
C++ Compiler: C++17 compatible (GCC 9+, Clang 10+, MSVC 2019+)

Installation Steps

1. Install CUDA Toolkit

Ensure CUDA Toolkit 11.0+ is installed and nvcc is in your PATH:

bash

nvcc --version

2. Clone the Repository

bash

git clone https://github.com/AICL-Lab/tiny-llm.git
cd tiny-llm

3. Configure the Build

bash

cmake -S . -B build -DCMAKE_BUILD_TYPE=Release

4. Build

bash

cmake --build build -j$(nproc)

5. Run Tests (Optional)

bash

ctest --test-dir build --output-on-failure

Build Options

Option	Default	Description
`CMAKE_BUILD_TYPE`	`Release`	Build type (Release/Debug)
`BUILD_TESTS`	`OFF`	Build test suite
`BUILD_EXAMPLES`	`OFF`	Build example programs
`CUDA_ARCH`	Auto	CUDA architecture (e.g., `80;86`)

Troubleshooting

See the Troubleshooting Guide for common issues.

Next Steps

Quick Start Tutorial - Build your first inference
Architecture Overview - Understand the internals

Installation ​

System Requirements ​

Hardware Requirements ​

Software Requirements ​

Installation Steps ​

1. Install CUDA Toolkit ​

2. Clone the Repository ​

3. Configure the Build ​

4. Build ​

5. Run Tests (Optional) ​

Build Options ​

Troubleshooting ​

Next Steps ​

Installation

System Requirements

Hardware Requirements

Software Requirements

Installation Steps

1. Install CUDA Toolkit

2. Clone the Repository

3. Configure the Build

4. Build

5. Run Tests (Optional)

Build Options

Troubleshooting

Next Steps