Skip to content
Change the repository type filter

All

    Repositories list

    • Tests/Scripts for use with foundation-model-stack models running on AIU
      Python
      314227Updated Dec 23, 2025Dec 23, 2025
    • Tiny adapter that runs on top of vllm and provides detector APIs
      Python
      7473Updated Dec 23, 2025Dec 23, 2025
    • FMS Model Optimizer is a framework for developing reduced precision neural network models.
      Python
      1720167Updated Dec 19, 2025Dec 19, 2025
    • 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
      Python
      65542419Updated Dec 18, 2025Dec 18, 2025
    • A Triton-only attention backend for vLLM
      Python
      72302Updated Dec 18, 2025Dec 18, 2025
    • 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
      Python
      1913272Updated Dec 18, 2025Dec 18, 2025
    • High-performance safetensors model loader
      Python
      168710Updated Dec 17, 2025Dec 17, 2025
    • Go
      84080Updated Dec 16, 2025Dec 16, 2025
    • Go
      0000Updated Dec 15, 2025Dec 15, 2025
    • ⚡️⚡️ Supercharge your fine-tuning users with completely automated tuning configurations!
      Python
      2430Updated Dec 14, 2025Dec 14, 2025
    • 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
      Python
      992164760Updated Dec 11, 2025Dec 11, 2025
    • Estimate resources needed to train LLMs
      Python
      91330Updated Dec 11, 2025Dec 11, 2025
    • 🚀 Guardrails orchestration server for application of various detections on text generation input and output.
      Rust
      3727292Updated Dec 9, 2025Dec 9, 2025
    • fms-fsdp

      Public
      🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
      Python
      462752310Updated Nov 24, 2025Nov 24, 2025
    • fms-dgt

      Public archive
      Synthetic Data Generation for Foundation Models
      Python
      242121Updated Nov 10, 2025Nov 10, 2025
    • Operator that enables EFA and/or GDRCOPY in an OpenShift cluster
      Go
      0100Updated Jul 23, 2025Jul 23, 2025
    • bamba

      Public
      Train, tune, and infer Bamba model
      Python
      1413741Updated Jun 4, 2025Jun 4, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      32k000Updated Feb 7, 2025Feb 7, 2025
    • Scan resources consumed during LLM training
      Python
      4810Updated Jan 14, 2025Jan 14, 2025
    • blog

      Public
      Public repo for HF blog posts
      Jupyter Notebook
      958001Updated Dec 18, 2024Dec 18, 2024
    • Demonstration of MoE distributed training using various techniques
      Python
      01010Updated Oct 31, 2024Oct 31, 2024
    • Dockerfile
      4310Updated Oct 28, 2024Oct 28, 2024
    • pod-vllm

      Public
      Source code to launch a number of pods, performing synthetic data generation
      Python
      1101Updated Oct 22, 2024Oct 22, 2024
    • Go
      2191Updated Oct 10, 2024Oct 10, 2024
    • Python
      112435Updated Sep 9, 2024Sep 9, 2024
    • avengers

      Public
      Shell
      0040Updated Jul 20, 2024Jul 20, 2024
    • trl

      Public
      Train transformer language models with reinforcement learning.
      Python
      2.4k002Updated Mar 5, 2024Mar 5, 2024
    • Training job management tool for foundation model service
      Python
      4560Updated Feb 28, 2024Feb 28, 2024
    • Training operators on Kubernetes.
      Python
      855000Updated Nov 16, 2022Nov 16, 2022