Skip to content

AI-Hypercomputer/accelerator-agents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Accelerator Agents (Preview)

🚧 Work in Progress / Coming Soon 🚧

This project is currently under active development. The agents and features described below represent the product roadmap and are not yet fully available for use. Stay tuned for our initial release!

Accelerator Agents is a collection of AI-powered tools designed to accelerate machine learning development on Google Cloud TPUs. This repository will host agents that assist with code migration, kernel optimization, and performance tuning, enabling developers to leverage the full power of TPUs with greater velocity.

Disclaimer: This is not an officially supported Google product. This project is not eligible for the Google Open Source Software Vulnerability Rewards Program.

Overview

As machine learning models grow in complexity, optimizing them for specific hardware accelerators like TPUs becomes increasingly challenging. This project aims to provide a suite of "Agents"—specialized AI tools powered by Gemini—to automate and assist with these complex tasks.

Our upcoming releases will focus on two primary agents:

1. PyTorch to JAX Migration Agent (Coming Soon)

The PyTorch to JAX Migration Agent will facilitate the conversion of existing PyTorch models and codebases into JAX. It is designed to help users migrate their workloads to run efficiently on TPUs, leveraging high-performance frameworks like MaxText.

Planned Features:

  • Automated Conversion: Converts functional code blocks and model layers from PyTorch to JAX.
  • MaxText Integration: Generates JAX code compatible with the MaxText framework for immediate training and inference on TPUs.
  • Human-in-the-Loop: Designed to draft initial implementations that developers can review and refine.

2. Kernel Agent (Coming Soon)

The Kernel Agent is a specialized tool planned for high-performance kernel development on TPUs. It will assist engineers in writing, optimizing, and debugging custom kernels, specifically focusing on Pallas (JAX's kernel language).

Planned Features:

  • Kernel Writing: Drafts Pallas kernels from scratch or based on JAX reference implementations.
  • CUDA to Pallas Conversion: Assists in porting custom CUDA/GPU kernels to run optimally on TPUs.
  • Optimization & Profiling: Provides profiling insights and optimization suggestions to improve kernel performance (MFU).
  • Test Harness Generation: Automatically generates boilerplate code for correctness testing and compilation checks.

Getting Started

Note: These instructions are for the future release of the agents.

Prerequisites

  • A Google Cloud TPU VM (recommended for running the Kernel Agent).
  • Python 3.10+
  • Access to Gemini API (for agent reasoning capabilities).

Installation

Once released, you will be able to clone the repository:

git clone https://github.com/AI-Hypercomputer/accelerator-agents.git
cd accelerator-agents

(Note: Specific installation instructions for each agent can be found in their respective subdirectories.)

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details on how to submit pull requests, report issues, and contribute to the project.

License

This project is licensed under the Apache License, Version 2.0. See LICENSE for the full license text.

About

No description, website, or topics provided.

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published