1.2. Quickstart Guide¶
Get Backend.AI GO up and running and start your first local AI chat in 5 minutes.
Step 1: Install Backend.AI GO¶
Download the installer for your platform from the official website or the GitHub Releases page.
- macOS: Open the `.dmg` file and drag Backend.AI GO to your Applications folder.
- Windows: Run the `.exe` or `.msi` installer and follow the prompts.
- Linux: Install the `.deb` package or use the `.flatpak` bundle.
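On Linux, the two package formats map to standard commands. A minimal sketch, assuming an asset named `backend-ai-go_amd64.deb` (the actual file name on the Releases page may differ):

```shell
#!/bin/sh
# Sketch: print the install hint that matches the OS reported by uname.
# The .deb file name below is an illustrative assumption, not the real
# asset name -- check the GitHub Releases page for the exact file.
install_hint() {
  case "$1" in
    Darwin)        echo "open the .dmg and drag Backend.AI GO to /Applications" ;;
    Linux)         echo "sudo dpkg -i backend-ai-go_amd64.deb" ;;
    MINGW*|MSYS*)  echo "run the .exe or .msi installer" ;;
    *)             echo "see the Installation Guide" ;;
  esac
}
install_hint "$(uname -s)"
```

For the Flatpak bundle, `flatpak install ./<bundle>.flatpak` is the usual equivalent of the `dpkg -i` line above.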
For more details, see the Installation Guide.
Step 2: Complete First-Time Setup¶
When you first launch Backend.AI GO, you'll be guided through a setup wizard:
- Select Language: Choose English or Korean for the interface.
- Choose Engine: Select your preferred inference engine (or let the app choose automatically).
- Set Models Directory: Confirm or change where models will be stored.
- Get Started: Review your settings and enter the app.
For detailed information about each step, see First-Time Setup.
Step 3: Download Your First Model¶
After completing the initial setup, the app will start. You need to download a model to begin chatting.
- Click on the Search (Hugging Face) icon in the sidebar.
- Type in a popular model name like `Gemma3-4B`, `Qwen3-4B`, or `gpt-oss-20B`.
- Look for models tagged as GGUF (most common) or MLX (if you are on macOS).
- Click the Download button next to a model variant (`Q4_K_M` is usually a good balance of speed and quality).
- Wait for the download to complete in the Downloads tab.
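If you ever want to fetch a GGUF file outside the app, files hosted on Hugging Face resolve at a predictable URL of the form `https://huggingface.co/<repo>/resolve/main/<file>`. A small sketch; the repository and file names used here are illustrative assumptions, not specific recommendations:

```shell
#!/bin/sh
# Sketch: build the direct-download URL for a GGUF file on Hugging Face.
# Repo and file names are illustrative assumptions.
hf_gguf_url() {
  repo="$1"
  file="$2"
  echo "https://huggingface.co/${repo}/resolve/main/${file}"
}

hf_gguf_url "bartowski/gemma-3-4b-it-GGUF" "gemma-3-4b-it-Q4_K_M.gguf"
# To actually download:
#   curl -L -o model.gguf "$(hf_gguf_url <repo> <file>)"
```

Note that a file fetched this way still needs to land in the models directory you chose during setup for the app to see it.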
Step 4: Load the Model¶
Once downloaded, your model will appear in the Local Models library.
- Go to the Models tab.
- Find your downloaded model and click the Load button.
- The status bar at the bottom will show the progress. Once it says "Ready," the model is active in your system's memory.
Step 5: Start Chatting!¶
- Click on the Chat icon in the sidebar.
- Type a message in the text box at the bottom (e.g., "Hello! Can you explain quantum physics in simple terms?").
- Press Enter and watch your local AI respond!
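Many local inference apps also expose an OpenAI-compatible HTTP endpoint for scripting the same conversation; whether Backend.AI GO does, and on which port, is an assumption you should verify in its documentation. The sketch below only constructs a request body in that widely used format:

```shell
#!/bin/sh
# Sketch: a chat request body in the OpenAI-compatible format accepted by
# many local inference servers. Whether Backend.AI GO exposes such an
# endpoint is an assumption -- check its docs before relying on this.
payload='{"model":"local","messages":[{"role":"user","content":"Hello! Can you explain quantum physics in simple terms?"}]}'
echo "$payload"
# If an endpoint is available (URL and port below are hypothetical):
#   curl -s http://localhost:8000/v1/chat/completions \
#     -H "Content-Type: application/json" -d "$payload"
```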
Next Steps¶
Now that you've completed your first chat, explore more advanced features:
- Using Cowork to perform complex tasks.
- Connecting Cloud Providers to combine local and cloud AI.
- Benchmarking to see how fast your machine really is.