Run models locally using Ollama for privacy, offline access, and control. Requires initial setup and sufficient hardware. Website: https://ollama.com/

Setup

  1. Install Ollama: Download the installer from ollama.com and run it
  2. Start Ollama: Run ollama serve in a terminal
  3. Download a model:
    ollama pull qwen2.5-coder:32b
    
  4. Configure the context window (a non-interactive Modelfile alternative is sketched after this list):
    ollama run qwen2.5-coder:32b
    /set parameter num_ctx 32768
    /save your_custom_model_name
    /bye
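
If you prefer not to set the parameter interactively, the same context-window override can be captured in a Modelfile and built with ollama create. This uses standard Ollama tooling; the model tag and saved name below are simply the examples from this guide:

    # Modelfile: derive a model with a larger context window
    FROM qwen2.5-coder:32b
    PARAMETER num_ctx 32768

Build it from the directory containing the Modelfile:

    ollama create your_custom_model_name -f Modelfile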
    

Configuration in CodinIT

  1. Click the settings icon (⚙️) in CodinIT
  2. Select “ollama” as the API Provider
  3. Enter the model name you saved during setup (e.g., your_custom_model_name)
  4. (Optional) Set the base URL if you are not using the default http://localhost:11434 (a quick connectivity check follows this list)
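
Before pointing CodinIT at the server, you can confirm Ollama is reachable at that base URL. These are standard Ollama checks, not CodinIT-specific, and assume the default port:

    # Should respond with "Ollama is running"
    curl http://localhost:11434

    # Lists downloaded models; your saved model name should appear here
    ollama list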

Recommended Models

  • qwen2.5-coder:32b - Excellent for coding
  • codellama:34b-code - High quality, large size
  • deepseek-coder:6.7b-base - Effective for coding
  • llama3:8b-instruct-q5_1 - General tasks
See the Ollama model library (https://ollama.com/library) for the full list.
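
To sanity-check a model outside CodinIT, you can send it a one-off prompt through Ollama's REST API (shown with curl against the default port; swap in whichever model you pulled):

    curl http://localhost:11434/api/generate -d '{
      "model": "qwen2.5-coder:32b",
      "prompt": "Write a function that reverses a string.",
      "stream": false
    }'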

Notes

  • Context window: Minimum 12,000 tokens recommended, 32,000 ideal
  • Resource demands: Large models require significant system resources
  • Offline capability: Works without internet after model download
  • Performance: May be slow on average hardware
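
To see how much memory a loaded model is actually using and whether it is running on the GPU or CPU, recent Ollama versions include a ps command:

    # Shows loaded models, their memory footprint, and CPU/GPU placement
    ollama ps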