Cohere North Mini Code

Cohere North Mini Code is a Cohere Labs coding model release aimed at code generation, agentic software engineering, and terminal-based tasks.

Cohere and Hugging Face describe North Mini Code as a 30B-total, 3B-active mixture-of-experts model with Apache 2.0 weights on Hugging Face, BF16 and FP8 variants, and ways to try it through OpenCode, the Cohere API, and related Cohere surfaces. Treat this entry as a first read, not a recommendation. Check the original project before trusting details such as terms, limits, privacy, cost, setup, or safety.

What it is

A coding-focused MoE model

North Mini Code is presented as a 30B-total, 3B-active sparse mixture-of-experts model for coding work, software-engineering tasks, and agent-style terminal workflows.
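To make the 30B-total, 3B-active description concrete, here is a rough weight-only estimate. It is a back-of-envelope sketch, not a figure from Cohere: it assumes the rounded parameter counts above, 2 bytes per parameter for BF16 and 1 byte for FP8, and it ignores KV cache, activations, and runtime overhead. Real checkpoints may keep some tensors at higher precision, so check the model cards for actual file sizes. The function names are illustrative.

```python
# Back-of-envelope weight sizing for a sparse mixture-of-experts model.
# Assumption: "30B-total, 3B-active" are rounded figures from the release materials.
TOTAL_PARAMS = 30e9    # all experts; these weights generally need to be loaded
ACTIVE_PARAMS = 3e9    # parameters used per token; drives per-token compute

# Standard storage widths for these formats (bytes per parameter).
BYTES_PER_PARAM = {"bf16": 2, "fp8": 1}


def weight_gb(params: float, dtype: str) -> float:
    """Approximate weight storage in decimal gigabytes for a given dtype."""
    return params * BYTES_PER_PARAM[dtype] / 1e9


def active_fraction(total: float = TOTAL_PARAMS, active: float = ACTIVE_PARAMS) -> float:
    """Share of parameters that participate in each token's forward pass."""
    return active / total


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: roughly {weight_gb(TOTAL_PARAMS, dtype):.0f} GB of weights")
    print(f"active share per token: {active_fraction():.0%}")
```

The point of the sketch is the split: memory planning follows the total parameter count, while per-token compute follows the active count.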

Why it stands out

Built around coding-agent harnesses

The release materials frame the model around code generation, agentic software engineering, terminal tasks, OpenCode, SWE-Bench-style harnesses, and Terminal-Bench-style evaluation rather than only chat completions.

Availability

Weights and hosted try paths

The official materials list BF16 and FP8 model pages on Hugging Face, Apache 2.0 licensing, OpenCode access for trying the model before downloading weights, and Cohere API or hosted Cohere deployment paths, which readers should inspect separately.

What makes it useful

Cohere North Mini Code connects a coding-focused MoE model to agentic software engineering, terminal tasks, OpenCode access, BF16 and FP8 variants, and benchmark harnesses. That gives readers a model-plus-workflow source trail to inspect rather than a plain chat-model listing.

What to know

Where it fits

Open it beside coding-agent tools, model-serving runtimes, and other agentic coding models when comparing what can run locally, what needs hosted access, and how much of the workflow comes from the model versus the surrounding harness.

Notable points

What stands out

The Hugging Face and Cohere materials list a 256K total context length, 64K max generation, Apache 2.0 licensing, BF16 and FP8 variants, SGLang and vLLM-oriented setup notes, OpenCode access, and Cohere-reported benchmark methodology across SWE-Bench, Terminal-Bench, SciCode, and LiveCodeBench-style checks.
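The listed context figures imply a simple token budget. The sketch below assumes that "256K" and "64K" mean 256 × 1024 and 64 × 1024 tokens, and that the total context covers prompt plus generated tokens; both are assumptions to confirm against the model card and the serving runtime. The function name is illustrative.

```python
# Token-budget arithmetic under stated assumptions (see lead-in).
CONTEXT_WINDOW = 256 * 1024   # assumed meaning of "256K total context length"
MAX_GENERATION = 64 * 1024    # assumed meaning of "64K max generation"


def prompt_budget(reserved_output: int,
                  context_window: int = CONTEXT_WINDOW,
                  max_generation: int = MAX_GENERATION) -> int:
    """Tokens left for the prompt after reserving room for generated output."""
    if reserved_output < 0:
        raise ValueError("reserved_output must be non-negative")
    if reserved_output > max_generation:
        raise ValueError("reserved_output exceeds the maximum generation length")
    return context_window - reserved_output


if __name__ == "__main__":
    # Reserving the full generation allowance leaves the smallest prompt budget.
    print(prompt_budget(MAX_GENERATION))
    # A short reply leaves most of the window for repository or terminal context.
    print(prompt_budget(4096))
```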

Before using

What to review

The Hugging Face model cards, Cohere blog post, license text, acceptable-use terms, setup notes, and current access limits before relying on the model in a project.

Which path is actually being used: BF16 weights, FP8 weights, local runtime, OpenCode, Cohere API, Model Vault, or another provider route.

Hardware and runtime requirements, especially GPU memory, SGLang or vLLM version notes, tool-call parsing, and whether the FP8 checkpoint fits the planned serving stack.

Coding-agent permissions before connecting the model to terminals, repositories, package managers, browsers, credentials, private code, or production systems.

Generated-code review, tests, dependency checks, security review, and privacy settings before using model output in real software work.

Cohere-reported benchmark and provider claims as source claims to inspect, not as a LifeHubber performance judgment.
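The permissions item above can be made concrete with a small command filter placed in front of whatever terminal tool an agent harness exposes. This is a hypothetical sketch, not part of OpenCode or any Cohere tooling: the allowlist contents and the `is_allowed` name are made up for illustration, and a filter like this is only a first check, not a sandbox or a security boundary.

```python
import shlex

# Hypothetical allowlist; choose programs that match your own risk tolerance.
ALLOWED_COMMANDS = {"ls", "cat", "git", "pytest"}

# Characters that enable chaining, redirection, or substitution in a shell.
SHELL_METACHARACTERS = set(";|&><`$")


def is_allowed(command_line: str) -> bool:
    """Return True only for a single allowlisted program with no shell metacharacters."""
    if any(ch in SHELL_METACHARACTERS for ch in command_line):
        return False
    try:
        tokens = shlex.split(command_line)
    except ValueError:
        # Unbalanced quotes or similar parse errors are rejected outright.
        return False
    if not tokens:
        return False
    return tokens[0] in ALLOWED_COMMANDS


if __name__ == "__main__":
    for candidate in ["git status", "rm -rf /", "ls; rm -rf /", "pytest -q"]:
        verdict = "allow" if is_allowed(candidate) else "deny"
        print(f"{verdict}: {candidate}")
```

A filter of this kind still lets an allowlisted program read or change anything the process can reach, so it complements, rather than replaces, running the agent in an isolated environment with limited credentials.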

Reader fit

Who may find it relevant

Readers comparing public coding models that can sit underneath software agents and terminal-based coding workflows.

Builders looking at OpenCode-style harnesses, local or hosted inference paths, and model choices for agentic software engineering experiments.

Less relevant for readers who only want a general consumer chatbot, a no-setup coding assistant, or a small model for everyday laptop use.

Editorial note

Why LifeHubber lists it

North Mini Code is included as a source-visible coding-model release for readers tracking how coding assistants and software agents are moving from chat-style help toward model-plus-harness workflows that operate in terminals and repositories.

Source links

Source materials

Hugging Face article

BF16 Hugging Face model page

FP8 Hugging Face model page

Cohere blog post

Reader note

Before relying on this entry

LifeHubber lists entries to help readers inspect AI projects, not to endorse them or prove they are safe, suitable, accurate, maintained, or right for a specific use. We do not verify every entry in depth. Before relying on anything listed, review the original materials, terms, privacy practices, limits, and risks that matter for your situation.
