AI Resources
A selective list of notable AI models, tools, datasets, and experiments, organized for easier browsing.
LifeHubber AI Resources is selective, not exhaustive. It is an editorial starting point, not an endorsement. Availability, access, usage limits, and terms can vary, so please review original project materials before relying on a resource.
What this is
A selective collection of AI projects, tools, and resources organized for easier browsing.
How to use it
Each entry includes a short description, source label, and direct link to make scanning simpler.
Editorial approach
The aim is a cleaner signal: fewer items, clearer categories, and room for future updates.
Popular now
A few popular places to start, or scroll down for the full list.
google/gemma-4
A family of multimodal models from Google DeepMind that handle text and image input and generate text output.
MiniMaxAI/MiniMax-M2.7
A large MiniMax model focused on agentic work, software engineering, tool use, and complex productivity workflows.
Donchitos/Claude-Code-Game-Studios
A multi-agent game-development studio system for Claude Code, organized around specialized agents, workflow skills, hooks, rules, and templates.
paperclipai/paperclip
A Node.js server and React UI for orchestrating teams of AI agents, assigning goals, and tracking work and costs from one dashboard.
ace-step/ACE-Step-1.5
A local music generation model aimed at fast song creation on consumer hardware, with support across CUDA, AMD, Intel, Mac, and CPU setups.
Recently added
Fresh AI resources added for browsing.
VectifyAI/PageIndex
A vectorless, reasoning-based RAG framework for long-document retrieval, tree-structured indexing, traceable document search, and agent context workflows.
GVCLab/PersonaLive
A portrait image-animation framework for live-streaming-style video generation research, with offline and online inference, pretrained weights, a Web UI, and acceleration notes.
bytedance/deer-flow
A ByteDance long-horizon agent harness for deep research, coding, file work, report generation, skills, sub-agents, memory, sandboxed execution, and message gateways.
deepseek-ai/DeepSeek-OCR-2
A newer DeepSeek OCR model release for image/PDF OCR, document-to-Markdown workflows, dynamic resolution, vLLM/Transformers inference, and visual causal flow research.
1weiho/open-slide
An agent-native slide framework for building React-based decks with coding agents, browser preview, comments, assets, present mode, and HTML/PDF export.
Browse the list
Showing all 102 resources
AI Models
louis-e/arnis
Generates real-world locations inside Minecraft with a surprisingly high level of detail.
deepseek-ai/DeepSeek-OCR-2
A newer DeepSeek OCR model release for image/PDF OCR, document-to-Markdown workflows, dynamic resolution, vLLM/Transformers inference, and visual causal flow research.
deepseek-ai/deepseek-v4
A DeepSeek model family release positioned around long-context intelligence, reasoning modes, coding benchmarks, and agentic task evaluation.
google/gemma-4
A family of multimodal models from Google DeepMind that handle text and image input and generate text output.
zai-org/GLM-5.1
A flagship text-generation model positioned around agentic engineering, stronger coding performance, and longer-horizon tool use.
zai-org/GLM-OCR
A multimodal OCR model for complex document understanding, positioned around strong real-world document parsing and efficient deployment.
AngelSlim/Hy-MT1.5-1.8B-1.25bit
A low-bit on-device translation model from AngelSlim, positioned around 33-language offline translation, GGUF access, Android demo use, and 1.25-bit compression.
tencent/Hy3-preview
A Tencent Hy Team MoE model positioned around long-context reasoning, instruction following, coding, and agent task evaluation.
moonshotai/Kimi-K2.6
A multimodal agentic model positioned around long-horizon coding, tool use, autonomous execution, and broader software workflows.
LiquidAI/LFM2.5-350M
A hybrid model in the LFM2.5 family built for on-device deployment, with extended pre-training and reinforcement learning.
inclusionAI/Ling-2.6-flash
An inclusionAI instruct model positioned around faster responses, token efficiency, tool use, multi-step planning, and agent-oriented workloads.
robbyant/lingbot-map
A feed-forward 3D foundation model for streaming scene reconstruction, positioned around geometric consistency, long sequences, and efficient real-time inference.
nv-tlabs/lyra
A series of generative 3D world models from NVIDIA, positioned around explorable scenes, 3D consistency, and world-scale generation workflows.
XiaomiMiMo/mimo-v25
A Xiaomi MiMo model family positioned around multimodal understanding, agentic workflows, long-context use, and Pro variants for harder software and tool-heavy tasks.
MiniMaxAI/MiniMax-M2.7
A large MiniMax model focused on agentic work, software engineering, tool use, and complex productivity workflows.
OpenMOSS-Team/moss-vl
An OpenMOSS vision-language family with Base and Instruct releases for image, video, OCR, and document understanding work.
Qwen/Qwen3.6-35B-A3B
An open-weight multimodal model positioned around agentic coding, tool use, long-context work, and real-world software workflows.
google-deepmind/tips
A family of vision-language encoders from Google DeepMind, positioned around image-text pretraining, spatial awareness, and general-purpose multimodal applications.
microsoft/TRELLIS.2
A Microsoft 3D generation model for high-fidelity image-to-3D asset creation, using O-Voxel structured latents, PBR materials, inference code, and training tools.
arcee-ai/trinity-large-thinking
A model designed for coherent multi-turn behavior, clean tool use, constrained instruction following, and efficient serving at scale.
allenai/WildDet3D
A promptable 3D detection system for real-world scenes, positioned around text, point, and box prompts for spatial perception workflows.
Speech Models
CohereLabs/cohere-transcribe-03-2026
A 2B-parameter automatic speech recognition model for audio-in, text-out transcription across 14 languages.
fishaudio/s2-pro
A text-to-speech model with detailed control over prosody and emotional delivery.
KittenML/KittenTTS
A very small text-to-speech model designed to stay lightweight without feeling toy-like.
XiaomiMiMo/MiMo-V2.5-ASR
A Xiaomi MiMo speech-recognition model focused on Mandarin, English, Chinese dialects, code-switched speech, noisy audio, songs, and multi-speaker transcription.
OpenMOSS/MOSS-Audio
An audio-understanding model family for speech, sound, music, captioning, time-aware QA, ASR, and reasoning over real-world audio.
OpenMOSS/MOSS-TTS-Nano
A tiny multilingual speech generation model positioned for real-time TTS, CPU-friendly local use, and lightweight deployment.
NVIDIA/personaplex
A real-time full-duplex speech-to-speech conversational model with persona control through role prompts and voice conditioning.
sbintuitions/sarashina2.2-tts
A Japanese-centric text-to-speech system from SB Intuitions, with Japanese and English generation, style transfer, and zero-shot voice generation support.
HumeAI/tada
A speech-language model that aligns speech and text into a single synchronized stream.
openbmb/VoxCPM2
A multilingual text-to-speech model with voice design, controllable voice cloning, and streaming support.
Music / Image Gen Models
ace-step/ACE-Step-1.5
A local music generation model aimed at fast song creation on consumer hardware, with support across CUDA, AMD, Intel, Mac, and CPU setups.
VAST-AI-Research/AniGen
A framework for generating animatable 3D assets from a single image, with mesh, skeleton, and skinning outputs for downstream animation and simulation workflows.
IGL-HKUST/CoMoVi
A framework for co-generating 3D human motion and realistic videos, with a focus on motion-conditioned video generation and training workflows.
lllyasviel/Fooocus
A local image-generation interface built around prompt-focused SDXL workflows, with Windows downloads, Colab access, inpainting, outpainting, image prompts, and presets.
GVCLab/PersonaLive
A portrait image-animation framework for live-streaming-style video generation research, with offline and online inference, pretrained weights, a Web UI, and acceleration notes.
AI Agents
Panniantong/Agent-Reach
A CLI that gives AI agents broader web reach across platforms like Twitter, Reddit, YouTube, GitHub, Bilibili, and XiaoHongShu without paid API usage.
agentscope-ai/agentscope
A production-ready agent framework with core abstractions, visibility tooling, and built-in support for fine-tuning workflows.
aipoch/medical-research-skills
A curated library of medical research agent skills designed to support evidence review, protocol design, data analysis, and academic writing workflows.
hilash/cabinet
An AI-first knowledge base and workspace system with agents, memory, scheduled jobs, and local file-based storage.
HKUDS/CatchMe
A lightweight, vectorless system for capturing a broader digital footprint as usable context.
Donchitos/Claude-Code-Game-Studios
A multi-agent game-development studio system for Claude Code, organized around specialized agents, workflow skills, hooks, rules, and templates.
trycua/cua
Infrastructure for computer-use agents, with sandboxes, SDKs, benchmarks, and model integrations for agents working across desktop environments.
HKUDS/DeepTutor
An agent-native personalized tutoring system with tutoring workflows, persistent memory, a web app, CLI access, and a broader learning-support architecture.
bytedance/deer-flow
A ByteDance long-horizon agent harness for deep research, coding, file work, report generation, skills, sub-agents, memory, sandboxed execution, and message gateways.
nico-martin/gemma4-browser-extension
An independent Chrome extension experiment for running an on-device browser agent with Transformers.js, WebGPU, Gemma 4, page RAG, tab tools, and semantic history search.
open-gitagent/gitagent
A framework-agnostic, git-native standard for defining and sharing AI agents.
block/goose
An on-machine AI agent for complex development work, including coding, execution, debugging, workflow orchestration, and API interaction.
vectorize-io/hindsight
An agent memory system designed to help agents learn over time rather than only recall conversation history.
Intelligent-Internet/ii-agent
An AI agent for practical work, built to be run, forked, and extended across solo, team, and internal-tooling use cases.
MiniMax-AI/skills
A development skills library for AI coding agents, with structured guidance across frontend, fullstack, Android, iOS, and shader work.
allenai/molmoweb
A multimodal web agent from Ai2 that can navigate browser tasks from natural-language instructions.
HKUDS/nanobot
An ultra-lightweight personal AI agent project focused on core agent workflows, compact implementation, and readable extension points.
qwibitai/nanoclaw
A lightweight personal agent system that runs agents in isolated containers and connects them to messaging channels, memory, and scheduled jobs.
onyx-dot-app/onyx
An application layer for LLMs with a self-hostable interface and capabilities like RAG, web search, code execution, file creation, and deep research.
openagents-org/openagents
A collaboration project centered on AI agent networks designed to work together across shared workflows.
openai/openai-agents-python
A lightweight framework for multi-agent workflows, with tools, handoffs, guardrails, sessions, tracing, sandbox agents, and realtime voice support.
THU-MAIC/OpenMAIC
A multi-agent interactive classroom designed to offer an immersive learning experience with one-click setup.
rui-ye/OpenSeeker
A search agent system built around released training data, released models, and tool-based web information seeking.
HKUDS/OpenSpace
A framework focused on building agents that are smarter, lower-cost, and able to improve through self-evolving workflows.
alibaba/page-agent
A JavaScript in-page GUI agent for controlling web interfaces with natural language, aimed at browser-based workflows and interface automation.
VectifyAI/PageIndex
A vectorless, reasoning-based RAG framework for long-document retrieval, tree-structured indexing, traceable document search, and agent context workflows.
paperclipai/paperclip
A Node.js server and React UI for orchestrating teams of AI agents, assigning goals, and tracking work and costs from one dashboard.
infiniflow/ragflow
A practical RAG and agent-context platform for document ingestion, chunking, retrieval, citations, knowledge workflows, and self-hosted AI applications.
agentscope-ai/ReMe
A memory management framework for AI agents, with file-based and vector-based systems for long-term memory and cross-session recall.
openai/symphony
An OpenAI engineering preview and specification for orchestrating coding agents from project work queues into isolated autonomous implementation runs.
tinyfish-io/skills
A public skills repo for TinyFish agent workflows, including web-agent automation and related utility skills.
Embodied / Physical AI
dimensionalOS/dimos
An operating system layer for controlling robots and other hardware platforms with natural-language workflows.
norma-core/hardware/elrobot
A low-cost 3D-printed robotic arm intended for physical AI research and imitation learning.
freemocap/freemocap
A research-grade motion capture system designed to stay low-cost, hardware-agnostic, and accessible for scientific, educational, and training use.
wu-yc/LabClaw
A large package of workflow skills for biomedical and scientific AI work across multiple lab-heavy domains.
unitreerobotics/unifolm-wbt-dataset
A real-world humanoid robot whole-body teleoperation dataset for open environments.
Productivity
1weiho/open-slide
An agent-native slide framework for building React-based decks with coding agents, browser preview, comments, assets, present mode, and HTML/PDF export.
yazinsai/OpenOats
A meeting note-taking assistant designed to be more conversational and responsive than passive transcription.
warpdotdev/warp
An agentic development environment born out of the terminal, with built-in coding-agent workflows and support for bringing external CLI agents into developer work.
Ecosystem
TencentCloud/CubeSandbox
Sandbox infrastructure for AI agents, positioned around fast startup, isolation, high concurrency, and self-hosted code-execution workflows.
NVIDIA-NeMo/DataDesigner
A synthetic data generation framework for creating structured datasets from scratch or seed data, with dependency-aware generation, validation, and quality scoring.
google-labs-code/design.md
A format specification and CLI toolkit for describing a design system to coding agents, positioned around persistent visual guidance, linting, and token-level design workflows.
googleworkspace/cli
A single command-line interface for Drive, Gmail, Calendar, Docs, Sheets, Chat, Admin, and related workflows.
heygen-com/hyperframes
A video rendering framework for HTML-based compositions, positioned around agent-friendly workflows, previewing, and MP4 rendering.
Vaibhavs10/insanely-fast-whisper
An opinionated CLI for very fast on-device transcription with Whisper.
K-Dense-AI/k-dense-byok
A desktop co-scientist setup built around scientific skills and bring-your-own-key workflows.
yichuan-w/LEANN
A lightweight vector database for personal RAG and semantic search, designed to run locally with much lower storage overhead.
lightpanda-io/browser
A headless browser designed with AI automation use cases in mind.
run-llama/liteparse
A local PDF parsing tool focused on fast, lightweight parsing, bounding boxes, OCR flexibility, and screenshots for agent workflows.
hiyouga/LlamaFactory
A unified fine-tuning and deployment platform for 100+ LLMs and VLMs, with a zero-code CLI, web UI, and support for many training approaches.
mnfst/manifest
A smart model router for personal AI agents, positioned around cost-aware request routing, fallbacks, provider control, and self-hosted agent workflows.
MiniMax-AI/cli
The official MiniMax CLI for terminal and agent workflows, with commands for text, image, video, speech, music, vision, and search.
openai/plugins
A curated collection of Codex plugin examples, manifests, and supporting files for extending Codex-based workflows.
openai/privacy-filter
A privacy-filtering model and local toolkit for detecting and masking personally identifiable information in text, positioned around high-throughput sanitization workflows.
PaddlePaddle/PaddleOCR
A document AI toolkit for turning PDFs and images into structured, LLM-ready data, positioned around multilingual OCR, document parsing, and agent-ready extraction workflows.
yusufkaraaslan/Skill_Seekers
A preprocessing layer for turning raw documentation into reusable inputs for skills, RAG pipelines, and AI coding tools.
github/spec-kit
A toolkit for spec-driven development, positioned around structured workflows, predictable implementation paths, and AI coding agent integrations.
vllm-project/vllm-omni
A framework for serving and running omni-modality models more efficiently.
jamiepine/voicebox
A local-first voice synthesis studio for voice cloning, speech generation, effects, and voice-powered app workflows.
Datasets
evolvent-ai/ClawMark
A living-world benchmark for multi-day, multimodal coworker agents, spanning 100 tasks across professional domains and real tool environments.
meituan-longcat/General365
A manually curated benchmark for general reasoning in LLMs, designed around high difficulty, broad task diversity, K-12-scope knowledge, and hybrid scoring.
meituan-longcat/LARYBench
A benchmark for evaluating latent action representations, with pipelines for action semantics, robotic control regression, and broader vision-to-action alignment.
openai/monitorability-evals
An OpenAI evaluation-data release for studying monitorability, with public eval splits, prompt templates, dataset mappings, and metric code from the Monitoring Monitorability paper.
allenai/olmOCR-bench
A benchmark for evaluating how well OCR systems convert PDFs into useful markdown while preserving structure.
run-llama/ParseBench
A document parsing benchmark for AI-agent workflows, focused on whether parsed PDFs preserve enough structure and meaning for reliable downstream use.
google/WaxalNLP
A large multilingual speech corpus for African languages introduced through the WAXAL paper.