Efficient LLM inference on Slurm clusters.
Updated Jan 22, 2026 - Python
A practical, multi-layered JSON repair library for Elixir that intelligently fixes malformed JSON strings commonly produced by LLMs, legacy systems, and data pipelines.
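The Elixir library's API isn't shown here, but the layered-repair idea it describes can be illustrated with a minimal Python sketch: try strict parsing first, then apply a few common fixes (stripped code fences, single quotes, trailing commas) before retrying. The function names are hypothetical and the regexes are deliberately naive.

```python
import json
import re

def repair_json(text: str) -> str:
    """Apply a few common repairs to malformed JSON produced by LLMs."""
    # Strip markdown code fences that LLMs often wrap JSON in.
    text = re.sub(r"^```(?:json)?\s*|\s*```$", "", text.strip())
    # Replace single-quoted keys/strings with double quotes (naive).
    text = re.sub(r"(?<=[{,\[:\s])'([^']*)'", r'"\1"', text)
    # Remove trailing commas before a closing brace/bracket.
    text = re.sub(r",\s*([}\]])", r"\1", text)
    return text

def parse_with_repair(text: str):
    """Try strict parsing first; fall back to the repaired string."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return json.loads(repair_json(text))
```

A real multi-layered repairer would chain many more passes (unbalanced brackets, bare keys, truncated output), but the try-strict-then-repair structure is the core pattern.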
PipelineLLM is a systematic study project on large language model (LLM) post-training, covering the full stack from supervised fine-tuning (SFT) through preference optimization (DPO) and reinforcement learning (RLHF/PPO/GRPO) to continual learning.
A lightweight Bun + Express template that connects to the Testune AI API and streams chat responses in real time using Server-Sent Events (SSE)
Full-featured Elixir client for the Model Context Protocol (MCP) with multi-transport support, resources, prompts, tools, and telemetry.
A Compute-Agnostic, WebSocket-first protocol for AI Agents. The high-performance alternative to MCP. Runs on Serverless or stateful servers with sub-30ms latency.
[Unstructured data pipeline] The goal is to automate the path from raw data to extraction of specific information. first_example: collect arbitrary documents and visualize them as a mind map (progress: 1/3).
A production-ready, enterprise-grade Agentic RAG ingestion pipeline built with n8n, Supabase (pgvector), and AI embeddings. Implements event-driven orchestration, hybrid RAG for structured and unstructured data, vector similarity search, and multi-tenant architecture to deliver client-isolated, retrieval-ready knowledge bases.
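In the pipeline above, the retrieval step relies on vector similarity search, which pgvector performs inside Postgres. The underlying operation is easy to state in pure Python: score every stored embedding by cosine similarity to the query embedding and return the top matches. This is only an illustration of the math; the real pipeline delegates it to an indexed SQL query.

```python
from math import sqrt

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def top_k(query, corpus, k=3):
    """Rank stored embeddings by cosine similarity to the query.

    `corpus` maps document ids to embedding vectors; returns the k
    best (id, score) pairs, highest similarity first.
    """
    scored = [(doc_id, cosine_similarity(query, vec))
              for doc_id, vec in corpus.items()]
    return sorted(scored, key=lambda s: s[1], reverse=True)[:k]
```

A brute-force scan like this is O(n) per query; pgvector's approximate indexes exist precisely to avoid it at scale, and hybrid RAG adds keyword filters on top of the vector score.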
MindScript Ledger is the temporal memory and pattern-recall layer of the MindScript ecosystem. It stores normalized prompts, logic states, patterns, and threads as a structured ledger, enabling deterministic recall and reconstruction of long-running AI interactions.
Enterprise-grade Sovereign AI Stack optimized for NVIDIA Blackwell (sm_120) & vLLM. Features 256K context window, 5.8k tok/s prefill, and integrated observability via Langfuse.
⚡ Streamline AI agent communication with Agent Socket, a high-performance WebSocket protocol designed for diverse computing environments.