Hey there! wave
I'm a software engineer and researcher focused on AI reliability, distributed systems, and functional programming. I build infrastructure for LLM research on the Elixir/BEAM platform.
microscope Crucible Framework - LLM Reliability Research
I'm the creator of the Crucible Framework , a platform for conducting reproducible experiments on large language model reliability, built on Elixir/OTP.
Key Goal: Building towards 99%+ LLM reliability through ensemble voting and request hedging, with comprehensive statistical testing and transparent causal reasoning chains.
All published under the @North-Shore-AI organization:
Library | Description |
---|---|
crucible_framework | Documentation hub & research framework |
crucible_bench | Statistical testing & analysis (15+ tests, effect sizes, power analysis) |
crucible_ensemble | Multi-model voting strategies for improved reliability |
crucible_hedging | Request hedging for latency reduction |
crucible_trace | Causal reasoning chain logging for LLM transparency |
crucible_datasets | Unified interface to benchmark datasets (MMLU, HumanEval, GSM8K) |
crucible_telemetry | Research-grade instrumentation & metrics collection |
crucible_harness | Automated experiment orchestration & reporting |
Library | Description |
---|---|
crucible_adversary | Adversarial testing & robustness evaluation framework |
crucible_xai | Explainable AI tools (LIME, SHAP, feature attribution) |
ExDataCheck | Data validation & quality library for ML pipelines |
ExFairness | Fairness & bias detection library for AI/ML systems |
LLMGuard | AI firewall & guardrails for LLM-based applications |
Tech Stack: Elixir, OTP, BEAM VM, Telemetry Research Areas: LLM reliability, ensemble methods, tail latency optimization, statistical testing Status: Active development, v0.1.0 released
rocket Elixir Projects
- synapse ⭐ 20 - Synapse: Elixir-powered AI agent orchestration, built on the battle-teste...
- ds_ex ⭐ 14 - DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program O...
- DSPex ⭐ 8 - Declarative Self Improving Elixir - DSPy Orchestration in Elixir
- mabeam ⭐ 4 - Multi Agent BEAM
- AutoElixir ⭐ 3 - AI Multi Agent Swarms in Elixir
- ALTAR ⭐ 4 - The Agent & Tool Arbitration Protocol
- gemini_ex ⭐ 16 - Elixir Interface / Adapter for Google Gemini LLM, for both AI Studio a...
- claude_agent_sdk ⭐ 7 - Elixir SDK for Claude AI Agent API - Renamed from claude_code_sdk_elix...
- codex_sdk ⭐ 0 - OpenAI Codex SDK written in Elixir
- jules_ex ⭐ 0 - Elixir client SDK for the Jules API - orchestrate AI coding sessions
- pipeline_ex ⭐ 6 - Claude Code + Gemini AI collaboration orchestration tools
- weaviate_ex ⭐ 0 - Modern Elixir client for Weaviate vector database with health checks...
- json_remedy ⭐ 20 - A practical, multi-layered JSON repair library for Elixir that intelli...
- snakepit ⭐ 8 - High-performance, generalized process pooler and session manager for e...
- sinter ⭐ 8 - Unified schema definition, validation, and JSON generation for Elixir
- exdantic ⭐ 8 - A powerful, flexible schema definition and validation library for Elix...
- perimeter ⭐ 6 - Elixir Typing Mechanism
- ex_dbg ⭐ 9 - State-of-the-Art Introspection and Debugging System for Elixir/Phoenix...
- elixir_scope ⭐ 4 - Revolutionary AST-based debugging and code intelligence platform for E...
- ElixirScope ⭐ 3 - AI-Powered Execution Cinema Debugger for Elixir/BEAM
- superlearner ⭐ 6 - OTP Supervisor Educational Platform
- apex ⭐ 3 - Core Apex framework for OTP supervision and monitoring
- apex_ui ⭐ 3 - Web UI for Apex OTP supervision and monitoring tools
- arsenal ⭐ 3 - Metaprogramming framework for automatic REST API generation from OTP o...
- arsenal_plug ⭐ 2 - Phoenix/Plug adapter for Apex Arsenal framework
- supertester ⭐ 3 - A battle-hardened testing toolkit for building robust and resilient El...
- sandbox ⭐ 3 - Isolated OTP application management system for Elixir/Erlang
- cluster_test ⭐ 3 - Distributed Erlang/Elixir test cluster management via Mix tasks
- foundation ⭐ 10 - Elixir infrastructure and Observability Library
- AITrace ⭐ 0 - The unified observability layer for the AI Control Plane
- Assessor ⭐ 0 - The definitive CI/CD platform for AI Quality.
- Citadel ⭐ 0 - The command and control layer for the AI-powered enterprise
- cf_ex ⭐ 3 - Elixir libraries for Cloudflare edge computing services. Battle-tested...
- ex_cloudflare_phoenix ⭐ 0 - Cloudflare Durable Objects and Calls for Phoenix Framework
- playwriter ⭐ 6 - Elixir WSL-to-Windows browser integration
- youtube_audio_dl ⭐ 0 - Download high-quality audio from YouTube as MP3 files using Elixir. Fe...
- tools ⭐ 0 - Elixir repository
chart GitHub Stats
tools Tech Stack
Languages: Elixir, Erlang, Python, JavaScript/TypeScript, Rust
Frameworks: Phoenix, OTP, FastAPI, React
Specialties:
- Distributed systems & fault tolerance
- AI/LLM infrastructure & reliability
- Functional programming & metaprogramming
- Statistical analysis & experimental design
- Developer tools & productivity
Platforms: BEAM VM, AWS, GCP, Cloudflare Workers, Edge Computing
document Current Focus
- microscope Research: LLM reliability through ensemble methods and statistical testing
- building Building: AI infrastructure on Elixir/OTP
- learning Learning: Advanced OTP patterns, distributed systems optimization
- growing Growing: The Crucible framework ecosystem
globe Connect
- GitHub: @nshkrdotcom
- Organization: @North-Shore-AI
lightbulb Philosophy
"Build infrastructure that researchers and engineers actually want to use. Make reliability measurable. Make experiments reproducible. Make the BEAM shine for AI workloads."
target Open to
- collaboration Collaboration on Elixir AI tooling
- consulting Consulting for distributed systems & AI infrastructure
- speaking Speaking about LLM reliability, Elixir/OTP, or functional programming
- research Research partnerships in AI reliability & distributed systems
- open source Open source contributions - PRs welcome on any project!
Last updated: 2025年10月18日