nshkrdotcom/README.md

Hey there!

I'm a software engineer and researcher focused on AI reliability, distributed systems, and functional programming. I build infrastructure for LLM research on the Elixir/BEAM platform.

Crucible Framework - LLM Reliability Research

I'm the creator of the Crucible Framework, a platform for conducting reproducible experiments on large language model reliability, built on Elixir/OTP.

Key Goal: Building towards 99%+ LLM reliability through ensemble voting and request hedging, with comprehensive statistical testing and transparent causal reasoning chains.

Core Libraries

All published under the @North-Shore-AI organization:

| Library | Description |
| --- | --- |
| crucible_framework | Documentation hub & research framework |
| crucible_bench | Statistical testing & analysis (15+ tests, effect sizes, power analysis) |
| crucible_ensemble | Multi-model voting strategies for improved reliability |
| crucible_hedging | Request hedging for latency reduction |
| crucible_trace | Causal reasoning chain logging for LLM transparency |
| crucible_datasets | Unified interface to benchmark datasets (MMLU, HumanEval, GSM8K) |
| crucible_telemetry | Research-grade instrumentation & metrics collection |
| crucible_harness | Automated experiment orchestration & reporting |

In-Progress Extensions

| Library | Description |
| --- | --- |
| crucible_adversary | Adversarial testing & robustness evaluation framework |
| crucible_xai | Explainable AI tools (LIME, SHAP, feature attribution) |
| ExDataCheck | Data validation & quality library for ML pipelines |
| ExFairness | Fairness & bias detection library for AI/ML systems |
| LLMGuard | AI firewall & guardrails for LLM-based applications |

Tech Stack: Elixir, OTP, BEAM VM, Telemetry

Research Areas: LLM reliability, ensemble methods, tail latency optimization, statistical testing

Status: Active development, v0.1.0 released


Elixir Projects

AI Agent Orchestration & Multi-Agent Systems

  • synapse ⭐ 20 - Synapse: Elixir-powered AI agent orchestration, built on the battle-teste...
  • ds_ex ⭐ 14 - DSPEx - Declarative Self-improving Elixir | A BEAM-Native AI Program O...
  • DSPex ⭐ 8 - Declarative Self Improving Elixir - DSPy Orchestration in Elixir
  • mabeam ⭐ 4 - Multi Agent BEAM
  • AutoElixir ⭐ 3 - AI Multi Agent Swarms in Elixir
  • ALTAR ⭐ 4 - The Agent & Tool Arbitration Protocol

AI SDKs & API Clients

  • gemini_ex ⭐ 16 - Elixir Interface / Adapter for Google Gemini LLM, for both AI Studio a...
  • claude_agent_sdk ⭐ 7 - Elixir SDK for Claude AI Agent API - Renamed from claude_code_sdk_elix...
  • codex_sdk ⭐ 0 - OpenAI Codex SDK written in Elixir
  • jules_ex ⭐ 0 - Elixir client SDK for the Jules API - orchestrate AI coding sessions
  • pipeline_ex ⭐ 6 - Claude Code + Gemini AI collaboration orchestration tools

AI Infrastructure & Utilities

  • weaviate_ex ⭐ 0 - Modern Elixir client for Weaviate vector database with health checks...
  • json_remedy ⭐ 20 - A practical, multi-layered JSON repair library for Elixir that intelli...
  • snakepit ⭐ 8 - High-performance, generalized process pooler and session manager for e...

Schema & Data Validation

  • sinter ⭐ 8 - Unified schema definition, validation, and JSON generation for Elixir
  • exdantic ⭐ 8 - A powerful, flexible schema definition and validation library for Elix...
  • perimeter ⭐ 6 - Elixir Typing Mechanism

Developer Tools & Debugging

  • ex_dbg ⭐ 9 - State-of-the-Art Introspection and Debugging System for Elixir/Phoenix...
  • elixir_scope ⭐ 4 - Revolutionary AST-based debugging and code intelligence platform for E...
  • ElixirScope ⭐ 3 - AI-Powered Execution Cinema Debugger for Elixir/BEAM

OTP & Distributed Systems

  • superlearner ⭐ 6 - OTP Supervisor Educational Platform
  • apex ⭐ 3 - Core Apex framework for OTP supervision and monitoring
  • apex_ui ⭐ 3 - Web UI for Apex OTP supervision and monitoring tools
  • arsenal ⭐ 3 - Metaprogramming framework for automatic REST API generation from OTP o...
  • arsenal_plug ⭐ 2 - Phoenix/Plug adapter for Apex Arsenal framework

Testing & Quality Assurance

  • supertester ⭐ 3 - A battle-hardened testing toolkit for building robust and resilient El...
  • sandbox ⭐ 3 - Isolated OTP application management system for Elixir/Erlang
  • cluster_test ⭐ 3 - Distributed Erlang/Elixir test cluster management via Mix tasks

Infrastructure & Observability

  • foundation ⭐ 10 - Elixir infrastructure and Observability Library
  • AITrace ⭐ 0 - The unified observability layer for the AI Control Plane
  • Assessor ⭐ 0 - The definitive CI/CD platform for AI Quality.
  • Citadel ⭐ 0 - The command and control layer for the AI-powered enterprise

Cloud & Edge Computing

  • cf_ex ⭐ 3 - Elixir libraries for Cloudflare edge computing services. Battle-tested...
  • ex_cloudflare_phoenix ⭐ 0 - Cloudflare Durable Objects and Calls for Phoenix Framework

Browser & Platform Integration

  • playwriter ⭐ 6 - Elixir WSL-to-Windows browser integration

Utilities

  • youtube_audio_dl ⭐ 0 - Download high-quality audio from YouTube as MP3 files using Elixir. Fe...
  • tools ⭐ 0 - Elixir repository

Tech Stack

Languages: Elixir, Erlang, Python, JavaScript/TypeScript, Rust

Frameworks: Phoenix, OTP, FastAPI, React

Specialties:

  • Distributed systems & fault tolerance
  • AI/LLM infrastructure & reliability
  • Functional programming & metaprogramming
  • Statistical analysis & experimental design
  • Developer tools & productivity

Platforms: BEAM VM, AWS, GCP, Cloudflare Workers, Edge Computing


Current Focus

  • Research: LLM reliability through ensemble methods and statistical testing
  • Building: AI infrastructure on Elixir/OTP
  • Learning: Advanced OTP patterns, distributed systems optimization
  • Growing: The Crucible framework ecosystem

Philosophy

"Build infrastructure that researchers and engineers actually want to use. Make reliability measurable. Make experiments reproducible. Make the BEAM shine for AI workloads."


Open to

  • Collaboration on Elixir AI tooling
  • Consulting for distributed systems & AI infrastructure
  • Speaking about LLM reliability, Elixir/OTP, or functional programming
  • Research partnerships in AI reliability & distributed systems
  • Open source contributions - PRs welcome on any project!

Last updated: October 18, 2025

Pinned

  1. gemini_ex (Public, Elixir) ⭐ 16, 6 forks - Elixir Interface / Adapter for Google Gemini LLM, for both AI Studio and Vertex AI
  2. snakepit (Public, Elixir) ⭐ 8, 2 forks - High-performance, generalized process pooler and session manager for external language integrations. Orchestrates and supervises languages like Python and Javascript from Elixir.
  3. DSPex (Public, Elixir) ⭐ 8 - Declarative Self Improving Elixir - DSPy Orchestration in Elixir
  4. perimeter (Public, Elixir) ⭐ 6 - Elixir Typing Mechanism
  5. ALTAR (Public, Elixir) ⭐ 4 - The Agent & Tool Arbitration Protocol
  6. arsenal (Public, Elixir) ⭐ 3, 1 fork - Metaprogramming framework for automatic REST API generation from OTP operations
