Projects

Open source repositories spanning machine learning, cryptography, statistical computing, and software engineering

All Projects
Filter and Search Projects
Category
Primary Language
Showing all 124 projects
Wolfram Language

bernoulli_sets

Bernoulli sets: a compositional algebra for random approximate sets with quantifiable false positive/negative error rates

GPL-3.0

Trapdoor computing: a paradigm for computing with values hidden behind one-way trapdoors. Cipher maps, Bernoulli approximations, and composable encrypted …

Claude Code plugin for academic research paper lifecycle management — thesis refinement, literature surveys, experimental design, editorial review, and …

MIT
Python

chop

Unix-philosophy image manipulation CLI with lazy evaluation, JSON piping, and multi-image composition

A self-describing, TOML-backed personal metadata store. Unix-philosophy CLI for making personal details instantly available to coding agents and scripts.

Python

dapple

Unified terminal graphics library with multiple renderers (braille, quadrants, sextants, ASCII, sixel, kitty)

Python

aptus

aptus

Latin: fitted, adapted

REST API and Python client for remote LLM fine-tuning. Run the server on your GPU machine, submit training jobs from anywhere. …

Shell

eidola

Generate a conversable persona from personal data: conversations, writings, emails, bookmarks, photos, reading notes

MIT
Python

agentum

A unified framework for sequential decision-making: from classical search to deep RL. All methods are approximations of expectimax with different representation …

MIT
Makefile

sicp

Posts influenced by SICP—on abstraction, composition, and computation as a medium for expressing ideas

Python

ptk

Photo Toolkit - CLI for managing personal photo libraries with AI-powered organization, SHA256 deduplication, and semantic search

Makefile

dual

dual

Forward-mode automatic differentiation via dual numbers for C++20.

Overview

Dual numbers are a simple yet powerful technique for computing exact …

MIT
Python

clerk

clerk

A thin CLI for LLM agents to interact with email via IMAP/SMTP.

License: MIT Python 3.11+

Philosophy

Clerk is intentionally dumb. It’s a bridge, not a brain. …

MIT
Python

mtk

Mail Toolkit - Personal email archive management with semantic search, relationship mapping, and privacy controls

1
Go

jot

A CLI-first, plaintext-native toolkit for capturing and organizing ideas, plans, tasks, and notes. Designed for the LLM era.

The Call of Asheron: An epic fantasy novel exploring forced migration, consciousness, and transformation through four protagonists on an alien world where …

HTML

pfc

A header-only C++20 library for zero-copy, prefix-free data representations with algebraic types and succinct data structures

Instrumental Goals and Latent Codes in RL-Fine-Tuned Language Models

A comprehensive theoretical and empirical analysis of mesa-optimization risks, deceptive …

Python

nfa-tools

NFA Tools: Regular Languages and Finite Automata

An elegant, pedagogical implementation of finite automata with NFA to DFA conversion, regex parsing, and …

MIT
TeX

cbt

Computational Basis Transforms - A header-only C++17 library for transformations between computational domains

MIT
Python

seqwise

Seqwise - Sequential Image Analysis with Vision Language Models

A simple, cost-free approach to analyzing sequences of images using local Vision Language Models …

Python

dotsuite

The Dot Ecosystem

“What started as a single, humble function evolved into a complete, coherent ecosystem for manipulating data structures—a journey in …

MIT

Fisher Flow: A unified information-geometric framework for sequential inference revealing how modern optimizers (Adam, Natural Gradient, K-FAC, EWC) emerge as …

MIT

Cognitive MRI of AI Conversations: Network analysis of ChatGPT conversation logs using semantic embeddings to reveal knowledge topology, community structure, …

MIT
Python

src2md

Convert source code to structured, context-optimized markdown for LLMs with intelligent summarization.

1 MIT

CLI tool for managing ebooks with semantic search, virtual libraries, annotations, and multi-format export. Part of the Long Echo toolkit for personal data …

2 MIT
Python

RPSDG

Reverse-Process Synthetic Data Generation: Automatically Generating Training Language Models for Complex Problem Solving

Abstract:

This paper introduces a …

1
Python

zeroipc

ZeroIPC - High-Performance Shared Memory IPC

Zero-copy data sharing between processes in C++, Python, Go, and C. No serialization, no bindings — parallel native …

MIT

LangCalc: A Calculus for Language Models

An elegant mathematical framework for composing language models through algebraic operations, featuring efficient …

1

Tree Rewriter

A minimal term rewriting system. 15 lines of code. Infinite possibilities.

The Insight

What if we could express computational rules as simple …

MIT
Python

AlgoTree

A powerful, immutable-by-default tree manipulation library for Python with functional programming patterns, composable transformations, and advanced pattern …

16 MIT

A consistent API for hypothesis testing in R. Provides generic methods for p-values, test statistics, degrees of freedom, and significance testing. Includes LRT …

2


marp: true #theme: uncover math: mathjax

SLUUG Talk: Large Language Models

This repository contains the slides and code for the talk:

  • Demystifying Large …

1 MIT

Anonymous batch job execution system with Linux namespace/seccomp sandboxing, resource limits, and WebSocket streaming

MIT

Model selection for reliability estimation in series systems with Weibull components: when can engineers safely use simpler models?

ChatGPT chat search

This was the first python app I developed in quite some time. I wanted to host ChatGPT logs, experiment with heroku, and see how easy it …

1

R package for specifying and using likelihood models for statistical inference. Provides a flexible framework for independent likelihood contributions across …

2

Seeing how easy it is to convert an old project on Google App Engine to a modern framework with the help of ChatGPT

Apache-2.0

Accumux

C++ License: MIT Build Status

A modern C++ library for compositional online data reductions with mathematical rigor and exceptional performance.

🎯 Overview

Accumux provides a …

MIT

[Archived] Masked data tools for R. Superseded by internalized utilities in likelihood.model.series.md and dfr.lik.series.md.

mdrelax

Relaxed Candidate Set Models for Masked Data in Series Systems

Overview

This R package implements likelihood-based inference for series systems with …

1 GPL-3.0

Apertures

A minimal Lisp-like language where “holes” (written ?x or ?ns.x) represent unknown values that can be filled later. This enables pausable, …

GPL-2.0
C++

alga

Alga

A mathematically elegant C++20 library for algebraic text processing and compositional parsing with fuzzy matching. Built on rigorous algebraic foundations …

GPL-2.0

Composable calculus expressions for C++20: symbolic differentiation, numerical integration, and algebraic composition

Closed-form MLE and Fisher information for exponential series systems with masked failure data. Includes theoretical results, proofs, and numerical validation.

1 MIT
C++

maph

Space-efficient approximate mappings using perfect hash functions. Supports arbitrary function approximation (X→Y) with configurable storage (8/16/32/64-bit) …

Bernoulli Data Type

A general framework for understanding and constructing probabilistic data structures with controlled error rates. This framework can also …

3