Projects
Open source repositories spanning machine learning, cryptography, statistical computing, and software engineering
Featured Projects
Highlighted work representing significant research and development contributions
narro
chartfold
pagevault
posthumous
longecho
sigmark
GPG signing for static site markdown content
Declarative language for constructing complex networks from structured data
bernoulli_sets
Bernoulli sets: a compositional algebra for random approximate sets with quantifiable false positive/negative error rates
Trapdoor computing: a paradigm for computing with values hidden behind one-way trapdoors. Cipher maps, Bernoulli approximations, and composable encrypted …
Information Recovery under Deterministic Masking in Exponential Series Systems — short communication
claude-anvil
A forge for Claude Code plugins — academic research, fiction worldbuilding, publication workflows, and more
Claude Code plugin for deets — personal metadata queries (identity, profiles, cross-platform disambiguation)
Claude Code plugin for academic research paper lifecycle management — thesis refinement, literature surveys, experimental design, editorial review, and …
mf
CLI toolkit for synchronizing external sources to a Hugo static site
Unified publication pipeline: R packages (CRAN/JOSS/JSS), Python packages (PyPI), and books (Amazon KDP)
homeostat
Homeostatic intrinsic reward for language model alignment
Masked-cause likelihood models for series systems with arbitrary hazard components in R
Companion research directions for masked series systems likelihood framework
Masked Causes of Failure in Series Systems: A Likelihood Framework
Series system distributions from flexible hazard rate components in R
minds-and-machines
AI alignment, moral agency, superintelligence, and the futures we might build
sequential-prediction
Research framework for sequential prediction algorithms, starting with Context Tree Weighting (CTW)
cryptoid
Client-side encrypted content for Hugo static sites with multi-user access control
R-universe package registry for queelius
curations
Multi-collection curated awesome-lists powered by curalist
chop
Unix-philosophy image manipulation CLI with lazy evaluation, JSON piping, and multi-image composition
deets
A self-describing, TOML-backed personal metadata store. Unix-philosophy CLI for making personal details instantly available to coding agents and scripts.
R package for automatic differentiation and gradient computation.
curalist
Multi-collection curated awesome-list manager
texwatch
Live-reloading TeX editor with PDF viewer in the browser
atk
Non-blocking audio playback daemon with named pipe protocol
algebraic-cipher-types
Functorial framework for secure computation through homomorphic operations on encrypted algebraic structures
dapple
Unified terminal graphics library with multiple renderers (braille, quadrants, sextants, ASCII, sixel, kitty)
aptus
Latin: fitted, adapted
REST API and Python client for remote LLM fine-tuning. Run the server on your GPU machine, submit training jobs from anywhere. …
eidola
Generate a conversable persona from personal data: conversations, writings, emails, bookmarks, photos, reading notes
agentum
A unified framework for sequential decision-making: from classical search to deep RL. All methods are approximations of expectimax with different representation …
the-learning-problem
Essays on induction, inference, and the search for useful representations
sicp
Posts influenced by SICP—on abstraction, composition, and computation as a medium for expressing ideas
the-long-echo
Essays on digital legacy, graceful degradation, and designing systems that outlast their creators
ptk
Photo Toolkit - CLI for managing personal photo libraries with AI-powered organization, SHA256 deduplication, and semantic search
dual
Forward-mode automatic differentiation via dual numbers for C++20.
Overview
Dual numbers are a simple yet powerful technique for computing exact …
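As a rough illustration of the idea (not the dual library's own C++20 API), forward-mode differentiation with dual numbers can be sketched in a few lines of Python. The names `Dual`, `_lift`, and `derivative` are hypothetical and chosen only for this sketch. Each value carries a derivative part, and arithmetic propagates it by the sum and product rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dual:
    """A number val + eps*e with e*e = 0; eps carries the derivative."""
    val: float
    eps: float = 0.0

    def __add__(self, other):
        other = _lift(other)
        # Sum rule: (f + g)' = f' + g'
        return Dual(self.val + other.val, self.eps + other.eps)

    __radd__ = __add__

    def __mul__(self, other):
        other = _lift(other)
        # Product rule: (f * g)' = f * g' + f' * g
        return Dual(self.val * other.val,
                    self.val * other.eps + self.eps * other.val)

    __rmul__ = __mul__


def _lift(x):
    """Treat a plain number as a constant (derivative part 0)."""
    return x if isinstance(x, Dual) else Dual(float(x))


def derivative(f, x):
    """Evaluate f at x with the derivative part seeded to 1, then read it back."""
    return f(Dual(x, 1.0)).eps


def cubic(x):
    return x * x * x + 2 * x


slope = derivative(cubic, 2.0)  # d/dx (x^3 + 2x) at x = 2
```

Because the derivative is carried alongside the value, no finite-difference step size is involved; the result is exact up to floating-point rounding.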
clerk
gradator
Pedagogical C++20 automatic differentiation library
elementa
Pedagogical C++20 linear algebra library
Paper: Preventing Ransomware Damages using In-Operation Off-Site Backup to Achieve a 10^-8 False-Negative Miss-Detection Rate (IEEE ICCI 2025)
mtk
Mail Toolkit - Personal email archive management with semantic search, relationship mapping, and privacy controls
jot
A CLI-first, plaintext-native toolkit for capturing and organizing ideas, plans, tasks, and notes. Designed for the LLM era.
The Call of Asheron: An epic fantasy novel exploring forced migration, consciousness, and transformation through four protagonists on an alien world where …
Echoes of the Sublime - A philosophical horror novel exploring AI safety, consciousness, and cognitive bandwidth limits (~103k words)
the-policy
The Policy - A literary SF novel exploring AI alignment, consciousness, and emergence
CLI tool for cross-posting content to multiple platforms including dev.to, Bluesky, Mastodon, Hashnode, Medium, and more. Supports markdown with frontmatter, …
Symbolic likelihood models in Python. Build, compose, and analyze likelihood functions with automatic differentiation and symbolic manipulation.
A pattern matching and term rewriting library for Python. Define rewrite rules with intuitive DSL syntax and apply them to transform symbolic expressions.
fuzzy-soft-circuit
Automatic fuzzy rule discovery through differentiable soft circuits - learn fuzzy logic systems from data without expert knowledge
pfc
A header-only C++20 library for zero-copy, prefix-free data representations with algebraic types and succinct data structures
Immutable graph library with 56+ algorithms, transformers, selectors, and lazy views.
Infinigram
Variable-length n-gram language models using suffix arrays.
High-performance N-dimensional sparse spatial hashing library for C++20
latent-codes-llm
Instrumental Goals and Latent Codes in RL-Fine-Tuned Language Models
A comprehensive theoretical and empirical analysis of mesa-optimization risks, deceptive …
crypto-perf-hash
Theoretical analysis of cryptographic perfect hash functions with optimal space complexity
tex2any
LaTeX to multiple formats converter with modular themes and components
llm-priors
LLMs as Intelligent Priors: Enhancing Classical Algorithms Through Learned Initialization
A virtual POSIX filesystem with content-addressable DAG structure. Features immutable nodes, Git-style hashing, functional transformations, and an embedded …
nfa-tools
NFA Tools: Regular Languages and Finite Automata
An elegant, pedagogical implementation of finite automata with NFA to DFA conversion, regex parsing, and …
cbt
Computational Basis Transforms - A header-only C++17 library for transformations between computational domains
mcts-reasoning
Monte Carlo Tree Search for LLM-based reasoning with fluent API and advanced sampling strategies
seqwise
Seqwise - Sequential Image Analysis with Vision Language Models
A simple, cost-free approach to analyzing sequences of images using local Vision Language Models …
dotsuite
The Dot Ecosystem
“What started as a single, humble function evolved into a complete, coherent ecosystem for manipulating data structures—a journey in …
A network-native functional language.
JSONL Algebra
A powerful relational algebra CLI and library for JSONL data manipulation.
fisher-flow
Fisher Flow: A unified information-geometric framework for sequential inference revealing how modern optimizers (Adam, Natural Gradient, K-FAC, EWC) emerge as …
Cognitive MRI of AI Conversations: Conference paper analyzing ChatGPT conversations through network science. Presented at Complex Networks 2025.
CLI tool for managing AI conversations with tree structure preservation, network analysis, semantic search, and multi-format export. Part of the Long Echo …
chatgpt-complex-net
Cognitive MRI of AI Conversations: Network analysis of ChatGPT conversation logs using semantic embeddings to reveal knowledge topology, community structure, …
CLI tool for managing and querying your git repository collection. Tracks events, metadata, and provides powerful queries across all your repos with GitHub, …
dreamlog
Logic programming with LLM integration and wake-sleep learning cycles
CLI tool for managing bookmarks and media with reading queues, content caching, semantic search, and multi-format export. Part of the Long Echo toolkit for …
src2md
Convert source code to structured, context-optimized markdown for LLMs with intelligent summarization.
EBK - eBook Toolkit
CLI tool for managing ebooks with semantic search, virtual libraries, annotations, and multi-format export. Part of the Long Echo toolkit for personal data …
JAF - Just Another Flow
A streaming data processing system for JSON with lazy evaluation, composable operations, and a fluent API.
fuzzy-infer
Unix-composable fuzzy logic inference with elegant Pythonic API
XTK - Expression Toolkit
A powerful symbolic expression toolkit for rule-based term rewriting.
fuzzy-logic-search
Fuzzy logic search on plain documents and JSON documents.
space-sandbox-sim
RPSDG
Reverse-Process Synthetic Data Generation: Automatically Generating Training Language Models for Complex Problem Solving
Abstract:
This paper introduces a …
zeroipc
ZeroIPC - High-Performance Shared Memory IPC
Zero-copy data sharing between processes in C++, Python, Go, and C. No serialization, no bindings — parallel native …
langcalc
LangCalc: A Calculus for Language Models
An elegant mathematical framework for composing language models through algebraic operations, featuring efficient …
tree_rewriter
Tree Rewriter
A minimal term rewriting system. 15 lines of code. Infinite possibilities.
The Insight
What if we could express computational rules as simple …
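To make the idea of rules over trees concrete, here is a minimal Python sketch. It is not the tree_rewriter code itself: the tuple encoding of terms and the helpers `rewrite`, `add_zero`, `mul_one`, and `mul_zero` are assumptions made for illustration. A rule is a function that returns a replacement term, or `None` when it does not apply.

```python
def rewrite(term, rules):
    """Rewrite children first, then apply rules at the root until none fires."""
    if isinstance(term, tuple):
        term = tuple(rewrite(t, rules) for t in term)
    for rule in rules:
        new = rule(term)
        if new is not None and new != term:
            # A rule fired: the result may itself be reducible.
            return rewrite(new, rules)
    return term


def _is_op(t, op):
    """True when t is a binary node (op, left, right)."""
    return isinstance(t, tuple) and len(t) == 3 and t[0] == op


def add_zero(t):
    # x + 0 -> x and 0 + x -> x
    if _is_op(t, "+"):
        if t[1] == 0:
            return t[2]
        if t[2] == 0:
            return t[1]
    return None


def mul_one(t):
    # x * 1 -> x and 1 * x -> x
    if _is_op(t, "*"):
        if t[1] == 1:
            return t[2]
        if t[2] == 1:
            return t[1]
    return None


def mul_zero(t):
    # x * 0 -> 0 and 0 * x -> 0
    if _is_op(t, "*") and (t[1] == 0 or t[2] == 0):
        return 0
    return None


rules = [add_zero, mul_one, mul_zero]
simplified = rewrite(("+", ("*", "x", 1), ("*", "y", 0)), rules)
```

Here `(x * 1) + (y * 0)` reduces to `x`. Termination is not guaranteed in general; it holds for this rule set because every rule strictly shrinks the term.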
beautiful-deception
How 256 bits pretend to be infinity: A pedagogical exploration of random oracles and computational randomness
ollama_data_tools
Ollama Data Tools
Requirements
- Python 3.x
Installation
Clone the repository and install the necessary dependencies:
git clone …
AlgoTree
A powerful, immutable-by-default tree manipulation library for Python with functional programming patterns, composable transformations, and advanced pattern …
A consistent API for hypothesis testing in R. Provides generic methods for p-values, test statistics, degrees of freedom, and significance testing. Includes LRT …
sluug-talk-llm
SLUUG Talk: Large Language Models
This repository contains the slides and code for the talk:
- Demystifying Large …
elasticsearch-lm
ElasticSearch Query Fine-Tuning Training Data for Large Language Models
sandrun
Anonymous batch job execution system with Linux namespace/seccomp sandboxing, resource limits, and WebSocket streaming
Scalable lock built from the two-thread Peterson lock.
maskedselect
Model selection for reliability estimation in series systems with Weibull components: when can engineers safely use simpler models?
A modern C++ header-only library implementing Disjoint Interval Sets as a complete Boolean algebra. Features elegant API, compile-time intervals, and …
[Archived] R package for Weibull series system estimation from masked failure data. Companion code for master's project (SIUE, 2023). Superseded by …
[Archived] Master's project (SIUE, 2023): MLE for series system reliability with Weibull components under right-censoring and masked failure data. See …
Flexible hazard rate distributions for survival analysis and reliability engineering in R
Likelihood model for series systems with masked component cause of failure and other censoring mechanisms
chatgpt-pysearch
ChatGPT chat search
This was the first Python app I developed in quite some time. I wanted to host ChatGPT logs, experiment with Heroku, and see how easy it …
R package for specifying and using likelihood models for statistical inference. Provides a flexible framework for independent likelihood contributions across …
R package for maximum likelihood estimation with compositional data.
Seeing how easy it is to convert an old project on Google App Engine to a modern framework with the help of ChatGPT
Like micrograd, but worse.
stepanov
Pedagogical blog posts on generic programming in C++, inspired by Alex Stepanov
R package: Algebra over distributions (random elements) with automatic simplification to closed forms
accumux
[Archived] Masked data tools for R. Superseded by internalized utilities in likelihood.model.series.md and dfr.lik.series.md.
algebraic.mle
Algebraic maximum likelihood estimators
mdrelax
Relaxed Candidate Set Models for Masked Data in Series Systems
Overview
This R package implements likelihood-based inference for series systems with …
aperture
Apertures
A minimal Lisp-like language where “holes” (written ?x or ?ns.x) represent unknown values that can be filled later. This enables pausable, …
alga
Alga
A mathematically elegant C++20 library for algebraic text processing and compositional parsing with fuzzy matching. Built on rigorous algebraic foundations …
limes
Composable calculus expressions for C++20: symbolic differentiation, numerical integration, and algebraic composition
Time series analysis of a confidentiality measure for an encrypted search system
We derive a confidentiality measure against an adversary deploying a …
Algebraic cipher types
Master's thesis on encrypted search: enabling standard IR on encrypted collections. Published via ProQuest (2014). Part of the oblivious-computing research …
Encrypted Search with Oblivious Bernoulli Types: Information-Theoretic Privacy through Controlled Approximation
Probabilistic framework for quantifying confidentiality of encrypted search systems using bootstrap methods and entropy analysis
Research code and data for the IEEE CloudCom 2016 paper on estimating confidentiality risks in encrypted search systems. The Moving Average Bootstrap (MAB) …
cipher_maps
A unified theoretical framework for oblivious function approximation through algebraic structures and Bernoulli models
Closed-form MLE and Fisher information for exponential series systems with masked failure data. Includes theoretical results, proofs, and numerical validation.
Modern C++20 header-only library for algebraic hash function composition with elegant DSL
maph
Space-efficient approximate mappings using perfect hash functions. Supports arbitrary function approximation (X→Y) with configurable storage (8/16/32/64-bit) …
Privacy-preserving set operations using cryptographic trapdoor functions. Minimal Python library implementing Bernoulli types framework with explicit error …
bernoulli_data_type
Bernoulli Data Type
A general framework for understanding and constructing probabilistic data structures with controlled error rates. This framework can also …