Attention scales quadratically; physics doesn't. We build language models on state-space dynamics and continuous latent diffusion. Constant-memory inference, with reasoning that refines before it speaks.
The transformer's attention mechanism buys expressivity with quadratic cost. Our program replaces it end to end: not by approximating attention, but by changing the underlying dynamics of the model.
Selective state-space models (Mamba-family) carry context as an evolving fixed-size state rather than a growing cache. Sequence length stops being the enemy: linear-time processing, constant memory, effectively unbounded context.
Instead of committing to one token at a time, the model refines a continuous latent representation of its answer through diffusion steps. Iterative reasoning happens in latent space, before anything is projected into text.
Eliminating the KV-cache collapses inference into a fixed memory footprint. Generation speed improves by an order of magnitude, making real-time complex reasoning practical on modest hardware.
DIMBA fuses a bidirectional Mamba-2 state-space denoiser with cosine-scheduled Gaussian diffusion over latent text states: a non-autoregressive model that generates whole sequences in one pass instead of token by token. The library ships classifier-free guidance, self-conditioning, cross-architecture distillation from any Hugging Face transformer, and DPO/SimPO/GRPO post-training. It runs without CUDA, on CPU and Apple Silicon included.
A 135M-parameter training run is underway. First results are coming soon.
Hamiltonian Research is an independent lab based in Abu Dhabi, founded and led by Faris Allafi, architect of the DIMBA model and, at 13, one of the youngest published researchers working on post-transformer architectures.
The lab's conviction is simple: the current generation of language models is defined by an architectural accident, not a law of nature. Better dynamics exist, and they are worth building from first principles: openly, with technical depth as the only currency.
Formerly operating as DimbaLabs.
We're looking for researchers, engineers, and partners who want to scale the DIMBA architecture and the ideas behind it.
Email the lab