Products
Models
Pricing
Blog
Contact us
Try for free
Our products
MakoraInference
MakoraGenerate
RESOURCES
Docs
CASE STUDIES
Code Translation
Performance Optimization
COMPANY
About
Careers
Contact Us
The latest findings, insights, and publications from the Makora team.
Makora inference endpoints are built for AI agents that need to think, respond, and act in real time.
Apr 20, 2026
Sequential Monte Carlo Speculative Decoding enables new highs for tokens per second per user
Apr 16, 2026
Writing fast FP8 GEMMs just got faster
Mar 6, 2026
Everyone's favourite kernel generation agent, now in your CLI!
Feb 18, 2026
The code was correct. The problem wasn't.
Feb 12, 2026
Pushing frontier model capabilities with reinforcement learning
Jan 15, 2026
A systematic study of reward hacking, adversarial detection, and robust evaluation for LLM-optimized GPU kernels
Dec 16, 2025
MakoraGenerate implements functional and fast KDA kernels with evolutionary search
Dec 3, 2025
Same team. Same mission. Two new letters.
Sep 18, 2025
Creating a representative subset of KernelBench to evaluate a long-running agent more efficiently
Aug 12, 2025
Announcing Makora's seed round
Aug 6, 2025
MakoraGenerate outperforms torch.compile when optimizing DeepSeek MOE Kernels
Jul 29, 2025
MakoraGenerate writes inline PTX to achieve near-optimial GEMM performance
Jul 22, 2025
Optimizing the kernel generation pipeline through accelerated compilation
Jun 25, 2025
MakoraGenerate is an LLM-powered AI agent that writes GPU kernels
May 29, 2025
Makora improves the performance of vLLM and SGLang
Apr 2, 2025
Achieve state-of-the-art latency on FLUX.1-schnell by leveraging multiple executor backends
Jan 29, 2025
Easily deploy models on Makora
Oct 29, 2024
Identifying the most price efficient AI inference accelerators
Talk to an engineer
Status
Terms of Service
Privacy Policy
Cookie Policy
Copyright © 2026 MakoRA. All rights reserved.