All Posts
Generating 1 Million Synthetic Humans - a New Method for Seeding Diverse LLM Outputs
We demonstrate a new method for seeding diverse LLM responses, and release an accompanying open-source dataset of 1 million synthetic humans.
Hacker News is Obsessed with Aviation: Classifying 42 Million Posts with SLMs
An analysis of 40+ million Hacker News posts reveals that 0.62% are aviation-related, with the percentage steadily increasing over time. Using small language models, we classified 10.7B tokens of content to discover aviation's surprising popularity among technologists.
Model Security with Large-Scale Inference
How do you verify open-source AI models aren't Trojan horses? We bombarded Qwen 2.5 Coder with 50,000 diverse programming tasks and used Mistral Codestral as a verifier to check for malicious outputs. Our large-scale inference approach found minimal security concerns, demonstrating a practical framework for evaluating model safety before deployment.