All Posts

Generating 1 Million Synthetic Humans - a New Method for Seeding Diverse LLM Outputs

We demonstrate a new method for seeding diverse LLM responses, and release an accompanying open-source dataset of 1 million synthetic humans.

Hacker News is Obsessed with Aviation: Classifying 42 Million Posts with SLMs

An analysis of 40+ million Hacker News posts reveals that 0.62% are aviation-related, with the percentage steadily increasing over time. Using small language models, we classified 10.7B tokens of content to discover aviation's surprising popularity among technologists.

Model Security with Large-Scale Inference

How do you verify open-source AI models aren't Trojan horses? We bombarded Qwen 2.5 Coder with 50,000 diverse programming tasks and used Mistral Codestral as a verifier to check for malicious outputs. Our large-scale inference approach found minimal security concerns, demonstrating a practical framework for evaluating model safety before deployment.