Updates, tips, and stories.
Notes from the team building Cereby — what we're shipping, why, and how to get more out of your studying.
Performance Analysis Without the Pin
Our detailed performance breakdown only worked when users pinned a quiz first. We rebuilt it so the system understands subjects on its own, with a Gemini 2.5 Flash-Lite classifier replacing brittle keyword matching.
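A minimal sketch of the classifier call, assuming the @google/genai SDK; the prompt, subject list, and fallback bucket are illustrative, not the ones we ship:

```typescript
import { GoogleGenAI } from '@google/genai';

// Hypothetical subject taxonomy; the real list lives in the product.
const SUBJECTS = ['math', 'biology', 'history', 'language', 'other'];

const ai = new GoogleGenAI({ apiKey: process.env.GEMINI_API_KEY });

async function classifySubject(quizTitle: string, sampleQuestions: string): Promise<string> {
  const res = await ai.models.generateContent({
    model: 'gemini-2.5-flash-lite',
    contents:
      `Classify this quiz into exactly one of: ${SUBJECTS.join(', ')}.\n` +
      `Title: ${quizTitle}\nSample questions: ${sampleQuestions}\n` +
      `Reply with the subject only.`,
  });
  const subject = res.text?.trim().toLowerCase() ?? '';
  // Unlike keyword matching, unseen phrasings still land in a valid bucket.
  return SUBJECTS.includes(subject) ? subject : 'other';
}
```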
Seeing Page Freezes in Our Error Dashboard
Chrome's "Page Unresponsive" dialog was happening in production and we had no signal for it. Here is how a tiny rAF heartbeat turned invisible freezes into rows in the same error dashboard everything else lands in, grouped per route so a 7-second freeze and a 17-second freeze on the same page collapse into one bug.
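The core trick fits in a few lines. A minimal sketch, assuming a hypothetical reportFreeze() that posts to the dashboard; the threshold is illustrative:

```typescript
// Stand-in for whatever ships the event to the error dashboard.
declare function reportFreeze(e: { route: string; durationMs: number }): void;

const FREEZE_THRESHOLD_MS = 1000;
let last = performance.now();

function heartbeat(now: number) {
  const gap = now - last;
  last = now;
  // rAF stops firing while the main thread is blocked, so the gap between
  // two frames is the freeze duration. Backgrounded tabs also pause rAF,
  // so real code has to filter those out (e.g. via document.hidden).
  if (gap > FREEZE_THRESHOLD_MS && !document.hidden) {
    reportFreeze({
      route: location.pathname, // grouping key: one bug per route
      durationMs: Math.round(gap),
    });
  }
  requestAnimationFrame(heartbeat);
}

requestAnimationFrame(heartbeat);
```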
From 10MB to 500MB: How Direct-to-Bucket Uploads Scaled With Us
Next.js API routes cap request bodies at roughly 10MB, so we rerouted file uploads to go straight from the browser to Supabase Storage and gave the server only a small JSON pointer, which let us support up to 500MB without giving up validation, quota enforcement, or auditability.
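A sketch of the three-step flow, assuming supabase-js v2's signed-upload helpers; the /api/uploads/* routes and bucket name are placeholders, not our real endpoints:

```typescript
import { createClient } from '@supabase/supabase-js';

declare const SUPABASE_URL: string;
declare const SUPABASE_ANON_KEY: string;

const supabase = createClient(SUPABASE_URL, SUPABASE_ANON_KEY);

async function uploadDirect(file: File) {
  // 1. Ask the server to sign an upload slot (quota checks happen here).
  const { path, token } = await fetch('/api/uploads/sign', {
    method: 'POST',
    headers: { 'content-type': 'application/json' },
    body: JSON.stringify({ name: file.name, size: file.size }),
  }).then((r) => r.json());

  // 2. Send the bytes browser-to-bucket, bypassing the API route body cap.
  const { error } = await supabase.storage
    .from('user-files')
    .uploadToSignedUrl(path, token, file);
  if (error) throw error;

  // 3. Hand the server only a small JSON pointer for validation and audit.
  await fetch('/api/uploads/confirm', {
    method: 'POST',
    headers: { 'content-type': 'application/json' },
    body: JSON.stringify({ path, size: file.size }),
  });
}
```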
Streaming Read-Aloud: From Buffered Waits to Sentence-Level Pipelining
Read-aloud used to wait for the full assistant response before speaking; we rebuilt the pipeline around streaming text deltas and parallel TTS prefetch so the first sentence plays in roughly half a second instead of several seconds.
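A sketch of the pipelining idea; synthesize() and play() are stand-ins for the TTS call and the audio queue, not our actual API:

```typescript
declare function synthesize(sentence: string): Promise<ArrayBuffer>;
declare function play(audio: ArrayBuffer): Promise<void>;

async function readAloud(textDeltas: AsyncIterable<string>) {
  let buffer = '';
  let pending: Promise<ArrayBuffer> | null = null;

  for await (const delta of textDeltas) {
    buffer += delta;
    let match: RegExpMatchArray | null;
    // Cut on sentence boundaries so the first audio chunk is tiny.
    while ((match = buffer.match(/^(.+?[.!?])\s+(.*)$/s))) {
      const sentence = match[1];
      buffer = match[2];
      const next = synthesize(sentence);      // start TTS immediately
      if (pending) await play(await pending); // play n while n+1 synthesizes
      pending = next;
    }
  }
  if (pending) await play(await pending);
  if (buffer.trim()) await play(await synthesize(buffer));
}
```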
Enriching Cereby's Pinned Content Context: How We Made @ Mentions Smarter
Cereby's pinned-content feature was discarding most of what it knew about a student's study materials, so we built a modular enrichment pipeline that adds per-card mastery, note metadata, quiz time analytics, teaching content, and cross-content links without touching the prompt layer.
How We Rebuilt Cereby's Memory to Feel Like It Actually Knows You
Cereby's memory relied on the chat model choosing to save facts; it rarely did, so we replaced the tool-call approach with a parallel LLM classifier, pgvector retrieval, and tiered storage that now detects memories automatically, deduplicates by embedding similarity, and retrieves them in under 2ms.
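A sketch of the dedup step; the 0.9 threshold and helper names are illustrative, and in production the nearest-neighbor search runs in Postgres via pgvector rather than in application code:

```typescript
declare function embed(text: string): Promise<number[]>;
declare function nearestMemory(e: number[]): Promise<{ embedding: number[] } | null>;
declare function insertMemory(text: string, e: number[]): Promise<unknown>;

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

async function saveIfNovel(text: string) {
  const embedding = await embed(text);
  // pgvector's `<=>` cosine-distance operator does this lookup server-side.
  const nearest = await nearestMemory(embedding);
  if (nearest && cosine(embedding, nearest.embedding) > 0.9) {
    return nearest; // near-duplicate: keep the existing memory
  }
  return insertMemory(text, embedding);
}
```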
Six Cuts to the Cereby Orchestration Layer: Lazy Context, Smarter Budgets, and Fewer Wasted Calls
How we restructured Cereby's core request pipeline to skip unnecessary database queries on file-chat, cache compression results across follow-ups, and scale token budgets to the model, compounding six changes into a faster and cheaper system.
From First-Match-Wins to Parallel Scoring: How We Fixed Cereby's Misclassification Problem
Cereby's intent classifier was fast and accurate for clear requests, but regex patterns that short-circuited before the LLM could run were silently misclassifying ambiguous ones, so we rebuilt it as a parallel scoring pipeline where every classifier competes and the best score wins.
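The shape of the change, sketched; the classifier type and scoring are illustrative:

```typescript
interface Scored { intent: string; score: number }

type Classifier = (message: string) => Promise<Scored>;

async function classify(message: string, classifiers: Classifier[]): Promise<Scored> {
  // No classifier can short-circuit the others anymore: all of them
  // (regex scorers and the LLM alike) see every message, and the
  // highest-confidence result wins.
  const results = await Promise.all(classifiers.map((c) => c(message)));
  return results.reduce((best, r) => (r.score > best.score ? r : best));
}
```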
Building the Cereby Humanizer: A Rule-Based System That Fights AI Detectors
We built a two-pass humanizer that combines a neural paraphraser with 12 deterministic rule-based transforms, driven by detector feedback, and dropped a 68% AI-detection score to 51% while keeping the text grammatically correct and semantically faithful.
How We Reduced PDF Export Size by 98%
A three-page sparse note was generating a 46 MB PDF on download. We traced it to full-scrollHeight rasterization at 2x scale with lossless PNG encoding and fixed it in three files, bringing output down to 1 MB.
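A sketch of the encoding half of the fix, with an html2canvas-style render step standing in for the real one; the scale and quality values are illustrative:

```typescript
// Hypothetical render step; the real one comes from the export library.
declare function renderToCanvas(el: HTMLElement, opts: { scale: number }): Promise<HTMLCanvasElement>;

async function exportPage(el: HTMLElement): Promise<string> {
  const canvas = await renderToCanvas(el, { scale: 1.5 }); // was 2x

  // A lossless PNG of a mostly-white page is enormous; JPEG at modest
  // quality is visually indistinguishable for notes and far smaller.
  return canvas.toDataURL('image/jpeg', 0.8); // was 'image/png'
}
```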
Cereby AI System Design: From File Upload to Grounded Answer
We built the pipeline that turns an uploaded PDF, screenshot, or pinned note into compressed, query-relevant context the AI can actually cite, covering parsing, OCR, chunking, compression, and prompt injection.
How We Fix Typos Without Wild Guesses
We built a scoped message normalizer for Cereby's fast-path routing so pattern matchers see stable text without corrupting what the language model receives.
Hardening Our Detector API for Production Reliability
Our AI detector worked. Then we had to operate it. Here is how a key-gated VM port became a Compose stack behind Nginx on detector.cereby.ai, with TLS, rate limits, a 401-abuse jail, a concurrency cap, and a runbook short enough that on-call actually reads it.
Cereby Mini: Why We Moved to a Two-Intent Model
We replaced a growing taxonomy of product intents with a single routing fork (edit the document or answer in chat), so downstream pipelines stay rich while the top level stops fighting real language with labels.
Introducing the Standalone AI Text Detector
A single aggregate score buried in the editor told learners a draft "looked risky" but not where, and gave them nothing to compare across revisions; we shipped a standalone detector with section scoring, explicit re-grade actions, and a versioned snapshot history.
Introducing Cereby Tutor: Your AI Study Partner Inside Every Quiz
Cereby Tutor brings question-bound AI help into every quiz, with hints before you submit and honest explanations after, optionally grounded in the same source the quiz came from.
Why Cereby Supports Multiple AI Models (and How to Choose One)
A two-default model stack concentrated vendor risk and gave learners no honest way to trade quality against cost, so we moved to a gateway-backed allowlist with explicit tiers that make the tradeoffs visible without changing how the app talks to providers.
How Cereby's Four-Layer Context Makes AI Feel Continuous
Ad hoc prompt blobs drifted in shape and blew token budgets, so we replaced them with a strict ordered stack of system, session, memory, summaries, and live thread, making trimming and continuity a policy rather than guesswork.
Introducing Cereby Mini: Your Document AI
Reading in an editor, switching to an AI chat, copying suggestions, and pasting them back is slow and error-prone, so we built an in-document assistant where every proposed change goes through a diff gate before it touches a single character.
Accurate Citations with Compressed Context: A Two-Stage Verification System
Cereby's compression pipeline cuts token usage by 90-95%, so we built a two-stage citation system that verifies every quote against the full original document even when the AI only saw 5% of it.
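Stage two in miniature, with illustrative normalization; the key point is that verification runs against the full original document, not the compressed context the model saw:

```typescript
function verifyCitation(quote: string, fullDocument: string): boolean {
  // Whitespace- and case-insensitive containment check against the
  // original source, so compression artifacts can't fake a quote.
  const norm = (s: string) => s.replace(/\s+/g, ' ').trim().toLowerCase();
  return norm(fullDocument).includes(norm(quote));
}
```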
Query-Aware Smart Compression: Solving the Single-Page Truncation Problem (part 2)
Simple truncation was silently discarding the most relevant parts of oversized pages, so we replaced it with a scoring pipeline that selects chunks by query relevance instead of position.
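A sketch of relevance-first selection; the term-overlap scorer is a stand-in for the real scoring pipeline:

```typescript
function selectChunks(chunks: string[], query: string, budget: number): string[] {
  const terms = new Set(query.toLowerCase().split(/\W+/).filter(Boolean));
  const score = (chunk: string) => {
    const words = chunk.toLowerCase().split(/\W+/);
    return words.filter((w) => terms.has(w)).length / Math.max(words.length, 1);
  };

  // Rank every chunk by query relevance, then fill the budget greedily,
  // instead of keeping whatever happened to come first on the page.
  const ranked = chunks
    .map((text, i) => ({ text, i, s: score(text) }))
    .sort((a, b) => b.s - a.s);

  const picked: { text: string; i: number }[] = [];
  let used = 0;
  for (const c of ranked) {
    if (used + c.text.length > budget) continue;
    picked.push(c);
    used += c.text.length;
  }
  // Restore document order so the model reads a coherent excerpt.
  return picked.sort((a, b) => a.i - b.i).map((c) => c.text);
}
```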
Hierarchical Context Compression: Cutting AI Costs by 90% Without Losing Quality (part 1)
File Chat was fast and accurate in demos, but each query cost $1.50 and took 20 seconds once a real document was involved, so we built a two-phase compression system that cut token usage by 92% and response time by 85% without measurable accuracy loss.
Optimizing Cereby AI: From 5-8 Seconds to Sub-Second Responses
Every request to Cereby AI was taking 5-8 seconds, and we traced the problem to sequential database queries and redundant AI calls. This is how a three-tier cache, context compression, and parallelized queries brought that down to sub-second responses with 40-50% lower API costs.
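A sketch of the read-through tier order; `shared` stands in for whatever backs the middle tier (Redis or similar), and the TTL handling is illustrative:

```typescript
declare const shared: {
  get(key: string): Promise<unknown | null>;
  set(key: string, value: unknown, ttlMs: number): Promise<void>;
};

// Tier 1: in-process map, fastest and per-instance.
const local = new Map<string, { value: unknown; expires: number }>();

async function cached<T>(key: string, ttlMs: number, load: () => Promise<T>): Promise<T> {
  const hit = local.get(key);
  if (hit && hit.expires > Date.now()) return hit.value as T; // tier 1

  const warm = await shared.get(key);                          // tier 2
  const value = warm ?? (await load());                        // tier 3: database
  if (warm == null) await shared.set(key, value, ttlMs);

  local.set(key, { value, expires: Date.now() + ttlMs });
  return value as T;
}
```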
Voice-Powered Learning: Cereby's New Audio Capabilities
Keyboard-first study flows broke on mobile and in motion, so we added push-to-talk dictation and a script-plus-TTS podcast path that let learners speak requests and listen to generated review audio without leaving Cereby.
Inside Cereby's Intent Classifier: How We Route Natural Language to the Right Tool
We replaced a brittle keyword router with a six-phase intent classification pipeline that gets 95% of requests right on the first try, down from roughly one in four needing correction.
Improving Cereby Capabilities: From Plain Text to Rich Visual Learning Materials
Cereby AI generated sound ideas but delivered them as flat bullets and broken math, so we rebuilt the output layer around a shared visual stack that AI and users write through equally.
Introducing Cereby AI: Your Personal Learning Assistant
Stateless chat threw away quiz history, calendar context, and thread continuity; Cereby AI fixes that by shipping a deliberate bundle of learner state on every request, so the model can give advice that is actually grounded in what the student has been struggling with.
Mastering Spaced Repetition: The Science of Long-Term Memory
Most studying feels productive but leaves nothing behind. Spaced repetition fixes that by scheduling every review at exactly the moment your memory is about to slip.
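For the curious, the scheduling idea in miniature: recall decays roughly exponentially with time, and each successful review raises stability, so the next review can wait longer. The constants and names here are illustrative, not a specific published algorithm:

```typescript
// Probability you still recall an item t days after the last review,
// under a simple exponential forgetting model with stability s.
function recallProbability(daysSinceReview: number, stability: number): number {
  return Math.exp(-daysSinceReview / stability);
}

// Schedule the next review for the moment recall is about to dip
// below the target, e.g. 90%: t = -s * ln(target).
function nextReviewInDays(stability: number, target = 0.9): number {
  return -stability * Math.log(target);
}
```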
