Overview
The Live Digest endpoint captures frames from your stream and generates narrative summaries. The quality and cost of summaries depend on how you configure frame capture: how often frames are captured and over what time window.Key Parameters
window_minutes
How long to accumulate frames before generating a summary.- Shorter windows (5-10 min): Frequent, detailed summaries. Better for fast-moving events.
- Longer windows (30-60 min): Less frequent summaries. Good for gradual trends.
capture_interval_seconds
How often to capture a frame from the stream.- Frequent capture (30 sec): More frames, richer detail. Higher costs.
- Sparse capture (180+ sec): Fewer frames, lower cost. Works well for slow-changing scenes.
Configuration Examples
Traffic Monitoring
Check for incidents throughout the day:Weather Tracking
Hourly weather summaries:Security Monitoring
Detailed 5-minute snapshots:Low-Cost Monitoring
Daily trend reports with minimal API calls:Cost Optimization
Cost = Number of frames sent to VLM × Price per VLM call Lower costs by:- Increasing
capture_interval_seconds- Capture fewer frames (e.g., 180 instead of 60) - Increasing
window_minutes- Summarize longer periods between summaries - Using faster models -
gpt-4o-miniorgemini-2.5-flashcost less thanclaude-3-5-sonnet
Output
Each summary includes:- summary: Narrative description of what happened during the window
- frame_count: Number of frames analyzed
- window_minutes: The time period covered
- grid_b64: Base64-encoded composite image (if
include_frames: true) showing the frames analyzed
Getting Started
- Start with the use case that matches yours above
- Check the summary quality and costs in your metrics
- Adjust
window_minutesandcapture_interval_secondsbased on your needs - Use shorter intervals for detail, longer intervals for cost savings
Related
- Live Digest - API endpoint documentation
- Monitoring & Cost Analysis - Track your usage and costs
- Check Once - For single-frame analysis