Drsti's AI is not a black box. It is a deterministic, rule-based system built on top of MediaPipe FaceMesh, Google's open-source face landmark detection library. The landmark detector itself is a pretrained model, but everything Drsti does with its output is a fixed set of threshold rules: it does not learn from your data, does not adapt over time, and does not make probabilistic inferences.
Every two seconds while your timer is running, a single JPEG frame is captured from your webcam and sent to a Flask server running on your own machine at localhost:5000. The server runs MediaPipe FaceMesh on the frame and gets back 468 facial landmarks, each with x and y coordinates normalized to the 0–1 range by the frame's width and height.
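To make "normalized (0–1) space" concrete, here is a minimal sketch of mapping normalized landmarks back to pixel positions in the captured frame. The function name `to_pixel_coords` and the 640x480 frame size are illustrative assumptions, not part of Drsti's code.

```python
from typing import Iterable, List, Tuple


def to_pixel_coords(
    landmarks: Iterable[Tuple[float, float]],
    width: int,
    height: int,
) -> List[Tuple[int, int]]:
    """Convert normalized (x, y) landmarks to integer pixel coordinates.

    Hypothetical helper: x is scaled by the frame width and y by the
    frame height, then clamped so every point stays inside the frame.
    """
    pixels = []
    for x, y in landmarks:
        px = min(max(int(x * width), 0), width - 1)
        py = min(max(int(y * height), 0), height - 1)
        pixels.append((px, py))
    return pixels


# A landmark at the center of an assumed 640x480 frame.
center = to_pixel_coords([(0.5, 0.5)], width=640, height=480)
```

Because the coordinates are relative to the frame size, the same threshold rules can apply regardless of webcam resolution.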
From those 468 points, Drsti extracts three specific signals: head yaw (left/right rotation), head pitch (up/down tilt), and eye aspect ratio (which drops toward zero when the eyes close). If any signal crosses its threshold, that frame's interval is marked as "away."
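The rule described above can be sketched roughly as follows. This is not Drsti's actual code: the threshold values are made up for illustration, yaw and pitch are assumed to have been estimated in degrees already, and the eye aspect ratio uses the common six-point definition (sum of the two eyelid gaps divided by twice the eye width).

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float]

# Hypothetical thresholds for illustration only; the real values are not stated.
MAX_YAW_DEG = 25.0
MAX_PITCH_DEG = 20.0
MIN_EAR = 0.2


def eye_aspect_ratio(eye: Sequence[Point]) -> float:
    """Eye aspect ratio from six eye landmarks ordered p1..p6.

    p1 and p4 are the horizontal eye corners; (p2, p6) and (p3, p5)
    are the upper/lower eyelid pairs.
    """
    p1, p2, p3, p4, p5, p6 = eye
    horizontal = math.dist(p1, p4)
    if horizontal == 0:
        return 0.0  # degenerate input: treat as a closed eye
    vertical = math.dist(p2, p6) + math.dist(p3, p5)
    return vertical / (2.0 * horizontal)


def is_away(yaw_deg: float, pitch_deg: float, ear: float) -> bool:
    """Return True if any one of the three signals crosses its threshold."""
    return (
        abs(yaw_deg) > MAX_YAW_DEG
        or abs(pitch_deg) > MAX_PITCH_DEG
        or ear < MIN_EAR
    )


# Synthetic eye shapes, not real FaceMesh output.
open_eye = [(0, 0), (1, 1), (3, 1), (4, 0), (3, -1), (1, -1)]
closed_eye = [(0, 0), (1, 0.1), (3, 0.1), (4, 0), (3, -0.1), (1, -0.1)]
```

Note that the eye check runs in the opposite direction from the head checks: yaw and pitch trip the rule when they grow too large, while the eye aspect ratio trips it when it falls too low.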