Post by Yuzhe Yang
AI Prof @ UCLA | Scientist @ Google | PhD @ MIT
๐ Can LLMs really reason over health time series? Introducing ๐๐๐๐ฅ๐ง๐ฆ โค๏ธ โ the first ๐ญ๐ช๐ท๐ช๐ฏ๐จ benchmark for health time-series reasoning. Most current evaluations of health time series are still narrow in scope.ย With ๐๐๐๐ฅ๐ง๐ฆ, we move beyond that and study how modern LLMs handle real physiological data at scale. We built a large-scale benchmark with ย โข ๐งช ๐ฎ๐ฌ๐+ย test samples ย โข ๐งฉ ๐ญ๐ญ๐ฌ tasks ย โข ๐ฅ ๐ญ๐ฎ health domains (metabolism, motion, cardiac, sleep, audio, ...) ย โข ๐ก ๐ฎ๐ฌ signal modalities (ECG, PPG, EEG, IMU, EMG, CGM, ...) ๐ It enables to date the broadest coverage of ย โข sequence lengths (up to 1M+ steps), ย โข sampling frequencies (up to 48kHz), ย โข time spans (from seconds to years). ๐ Rather than focusing on a narrow slice of prediction, ๐๐๐๐ฅ๐ง๐ฆ covers four levels of reasoning in one unified benchmark: ๐ง ๐๐ฆ๐ณ๐ค๐ฆ๐ฑ๐ต๐ช๐ฐ๐ฏ ๐ ๐๐ฏ๐ง๐ฆ๐ณ๐ฆ๐ฏ๐ค๐ฆ โ๏ธ ๐๐ฆ๐ฏ๐ฆ๐ณ๐ข๐ต๐ช๐ฐ๐ฏ โ๏ธ ๐๐ฆ๐ฅ๐ถ๐ค๐ต๐ช๐ฐ๐ฏ Across 14 state-of-the-art LLMs ๐ค, we find that strong general capability does not yet translate into strong health time-series reasoning. Many models still struggle with long-range temporal structure, high-frequency signals, and tasks that require more than simple pattern matching or heuristic shortcuts. ๐๐๐๐ฅ๐ง๐ฆ is designed as a living and evolving community benchmark. We hope it will continue to grow with community inputs on new datasets / tasks / models, and help push toward AI that can better understand and reason over health time series in the real world! ๐ ๐ Paper: https://lnkd.in/gf5-UBeA ๐ Website: https://lnkd.in/gNEnwjXB ๐ต๏ธ Code: https://lnkd.in/gzcfmYCZ ๐ค Dataset: https://lnkd.in/g7Ea6zvj ๐ Leaderboard: https://lnkd.in/gc5y_8EX Great work led by my students Sirui Li, Shuhan Xiao, Mihir Joshi and collaborators Ahmed Abdelhadi Metwally, Daniel McDuff, and Wei Wang! We are also grateful for generous compute support from Google, OpenAI, Anthropic, and xAI. UCLA UCLA Computer Science Computational Medicine Department UCLA Henry Samueli School of Engineering and Applied Science #AI #HealthAI #LLM #TimeSeries #MultimodalAI #FoundationModels