TCBench: A Benchmark for Tropical Cyclone Track and Intensity Forecasting at the Global Scale
TCBench is a benchmark for evaluating short to medium-range (1-5 day) forecasts of tropical cyclone (TC) track and intensity at the global scale. Built on the IBTrACS observational dataset, TCBench formulates TC forecasting as predicting the time evolution of a known tropical system, conditioned on its initial position and intensity. As references, TCBench includes state-of-the-art physical (TIGGE) and global neural weather models (AIFS, Pangu-Weather, FourCastNet v2, GenCast). If not readily available, baseline tracks are consistently derived from model outputs using the TempestExtremes library, while TC intensity baseline models postprocess clipped neural forecasts. For evaluation, TCBench provides deterministic and probabilistic storm-following metrics. Designed for accessibility, TCBench helps AI practitioners tackle domain-relevant TC challenges and equips tropical meteorologists with data-driven tools and workflows to improve prediction and TC process understanding. By lowering barriers to reproducible, process-aware evaluation of extreme events, TCBench aims to democratize data-driven TC forecasting. Code, data, and leaderboard are available at https://tcbench.github.io.