# Time-MMD **Repository Path**: ring24/Time-MMD ## Basic Information - **Project Name**: Time-MMD - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-06-25 - **Last Updated**: 2025-06-25 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Time-MMD: A New Multi-Domain Multimodal Dataset for Time Series Analysis ## Introduction Time-MMD is the first multi-domain, multimodal time series dataset covering 9 primary data domains. We ensures fine-grained modality alignment with text-numerical series, eliminates data contamination, and provides high usability. (Check [our paper](https://arxiv.org/abs/2406.08627) for more details!) ## Dataset Overview Time-MMD consists of 1) numerical sequences 2) textual sequences. Binary timestamps `(start, end)` are occupied which enables the adapatation onto various tasks or demands. The structure of this repo is: ``` - Readme.MD - numerical - Agriculture - Agriculture.csv - (Domain Name) - (Domain Name).csv ... - textual - (Domain Name) - (Domain Name)_report.csv - (Domain Name)_search.csv -- Downstream_Tasks - ShortTerm Forecasting - LongTerm Forecasting - Imputation - Anomaly Detection ``` Here, Downstream_Tasks is used to introduce how Time-MMD supports different downstream tasks. For Short-Term and Long-Term Forecasting, please check our library [MM-TSFlib](https://github.com/AdityaLab/MM-TSFlib) for detailed usage examples. we denote to support more tasks and domains in the future. Please feel free to let us know your demands. ### Numerical Data Numerical data of each domain contains a csv file with has the following format: ``` start_date, end_date, OT, (other variable 1), (other variable 2), ... ``` Here, OT represents the default target variable for prediction in each dataset. Its specific meaning is as follows: