TimeSeries Large Model
TimeSeries Large Model
Introduction
A time series large model is a foundational model specifically designed for time series analysis. The IoTDB team has independently developed time series large models, which are pre-trained on massive time series data using technologies such as transformer structures. These models can understand and generate time series data across various domains and are applicable to applications like time series forecasting, anomaly detection, and time series imputation. Unlike traditional time series analysis techniques, time series large models possess the capability to extract universal features and provide technical services based on zero-shot analysis and fine-tuning for a wide range of analytical tasks.
The team's related technologies of time series large models have been published in top international machine learning conferences.
Application Scenarios
- Time Series Forecasting: Provides forecasting services for time series data in industrial production, natural environments, and other areas, helping users to understand future trends in advance.
- Data Imputation: For missing segments in time series, perform context imputation to enhance the continuity and completeness of the dataset.
- Anomaly Detection: Utilizing regression analysis technology, monitor time series data in real-time and provide timely warnings for potential anomalies.

Timer Model
The Timer model not only demonstrates excellent few-shot generalization and multi-task adaptation capabilities but also gains a rich knowledge base through pre-training, endowing it with the universal capability to handle a variety of downstream tasks, featuring the following:
- Generalization: The model can be fine-tuned using a small number of samples to achieve leading predictive performance in the industry.
- Versatility: The model is designed flexibly to adapt to various task requirements and supports variable input and output lengths, enabling it to play a role in various application scenarios.
- Scalability: As the number of model parameters increases or the scale of pre-training data expands, the model's performance continues to improve, ensuring the model can optimize its predictive effects with the growth of time and data volume.

Timer-XL Model
Timer-XL is an upgraded version of Timer that further extends the network structure and achieves comprehensive breakthroughs in multiple dimensions:
- Ultra-long Context Support: This model breaks through the limitations of traditional time series forecasting models, supporting inputs of thousands of Tokens (equivalent to tens of thousands of time points), effectively solving the context length bottleneck problem.
- Multi-variable Prediction Scenarios Coverage: Supports various forecasting scenarios, including non-stationary time series forecasting, multi-variable forecasting tasks, and forecasting with covariates, meeting diverse business needs.
- Large-scale Industrial Time Series Dataset: Pre-trained on a trillion-scale industrial IoT time series dataset, which has the important characteristics of huge volume, excellent quality, and rich domains, covering energy, aerospace, steel, transportation, and other fields.

Timer-Sundial Model
Timer-Sundial is a series of generative foundational models focused on time series forecasting. The basic version has 128 million parameters and has undergone large-scale pre-training on 1 trillion time points. Its core features include:
- Powerful Generalization Performance: Possesses zero-shot forecasting capabilities, supporting both point forecasting and probabilistic forecasting simultaneously.
- Flexible Forecasting Distribution Analysis: Can not only forecast mean values or quantiles but also evaluate any statistical characteristics of the forecasting distribution through the original samples generated by the model.
- Innovative Generative Architecture: Adopts a "Transformer + TimeFlow" collaborative architecture—Transformer learns the autoregressive representation of time segments, and the TimeFlow module converts random noise into diversified forecasting trajectories based on the Flow-Matching framework, achieving efficient non-deterministic sample generation.

Effect Demonstration
Time series large models can adapt to real-time series data in various domains and scenarios, demonstrating excellent processing effects on various tasks. The following are real-world performances on different datasets:
Time Series Forecasting:
Utilizing the forecasting capability of time series large models, the future change trends of time series can be accurately predicted. In the figure below, the blue curve represents the forecast trend, and the red curve represents the actual trend, with a high degree of 吻合 (coincidence) between the two curves.

Data Imputation:
Utilizing time series large models to perform predictive imputation on missing data segments.

Anomaly Detection:
Utilizing time series large models to accurately identify outliers that deviate significantly from normal trends.

Deployment Usage
- Open the IoTDB CLI console and verify that the ConfigNode, DataNode, and AINode statuses are all Running.
Check command:
show cluster

When the AINode is started for the first time in a networked environment, the Timer-XL and Sundial models will be automatically pulled.
Verify model registration success
Check command:
show models
