Mark As Completed Discussion

What is Time-Series Data?

Data collected at several points in time is referred to as time-series data. This is different from cross-sectional data which looks at people, businesses, and other entities at a particular point in time. Correlations between observations are possible since data points in a time series are collected at neighboring periods. Time-series frequency refers to the frequency at which data is collected over a given period.

Time-series data have the following characteristics:

  • Data is gathered over a set period.
  • Workload data is created from scratch and written as inserts rather than being updated to replace existing data.
  • When data is written, the most current time interval is automatically allocated to it.

Processing time-series data involves 5 different steps as shown below:

What is Time Series Data?

  1. Data model: The gathered time-series data must comply with the model's specification, including all of the time series data's characteristic qualities.

  2. Stream computing: This step is for pre-aggregation, downsampling, and post-aggregation for the time series data available.

  3. Data storage: The storage system allows the separation of cold and hot data and efficient range queries. Additionally, the system offers high throughput, vast volume, and low-cost storage.

  4. Metadata retrieval: This step involves supporting several retrieval methods such as those to store and retrieve timeline metadata in the tens of millions to hundreds of millions.

  5. Data analysis: The last step analyzes and computes time-series data in real-time.