[2310.07820] Large Language Models Are Zero-Shot Time Series Forecasters

Significance

LLMs can zero-shot forecast the future

Review

Since large language models (LLM) are capable of various zero-shot tasks, timeseries forecasting is one of the possible areas of application. The authors propose LLMTime and demonstrate that LLMs can be leveraged for the timeseries forecasting task without extra fine-tuning. The key features of LLMTime include carefully designed tokenization with added spaces, rescaling, and sampling/forecasting. LLMTime with GPT-3 and LLaMA2 achieves competitive zero-shot results on a number of benchmarks including Darts, Monash, and Informer.

231013-1 LLMTime with GPT-3 and LLaMA2 achieves competitive zero-shot results

The authors further investigate the special properties of LLMs that can be related to its inherent capability of forecasting the upcoming value.

Related

Share

Comment

#image-generation #multi-modal #language-model #retrieval-augmentation #robotics #forecasting #psychiatry #instruction-tuning #diffusion-model #notice #graph-neural-network #responsible-ai #privacy-preserving #scaling #mixture-of-experts #generative-adversarial-network #speech-model #contrastive-learning #self-supervised #image-representation #image-processing #object-detection #pseudo-labeling #scene-text-detection #neural-architecture-search #data-sampling #long-tail #graph-representation #zero-shot #metric-learning #federated-learning #weight-matrix #low-rank #vision-transformer #computer-vision #normalizing-flow #invertible-neural-network #super-resolution #image-manipulation #thread-summarization #natural-language-processing #domain-adaptation #knowledge-distillation #scene-text #model-compression #semantic-segmentation #instance-segmentation #video-understanding #code-generation #graph-generation #image-translation #data-augmentation #model-pruning #signal-processing #text-generation #text-classification #music-representation #transfer-learning #link-prediction #counterfactual-learning #medical-imaging #acceleration #transformer #style-transfer #novel-view-synthesis #point-cloud #spiking-neural-network #optimization #multi-layer-perceptron #adversarial-training #visual-search #image-retrieval #negative-sampling #action-localization #weakly-supervised #data-compression #hypergraph #adversarial-attack #submodularity #active-learning #deblurring #object-tracking #pyramid-structure #loss-function #gradient-descent #generalization #bug-fix #orthogonality #explainability #saliency-mapping #information-theory #question-answering #knowledge-graph #robustness #limited-data #recommender-system #anomaly-detection #gaussian-discriminant-analysis #molecular-graph #video-processing