Knowledge-Augmented Language Model Verification
Better RAG by self-verifying the process
#language-model
#retrieval-augmentation
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Better RAG by self-reflecting the process
#language-model
#retrieval-augmentation
Video Language Planning
Vision language models can make long horizon task plans
#language-model
#multi-modal
#robotics
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Contrastive ViT Makes VLM Stronger
#language-model
#multi-modal
Large Language Models Are Zero-Shot Time Series Forecasters
LLMs can zero-shot forecast the future
#language-model
#forecasting
Empowering Psychotherapy with Large Language Models: Cognitive Distortion Detection through Diagnosis of Thought Prompting
A step towards LLM psychotherapist
#language-model
#psychiatry
Mistral 7B
Simple, intuitive tricks leverage 7B model to 13B performance
#language-model
Instruction Mining: High-Quality Instruction Data Selection for Large Language Models
Evaluating the quality of your instruction dataset
#language-model
#instruction-tuning
Exploring the Potential of Large Language Models (LLMs) in Learning on Graphs
Pre-trained LLMs for graph related tasks
#language-model
#graph-neural-network
OPT: Open Pre-trained Transformer Language Models
Pre-trained large language models open to public for responsible AI
#language-model
#responsible-ai
SubMix: Practical Private Prediction for Large-scale Language Models
Making language models keep the secret by partitioned ensemble models watch each other
#language-model
#privacy-preserving
Efficient Large Scale Language Modeling with Mixture-of-Experts
Meta is working on efficient language models with MoE too
#language-model
#scaling
#mixture-of-experts
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Scaling language models with less global warming
#language-model
#scaling
#mixture-of-experts
Finetuned Language Models Are Zero-Shot Learners
Training natural language models to learn with natural language
#language-model
#zero-shot
Evaluating Large Language Models Trained on Code
GPT knows Python better than me now.
#code-generation
#language-model