XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale

A large-scale cross-lingual speech model is here

#speech-model #scaling

LiT🔥: Zero-Shot Transfer with Locked-image Text Tuning

Image-text pre-training with a locked pre-trained image model improves zero-shot performance

#multi-modal #contrastive-learning

Masked Autoencoders Are Scalable Vision Learners

Fast representation learning with autoencoders via masked image reconstruction

#self-supervised #image-representation

Palette: Image-to-Image Diffusion Models

Diffusion models beat GANs on image-to-image translation

#diffusion-model #image-processing

Bootstrap Your Object Detector via Mixed Training

Augmentation and pseudo-labeling enhance object detection

#object-detection #pseudo-labeling
