The seminal paper introducing the Transformer model, which has become central to many state-of-the-art NLP models.
Last Updated: May 2026

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
VerifiedIntroduces EfficientNet, a systematic method for scaling CNN architectures, achieving state-of-the-art accuracy with significantly reduced parameters.
EfficientNet for CNN scaling
At a glance
- Primary category: Research
- Best for: users who want a more specialized AI chat experience
Quick take
Introduces EfficientNet, a systematic method for scaling CNN architectures, achieving state-of-the-art accuracy with significantly reduced parameters.
Top EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks Alternatives
Attention Is All You NeedThe seminal paper introducing the Transformer model, which has become central to many state-of-the-art NLP models.
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingIntroduction of BERT, a new method for pre-training language representations that achieve state-of-the-art results on a variety of NLP tasks.
Generative Pretrained Transformer 3 (GPT-3)The third-generation model in the GPT-n series by OpenAI, showcasing the power of scaling up language models.
More Research
Introduction of BERT, a new method for pre-training language representations that achieve state-of-the-art results on a variety of NLP tasks.
The third-generation model in the GPT-n series by OpenAI, showcasing the power of scaling up language models.
Details the development and capabilities of GPT-3, illustrating its few-shot learning ability across diverse tasks.
Describes Google's Pathways, proposing an innovative approach to scaling AI models and systems asynchronously.
Presents DALL·E, a model that generates diverse and detailed images from textual descriptions, demonstrating the intersection of language understanding and visual creativity.
Introduces the T5 model, showcasing its versatility across multiple NLP tasks through a unified framework for text-to-text processing.
Details the AlphaFold system by DeepMind, which made significant breakthroughs in protein folding, impacting biological sciences.
Reviews the application of AI in generating game content, emphasizing the role of machine learning in creative processes.
Explores the potential of quantum machine learning to revolutionize 6G communication networks, highlighting future research directions.




