Ashish Patel ๐Ÿ‡ฎ๐Ÿ‡ณโ€™s Post

View profile for Ashish Patel ๐Ÿ‡ฎ๐Ÿ‡ณ

๐Ÿ”ฅ 6x Linkedln Top Voice | AI Research Scientist & Chief Data Scientist at IBM | Generative AI Expert | Author - Hands-on Time Series Analytics with Python | IBM Quantum ML Certified | 11+ Years in AI | MLOps | IIMA |

๐——๐—ฎ๐˜†-๐Ÿฏ๐Ÿฑ๐Ÿฏ ๐—–๐—ผ๐—บ๐—ฝ๐˜‚๐˜๐—ฒ๐—ฟ ๐—ฉ๐—ถ๐˜€๐—ถ๐—ผ๐—ป ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ถ๐—ป๐—ด Meta AI Introduces A New AI Technology Called โ€˜Few-Shot Learner (FSL)โ€™ To Tackle Harmful Content Follow me for a similar post: ๐Ÿ‡ฎ๐Ÿ‡ณ Ashish Patel ๐Ÿ‡ฎ๐Ÿ‡ณ ------------------------------------------------------------------- ๐—œ๐—ป๐˜๐—ฒ๐—ฟ๐—ฒ๐˜€๐˜๐—ถ๐—ป๐—ด ๐—™๐—ฎ๐—ฐ๐˜๐˜€ : ๐Ÿ”ธ Paper: ๐—˜๐—ป๐˜๐—ฎ๐—ถ๐—น๐—บ๐—ฒ๐—ป๐˜ ๐—ฎ๐˜€ ๐—™๐—ฒ๐˜„-๐—ฆ๐—ต๐—ผ๐˜ ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป๐—ฒ๐—ฟ ๐Ÿ”ธ This paper is published arxiv2021. ๐Ÿ”ธ For the training of AI models, a massive number of labeled data points or examples are required. Typically, the number of samples needed is tens of thousands to millions. Collection and labeling of these data can take several months. This manual collection and labeling delay the deployment of AI systems that can detect new types of harmful content over different social media platforms. To handle this issue, Meta has deployed a relatively new AI model called โ€œFew-Shot Learnerโ€ (FSL) such that harmful contents can be detected even if enough labeled data is not available. ------------------------------------------------------------------- ๐—œ๐— ๐—ฃ๐—ข๐—ฅ๐—ง๐—”๐—ก๐—–๐—˜ ๐Ÿ”ธ Large pre-trained language models (LMs) have demonstrated remarkable ability as few-shot learners. ๐Ÿ”นHowever, their success hinges largely on scaling model parameters to a degree that makes it challenging to train and serve. ๐Ÿ”ธIn this paper, we propose a new approach, named as EFL, that can turn small LMs into better few-shot learners. The key idea of this approach is to reformulate potential NLP task into an entailment one, and then fine-tune the model with as little as 8 examples. ๐Ÿ”นWe further demonstrate our proposed method can be: (i) naturally combined with an unsupervised contrastive learning-based data augmentation method; (ii) easily extended to multilingual few-shot learning. ๐Ÿ”ธA systematic evaluation on 18 standard NLP tasks demonstrates that this approach improves the various existing SOTA few-shot learning methods by 12\%, and yields competitive few-shot performance with 500 times larger models, such as GPT-3. ------------------------------------------------------------------- #computervision #artificialintelligence #innovation -------------------------------------------------------------------

  • No alternative text description for this image
Ashish Patel ๐Ÿ‡ฎ๐Ÿ‡ณ

๐Ÿ”ฅ 6x Linkedln Top Voice | AI Research Scientist & Chief Data Scientist at IBM | Generative AI Expert | Author - Hands-on Time Series Analytics with Python | IBM Quantum ML Certified | 11+ Years in AI | MLOps | IIMA |

2y

To view or add a comment, sign in

Explore topics