Ashish Patel 🇮🇳’s Post

𝗗𝗮𝘆-𝟯𝟱𝟯 𝗖𝗼𝗺𝗽𝘂𝘁𝗲𝗿 𝗩𝗶𝘀𝗶𝗼𝗻 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 Meta AI Introduces A New AI Technology Called ‘Few-Shot Learner (FSL)’ To Tackle Harmful Content Follow me for a similar post: 🇮🇳 Ashish Patel 🇮🇳 ------------------------------------------------------------------- 𝗜𝗻𝘁𝗲𝗿𝗲𝘀𝘁𝗶𝗻𝗴 𝗙𝗮𝗰𝘁𝘀 : 🔸 Paper: 𝗘𝗻𝘁𝗮𝗶𝗹𝗺𝗲𝗻𝘁 𝗮𝘀 𝗙𝗲𝘄-𝗦𝗵𝗼𝘁 𝗟𝗲𝗮𝗿𝗻𝗲𝗿 🔸 This paper is published arxiv2021. 🔸 For the training of AI models, a massive number of labeled data points or examples are required. Typically, the number of samples needed is tens of thousands to millions. Collection and labeling of these data can take several months. This manual collection and labeling delay the deployment of AI systems that can detect new types of harmful content over different social media platforms. To handle this issue, Meta has deployed a relatively new AI model called “Few-Shot Learner” (FSL) such that harmful contents can be detected even if enough labeled data is not available. ------------------------------------------------------------------- 𝗜𝗠𝗣𝗢𝗥𝗧𝗔𝗡𝗖𝗘 🔸 Large pre-trained language models (LMs) have demonstrated remarkable ability as few-shot learners. 🔹However, their success hinges largely on scaling model parameters to a degree that makes it challenging to train and serve. 🔸In this paper, we propose a new approach, named as EFL, that can turn small LMs into better few-shot learners. The key idea of this approach is to reformulate potential NLP task into an entailment one, and then fine-tune the model with as little as 8 examples. 🔹We further demonstrate our proposed method can be: (i) naturally combined with an unsupervised contrastive learning-based data augmentation method; (ii) easily extended to multilingual few-shot learning. 🔸A systematic evaluation on 18 standard NLP tasks demonstrates that this approach improves the various existing SOTA few-shot learning methods by 12\%, and yields competitive few-shot performance with 500 times larger models, such as GPT-3. ------------------------------------------------------------------- #computervision #artificialintelligence #innovation -------------------------------------------------------------------

1 Comment

Ashish Patel 🇮🇳

Paper: https://arxiv.org/pdf/2104.14690.pdf Reference: https://ai.facebook.com/blog/harmful-content-can-evolve-quickly-our-new-ai-system-adapts-to-tackle-it/

1 Reaction

To view or add a comment, sign in

More Relevant Posts

Ashish Patel 🇮🇳

🔥 6x Linkedln Top Voice | AI Research Scientist & Chief Data Scientist at IBM | Generative AI Expert | Author - Hands-on Time Series Analytics with Python | IBM Quantum ML Certified | 11+ Years in AI | MLOps | IIMA |
2h
Report this post
Learning Series:Introduction LLM Quantization 1. Introduction to Weight Quantization : https://lnkd.in/dBEGS2BT 2. 4-bit LLM Quantization with GPTQ: https://lnkd.in/dnbzbXZq 3. Quantize Llama models with GGUF and llama.cpp: https://lnkd.in/d9_89prq 4. ExLlamaV2: The Fastest Library to Run LLMs: https://lnkd.in/dWMCwFjk P.S. If you have similar resources please share in the comment
Like Comment
To view or add a comment, sign in
Ashish Patel 🇮🇳

🔥 6x Linkedln Top Voice | AI Research Scientist & Chief Data Scientist at IBM | Generative AI Expert | Author - Hands-on Time Series Analytics with Python | IBM Quantum ML Certified | 11+ Years in AI | MLOps | IIMA |
2d
Report this post
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone by Microsoft We're thrilled to introduce phi-3-mini, a groundbreaking language model that fits right in your pocket! With 3.8 billion parameters trained on 3.3 trillion tokens, it rivals giants like GPT-3.5 in performance—all from your phone. 🔍 Our unique approach utilizes a mix of refined web data and synthetic data, enabling top-tier performance without the bulk. The phi-3-mini not only promises powerful AI capabilities on mobile devices but does so with an emphasis on safety and reliability, aligning with Microsoft’s responsible AI principles. 📊 Showcased results from benchmarks such as MMLU and HellaSwag confirm that phi-3-mini isn't just small, it's mighty, offering insights and interactions previously only possible on high-end servers. 🌐 Looking ahead, they're committed to enhancing multilingual support and further refining our models to handle a wider array of tasks more effectively. Stay tuned as we continue to push the boundaries of what's possible in AI technology, making it more accessible than ever. #AI #Innovation #Microsoft #LanguageModels
Like Comment
To view or add a comment, sign in
Ashish Patel 🇮🇳

🔥 6x Linkedln Top Voice | AI Research Scientist & Chief Data Scientist at IBM | Generative AI Expert | Author - Hands-on Time Series Analytics with Python | IBM Quantum ML Certified | 11+ Years in AI | MLOps | IIMA |
2d
Report this post
Introducing LLaVA-Llama-3-8B is released! XTuner team releases the new multi-modal models (LLaVA-Llama-3-8B and In the realm of multi-modal models, the XTuner team has taken a monumental leap forward with the launch of LLaVA-Llama-3-8B and its variant, LLaVA-Llama-3-8B-v1.1, powered by the groundbreaking Llama-3 LLM. These models aren't just a step up from their predecessors—they're a giant leap, smashing previous benchmarks and setting new standards of excellence. Why is this a game-changer? 🌟 The performance of these models doesn't just surpass the previous iteration; it obliterates it, redefining what we thought was possible in the process. And guess what? There's even more on the horizon with LLaVA-Llama-3-70B. For those looking to dive deeper, explore the models and their capabilities here: Model: https://lnkd.in/gC2YUrmB) https://lnkd.in/gGXZjyV8 Code: https://lnkd.in/gPYs7qgi ↓ Check out https://lnkd.in/gredE7Nu to get a weekly summary of the top models, repos and papers in AI. Read by 180,000+ engineers and researchers.
2 Comments
Like Comment
To view or add a comment, sign in
Ashish Patel 🇮🇳

🔥 6x Linkedln Top Voice | AI Research Scientist & Chief Data Scientist at IBM | Generative AI Expert | Author - Hands-on Time Series Analytics with Python | IBM Quantum ML Certified | 11+ Years in AI | MLOps | IIMA |
2d
Report this post
LLaVA-Llama-3-8B is released! XTuner team releases the new multi-modal models (LLaVA-Llama-3-8B and LLaVA-Llama-3-8B-v1.1) with Llama-3 LLM, achieving much better performance on various benchmarks. The performance evaluation substantially surpasses Llama-2. (LLaVA-Llama-3-70B is coming soon!) Model: https://lnkd.in/gC2YUrmB) https://lnkd.in/gGXZjyV8 Code: https://lnkd.in/gPYs7qgi
1 Comment
Like Comment
To view or add a comment, sign in
Ashish Patel 🇮🇳

🔥 6x Linkedln Top Voice | AI Research Scientist & Chief Data Scientist at IBM | Generative AI Expert | Author - Hands-on Time Series Analytics with Python | IBM Quantum ML Certified | 11+ Years in AI | MLOps | IIMA |
4d
Report this post
Exciting news in the LLM community! 🚀 The latest rankings are in, and it’s a close competition at the top. With GPT-4 variants showing strong performance, we see a notable entry with Llama-3-70b-Instruct securing a spot within the top ranks. This is more than a scoreboard—it’s a testament to the rapid advancements in LLMs language models. Organizations are pushing boundaries, as shown by the significant movement in positions. A key takeaway for anyone in tech or AI: stay adaptive, innovative, and ready to embrace the evolving landscape.
Like Comment
To view or add a comment, sign in

89,165 followers

View Profile Follow

Ashish Patel 🇮🇳’s Post

More from this author

The Art of Training LLMs: Navigating the Toolkit Beyond Rewards for LLMs

Exploring Mixtral 8x7B: Deep Dive into its Architectural Wonders

Discover the World of Graph Analytics: A Python Guide to Graph Data Modeling

Explore topics