Reports: A new web crawler launched by Meta last month is quietly...
Meta has recently introduced a new web crawler, known as the Meta External Agent, which has been deployed to gather data from various websites to support its AI model. This new crawler has been operational since last month, as reported by multiple firms specializing in tracking web scrapers and bots.
The primary function of the Meta External Agent is to scrape publicly available data from websites, such as text from news articles and discussions from online forums. This data collection process is essential for training Meta's AI models and enhancing their capabilities.
Unveiling the Meta External Agent
According to sources, the Meta External Agent is similar to OpenAI's GPTBot, which is also utilized for web scraping to train AI systems. Despite the similarities, Meta has not made any public announcements regarding the launch of this new web crawler.
One of the key features of the Meta External Agent is its ability to extract data for AI training purposes. The crawler operates under the radar, with minimal websites blocking its access compared to other web scrapers like GPTBot.
Challenges and Controversies
Scraping web data for AI training has been a contentious issue, often resulting in legal disputes between content creators and AI companies. The use of scraped content without proper consent has raised concerns among artists and writers regarding the protection of their intellectual property.
Despite the controversy, Meta continues to leverage web scraping for training its AI models, including the Meta AI chatbot. The company's investment in AI infrastructure reflects its commitment to enhancing AI technologies through data collection and analysis.
Future Developments
Meta's recent deployment of the Meta External Agent indicates a strategic shift towards expanding its AI capabilities. As Meta invests heavily in AI infrastructure, including the development of large language models like Llama, the need for quality training data becomes increasingly vital for AI performance.
The ongoing evolution of Meta's AI technologies underscores the company's dedication to innovation and advancement in the field of artificial intelligence. With a significant budget allocated for AI-related expenses, Meta is positioned to lead the way in AI development and integration.




















