June 26, 2024
AI to Exhaust Internet Text Data by 2026
Artificial intelligence systems such as GPT-4 and Claude 3 Opus could consume all available internet text data as early as 2026, according to a new study published on arXiv (Live Science). These models rely on vast amounts of text from the web to improve their performance, but the researchers estimate that the current supply of high-quality online text will be depleted between 2026 and 2032. This impending scarcity raises concerns about the future development of AI models, which may need to turn to synthetic data or to potentially controversial sources such as private data held on servers.
Implications of Data Depletion
The depletion of freely accessible data could lead AI developers to seek alternative sources, including private information stored on servers and synthetically generated data. Current AI advancements depend heavily on large, high-quality datasets; ChatGPT, for example, was trained on 570 GB of text data. If the quality of available data declines, model outputs could suffer, as in past instances where models trained on lower-quality data produced inaccurate or absurd results.
Future Strategies and Challenges
Companies are exploring strategies to address this potential data scarcity. One approach is to use private data, as illustrated by Meta’s new policy of using chatbot interactions for training. Synthetic data generation, currently limited to specific areas such as gaming and coding, could also become more prevalent. However, the shift to new data sources may face legal challenges from content creators seeking compensation for the use of their intellectual property. Beyond data scarcity, the energy consumption of AI models, which is significantly higher than that of traditional methods, presents another major hurdle for the future of AI development.
(Visit Live Science for the full story)
*An AI tool was used to add an extra layer to the editing process for this story.