Categories: Social Media News

TikTok owner ByteDance scrapes the web faster than OpenAI

In This Story

As ByteDance develops artificial intelligence models to compete in China, the bot it uses to scrape data to train those models is reportedly spiking in activity.

Suggested Reading

Suggested Reading

The TikTok owner launched its own web scraper, Bytespider, in April, and it’s now scraping data multiple times faster than bots from other companies, Fortune reported, citing research from Kasada, a bot management company, and Dark Visitors, a monitor of scraper bots. Companies developing AI models, such as Google (GOOGL) and Meta (META), use scraper bots to gather data to train and improve the large language models (LLMs) and multimodal models that power the companies’ AI services.

Advertisement

Bytespider is scraping web data about 25 times faster than OpenAI’s web scraper, GPTbot, Sam Crowther, CEO of Kasada, told Fortune. Compared with Anthropic’s ClaudeBot, Bytespider is 3,000 faster.

Advertisement

Like OpenAI’s and Anthropic’s bots, Bytespider ignores instructions from robots.txt, a non-legally binding line of code that tells web scrapers which data it can and cannot access on a website, Fortune reported. According to Kasada’s data, Bytespider has had spikes in scraping activity in the last six weeks.

Advertisement

“It’s like they’re trying desperately to catch up,” Crowther told Fortune.

ByteDance did not immediately respond to a request for comment.

The China-based company released its AI-powered chatbot, Doubao, last August, and it’s proving to be a tough competitor to homegrown rival Baidu’s (BIDU) Ernie Bot. In May, ByteDance launched a series of Doubao LLMs for enterprises, which cost less than models from the company’s Chinese competitors.

Advertisement

Now, ByteDance is planning to build a new AI model using chips from China’s Huawei, Reuters reported, citing three unnamed people familiar with the matter. However, a spokesperson for ByteDance previously told Quartz the company is not developing a new AI model.

The company has also designed two AI chips with Taiwan Semiconductor Manufacturing Company (TSM) that ByteDance plans to mass produce by 2026, The Information reported, citing unnamed people familiar with the matter. By producing its own chips, the company could become less dependent on Nvidia’s (NVDA) pricey graphics processing units, or GPUs, which are subject to U.S. export controls, people told The Information.

Social Media Asia Editor

Recent News

Disappoint Me by Nicola Dinan review – a fresh take on modern love

Disappoint Me is a novel structured around meals, whether assembled distractedly or seasoned with care,…

12 hours ago

I wish you all a sparkling new year, filled with joy, love, and endless possibilities

Happy New Year to you all, in these uncertain times. But is ‘happy’ the right…

13 hours ago

2026 Ford Ranger Super Duty spotted in Australia

A prototype version of the Ford Ranger Super Duty – a tougher, trade-focused version of…

15 hours ago

Sacrificing bears for amazing hotpots with Kuma-chan Onsen’s fukubukuro lucky bag

Hearts cry and mouths water as this cute little bear disappears into the broth forever.…

15 hours ago

‘I was very lucky’: activist and blogger Lu Yuyu on escaping China

As he trekked up the lush mountain range on China’s border with Laos, Lu Yuyu…

16 hours ago

The top 5 Starbucks Frappuccinos we’d like to drink again in Japan this year

There were many limited-edition drinks on the menu in 2024, but these were the best…

17 hours ago