FAS

Yottabytes and the Data Analysis Challenge

07.06.09 | 2 min read | Text by Steven Aftergood

The increasing capability of high-resolution military and intelligence sensors is producing ever-growing quantities of data that, without new approaches to data management and analysis, could overwhelm the capacity to analyze them, according to a newly released report (pdf) from the JASON defense advisory panel.

“As the amount of data captured by these sensors grows, the difficulty in storing, analyzing, and fusing the sensor data becomes increasingly significant,” the report said.

Extrapolating from current trends, data production could hypothetically reach the Yottabyte range by 2015.  (The Yotta- prefix means ten raised to the twenty-fourth power.  Mega- means ten to the sixth power, Giga- means ten to the ninth power, and Tera- is ten to the twelfth power.)  If one byte of data were used to image one square meter of the Earth’s surface, then 1.6 Yottabytes would be generated by imaging the entire surface of the Earth every second for a hundred years, the report explained.
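The report's illustration can be checked with a few lines of arithmetic. The sketch below is not taken from the report. It assumes a spherical Earth with a mean radius of about 6,371 km and a year of 365.25 days, and the function and constant names are illustrative only.

```python
import math

# Assumptions (not stated in the report): spherical Earth, mean radius ~6,371 km,
# and a year of 365.25 days.
EARTH_RADIUS_M = 6.371e6
SECONDS_PER_YEAR = 365.25 * 24 * 3600
YOTTABYTE = 10**24  # Yotta- means ten to the twenty-fourth power


def earth_surface_area_m2(radius_m: float = EARTH_RADIUS_M) -> float:
    """Surface area of a sphere with the given radius, in square meters."""
    return 4 * math.pi * radius_m**2


def imaging_bytes(years: float, bytes_per_m2: float = 1.0) -> float:
    """Bytes produced by imaging the whole surface once per second."""
    images = years * SECONDS_PER_YEAR  # one full-Earth image per second
    return earth_surface_area_m2() * bytes_per_m2 * images


total_yottabytes = imaging_bytes(years=100) / YOTTABYTE
print(f"{total_yottabytes:.1f} Yottabytes")  # prints "1.6 Yottabytes"
```

Under these assumptions the surface area comes to roughly 5.1 x 10^14 square meters and a century to roughly 3.2 x 10^9 seconds, so the product lands near 1.6 x 10^24 bytes, consistent with the figure quoted from the report.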

While the data management challenge is daunting, it is not unmanageable in principle, the JASONs said, nor is it entirely unprecedented.  “Important parallels can be drawn with data intensive science efforts such as high energy physics and astronomy.”  These efforts show how data filtering approaches can be applied to reduce data storage and processing requirements well below the Yottabyte range.

The report suggested several research and development strategies for improving data management and analysis.  The JASONs also proposed a series of “grand challenges” that would set ambitious technical goals and provide monetary rewards for their achievement.

The December 2008 JASON report was initially withheld from public access, but a copy was released in response to a Freedom of Information Act request from Secrecy News.  See “Data Analysis Challenges”.
