FAS

Yottabytes and the Data Analysis Challenge

07.06.09 | 2 min read | Text by Steven Aftergood

The increasing capability of high-resolution military and intelligence sensors is producing ever-growing quantities of data that could overwhelm the capacity to analyze them unless new approaches to data management and analysis are adopted, according to a newly released report from the JASON defense advisory panel.

“As the amount of data captured by these sensors grows, the difficulty in storing, analyzing, and fusing the sensor data becomes increasingly significant,” the report said.

Extrapolating from current trends, data production could hypothetically reach the Yottabyte range by 2015. (The Yotta- prefix means ten raised to the twenty-fourth power. Mega- means ten to the sixth power, Giga- means ten to the ninth power, and Tera- means ten to the twelfth power.) If one byte of data were used to image one square meter of the Earth’s surface, then 1.6 Yottabytes would be generated by imaging the entire surface of the Earth every second for a hundred years, the report explained.
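The 1.6 Yottabyte figure can be checked with a rough back-of-the-envelope calculation. The sketch below is not taken from the report; it assumes an Earth surface area of about 5.1 × 10^14 square meters and a 365.25-day year, and the names in it are illustrative only.

```python
# Rough check of the "1.6 Yottabytes in a hundred years" illustration.
# Assumptions (not from the report): Earth's surface area is taken as
# roughly 5.1e14 square meters, and a year as 365.25 days.

EARTH_SURFACE_M2 = 5.1e14                 # assumed approximate surface area
SECONDS_PER_YEAR = 365.25 * 24 * 3600     # assumed average year length
YOTTA = 10**24                            # Yotta- prefix: ten to the 24th power


def bytes_generated(years, bytes_per_m2=1.0):
    """Bytes produced by imaging the whole surface once per second."""
    return EARTH_SURFACE_M2 * bytes_per_m2 * SECONDS_PER_YEAR * years


total = bytes_generated(100)
print(f"{total / YOTTA:.1f} Yottabytes")  # about 1.6 under these assumptions
```

Under these assumptions the total comes to roughly 1.6 × 10^24 bytes, which is consistent with the report's illustration.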

While the data management challenge is daunting, it is not unmanageable in principle, the JASONs said, nor is it entirely unprecedented.  “Important parallels can be drawn with data intensive science efforts such as high energy physics and astronomy.”  These efforts show how data filtering approaches can be applied to reduce data storage and processing requirements well below the Yottabyte range.

The report suggested several research and development strategies for improving data management and analysis.  The JASONs also proposed a series of “grand challenges” that would set ambitious technical goals and provide monetary rewards for their achievement.

The December 2008 JASON report was initially withheld from public access, but a copy was released in response to a Freedom of Information Act request from Secrecy News.  See “Data Analysis Challenges”.
