The increasing capability of high-resolution military and intelligence sensors is producing ever growing quantities of data that could overwhelm the capacity to analyze them without new approaches to data management and analysis, according to a newly released report (pdf) from the JASON defense advisory panel.
“As the amount of data captured by these sensors grows, the difficulty in storing, analyzing, and fusing the sensor data becomes increasingly significant,” the report said.
Extrapolating from current trends, data production could hypothetically reach the Yottabyte range by 2015. (The Yotta- prefix means ten raised to the twenty-fourth power. Mega- means ten to the sixth power, Giga- means ten to the ninth power, and Tera- is ten to the twelfth power.) If one byte of data were used to image one square meter of the Earth’s surface, then 1.6 Yottabytes would be generated by imaging the entire surface of the Earth every second for a hundred years, the report explained.
While the data management challenge is daunting, it is not unmanageable in principle, the JASONs said, nor is it entirely unprecedented. “Important parallels can be drawn with data intensive science efforts such as high energy physics and astronomy.” These efforts show how data filtering approaches can be applied to reduce data storage and processing requirements well below the Yottabyte range.
The report suggested several research and development strategies for improving data management and analysis. The JASONs also proposed a series of “grand challenges” that would set ambitious technical goals and provide monetary rewards for their achievement.
The December 2008 JASON report was initially withheld from public access, but a copy was released in response to a Freedom of Information Act request from Secrecy News. See “Data Analysis Challenges”.
Datasets and variables that do not align with Administration priorities, or might reflect poorly on Administration policy impacts, seem to be especially in the cross-hairs.
One month of a government shutdown is in the books, but how many more months will (or can) it go? Congress is paralyzed, but there are a few spasms of activity around healthcare and the prospects of a continuing resolution to punt this fight out until January or later.
At a period where the federal government is undergoing significant changes in how it hires, buys, collects and organizes data, and delivers, deeper exploration of trust in these facets as worthwhile.
Moving postsecondary education data collection to the states is the best way to ensure that the U.S. Department of Education can meet its legislative mandates in an era of constrained federal resources.