ABOUT COSMIC LOG

Quantum fluctuations in space, science, exploration and other cosmic fields... served up regularly by MSNBC.com science editor Alan Boyle since 2002.

Alan Boyle covers the physical sciences, anthropology, technological innovation and space science and exploration for MSNBC.com. He is a winner of the AAAS Science Journalism Award, the NASW Science-in-Society Award and other honors; a contributor to "A Field Guide for Science Writers"; and a member of the board of the Council for the Advancement of Science Writing.

Check out Boyle's biography or send a message to Cosmic Log via cosmiclog@msnbc.com.



First, the Web ... now, the Grid

Posted: Monday, June 25, 2007 6:25 PM by Alan Boyle


CERN

Computer cases are lined up in CERN's Computing Center with room to spare.


The World Wide Web was born in 1990 to manage the billions of bytes of data from experiments at CERN, Europe's particle-physics laboratory. Now the same laboratory is gearing up for a new round of experiments that could generate more than a quadrillion bytes of data every month - data that will have to be processed and delivered to researchers around the world. Is there anything in sight that could outdo the Web? Say hello to the Grid.

CERN has been working for a decade on the foundations for the Grid, which is a next-generation network that draws upon storage space as well as processing power from linked computers. That's about as long as it's been preparing for the Large Hadron Collider, or LHC.

The LHC’s experiments will be taking in the details from millions of proton collisions every second. A lot of the less interesting data will be filtered out almost immediately by the "trigger" computer programs overseeing each experiment. Nevertheless, hundreds of megabytes of data will be dumped into CERN's central computer system every second.

"That means you fill a DVD in a few seconds," Francois Grey of CERN’s Computing Center told me. Over the course of a year, the center is due to store 15 billion megabytes of data – which you can also think of as 15 petabytes, 15 quadrillion bytes or (according to CERN) roughly 1 percent of all the information generated by humanity in the course of a year.

All those data will have to be available on demand for the 7,000 researchers around the globe who are slaving away on the LHC experiments. The information flow is expected to rise to 1.6 gigabytes per second, or roughly 1,000 times faster than your typical high-end cable Internet connection.

Fortunately, information technology has come a long way since the invention of the Internet (in the 1960s) and the Web (in 1990).  You can tell by all the empty space you see in the Computing Center's air-conditioned hive, in the heart of CERN's campus on the French-Swiss border.

Thousands of computers and robotically controlled tape drives are humming right along, processing simulated data to prepare for the real thing. But there’s still enough empty floor space available to start up a bowling alley.

"We have this oversized computer center because it's from the '70s, when computers were huge,” Grey explained.

Even while information is coming in from the experiments for storage, Grey said, it will be going out via a 10-gigabit-per-second optical-fiber network to 11 Tier 1 computers around the globe (PDF file). Like a multilevel-marketing pyramid, the network is structured so that Tier 1 computers feed the data to more than 50 Tier 2 computers in various regions (PDF file). Those computers, in turn, distribute the data to the home institutions for all of the 7,000-plus collaborators in the LHC experiments.

"What's new here is that we're getting hundreds of organizations to share resources," Grey said. He added that the "human challenge is in a sense bigger than the technical challenge."

But Grey said the Grid is settling into place as CERN prepares to move from merely simulating the data load to sending out the real stuff. "The peak grumbling phase is over," he joked.

There are already moves afoot to expand Grid technology to other applications. The Open Science Grid and EGEE (Enabling Grids for E-sciencE) are among the first initiatives going beyond the LHC, Grey said. He foresees a day when climate modelers, genetics researchers, oil and gas prospectors and others who have to deal with large, dynamic data sets will get into Grids as well. "It'll be behind the scenes in Web services who use them," Grey said.

Many everyday Web users are already using different breeds of Grids, such as SETI @ Home, Einstein @ Home or Stardust @ Home. CERN itself is joining the club by offering LHC @ Home as a public project.

Just as the Web addressed the challenges of the 1990s, interlocking Grids will be called upon to address the challenges of the 21st century. "It's not about information management now. It's data processing and storage," Grey said.

But when you consider Grids for the common computer user, you have to remember the potential downside. Like the Web, the Grid can be used for good or evil: One could easily imagine the rise of a malevolent Grid - in fact, the zombies may already be taking over.

Add your own thoughts about the rise of the Grid in the comments section below - and for more information about the Grid's past, present and future, just drop in at CERN's Grid Cafe. As for me, I spent the weekend dropping in on cafes and tourist sites in Paris, but I'm due to get back on the road on Tuesday.

Previously from the Big Science Tour: The science behind the tour ... Living in the Web's cradle ... Inside the big-bang machine ... Toiling in the fields of physics ... Inside the antimatter factory.

MAIN PAGE

Email this EMAIL THIS

Comments

Welcome to the MATRIX
We're still trying to figure out how to lock down the  WWW and make it 'safe' for the casual user. The GRID as described sounds like a Petri dish to me.  Fortunately, it'll be full mostly of data from meaningless physics experiments. Later, if oil companies or financial institutions can be duped into using it, the real trouble will start.
Now thats a grid I am talking about. Joining the
LHC @ Home public project.

What are other ways for general public to use, maybe they should not use it, first research than general ppl

Vijay
Grid, Linked computers?, Rise of a malevolent Grid?  Sky net! (okaay, for those of you not into action movies, think Terminator). The day of computer self awareness is drawing closer.
What is to happen to the exisiting physical structure?  This states that everything will be moved over to fiber optics which will have more capacity.  Is this only for scientific research facilities?  
It'll only take hackers seconds to dowload your entire hard drive and steal all your personal information
From the picture in the article, it looks like they will be doing a lot of cluster-based computing?  Is this true, and what is their choice of OS?
All this without completely understanding Electricity
I love grit!
I am actually suprised at the "slowness" of the network associated with a project like this.  1.6 Gb/sec, is 12.8 Gbps, which already exceeds the 10 Gbps fiberoptic network.  So unless they have multiple runs, and some sort of "processing" to split the date into multiple streams, they have already maxed out their network, or are hampered in performance by having to divvy the data across multiple lines.  Bad architecture imo.
Dan from Dallas:  Over what kind of network will Grid users be able to download a 100GB hard drive in seconds?  Because, if that were ever possible, that would mean some incredibly quick download times during MY web browsing!  I'd welcome it.
Every last frame of porn ever filmed stored in one place.....the Grid can take control of it and parcel it out to those that comply with its wishes....the evil possiblities are endless.
Evil intents aside, we shouldn't be so concerned with how it could be mis-used that we fail to use and develop it further.  I look forward to the day when we figure out the human brain is the best computer ever made.
Eh, ATX cases are a serious waste of space. They should take a page from google and just use bare aluminium trays. With creative wiring you should be able to fit 84 dual socket quad core boards per rack.
Hilarious.  

Reminds me of Al Gore, except it steals the true earned credit for the real innovation at CERN itself for HTTP.

I think the term "World Wide Web" confused the tehnical illerati into believing that what was invented was a network, not a new way of interfacing between humans their machines and information, and other humans their machines and their information.

So what is innovative here?  Looks like getting a bunch of bureaucratic science organizations to play nice, so they can get lots of taxpayer $$$$.  Maybe they will buy a DNS to resolve "EGEE" in the middle of a URL.
Anyone see Terminator 3?  Skynet was the program, using Grid like P2P connections to launch nukes and exterminate humanity.  Is this the first step?
Pete Townsend included his version of The Grid in his 1993 album 'Psychderelict'.
get real, people! the sky will not fall. the chinese are not trying to control the computer networks. our government is not trying to use the grid (as in, OMG!!) to soak up and analyze all the data recorded about us online, from simple searches to complex tribal behavior vis a vie the new social order arising from networked behavior. i could go on, but do you really want them to have a central repository for mining information about you? just think what bush and his evile minions would do with that data...
it looks like arnold will finally have his way. ski net is taking over!!!!!!
Just think all the world's music in one place! The ultimate iTunes. Then when you want a song you could find any song or type of musical composition ever created without having to go on a wild goose chase. It's a nice dream but I doubt the record companies would ever co-operate.


SEND A COMMENT

PLEASE READ: All comments must be approved before appearing in the thread; time and space constraints prevent all comments from appearing. We will only approve comments that are directly related to the blog, use appropriate language and are not attacking the comments of others.

Message (please, no HTML tags. Web addresses will be hyperlinked):

TRACKBACKS

Trackbacks are links to weblogs that reference this post. Like comments, trackbacks do not appear until approved by us. The trackback URL for this post is: http://cosmiclog.msnbc.msn.com/trackback.aspx?PostID=235021

Latest Tech & Science News

Syndicate This Site

Add Cosmic Log to your news reader:
live.com xml
myyahoo msn
bloglines newsgator
google