Thomas Edison invented tomorrow here.
And here, Eric Anderson wants to invent the tomorrow after that â" a future of endless digital information and the means to make sense of it.
In other words, a future of Big Data.
More than a century after Edison moved his Edison Machine Works here, laying the groundwork for what would become the General Electric Company, Mr. Anderson is deploying myriad sensors to collect information about what happens inside a massive steam-driven G.E. generator: heat, vibrations, pressure and so on. Working at peak, the contraption can light 750,000 homes.
âWeâre designing these for the next 30 years,â Mr. Anderson, a senior engineer at G.E., said of the companyâs steam-driven generators. âWeâll want to know everything thatâs going on, so we can start to make predictions about what else will happen.â
The sensors inside the turbine will feed data into an even larger computer system, channeling information about things like fuel costs, eather, demand for power in this town or that, and alternative supplies of electricity. All of that data will be churned by computer algorithms to find the optimal performance for the conditions of the moment. Should the turbine run harder, given forecasts of a heat wave? Can wind turbines pick up the load for a day or two? What are the maintenance costs? How well trained is the new worker on the crew?
âThe process of making power doesnât change,â Mr. Anderson said. âIt is still thermal energy, to mechanical energy, to electrical energy. But now itâs about instrumentation, data and prediction, too. Thatâs all different.â
In much the way clicks on an Internet browser tell Google what search results and ads to show next, voluminous data enables faster insights, and faster feedback loops, in fields that include manufacturing, national security, health care and agriculture.
And much the way the Internet has raised new issues around freedom and privacy, the wealth of new in! formation seems likely to afford unprecedented new control over the behavior of things and people. Across the global economy, computer scientists, hospital technicians, consumer marketers, bankers, soldiers, students and more see new possibilities.
âOne thing we can say for sure is that many domains will gain a lot by recording and collecting everything, then putting computing people on top of it all,â said Jon M. Kleinberg, a professor of computer science at Cornell. âIt will change the way we think about ourselves, other people and society.â
Data seemingly collected for one purpose can be easily used for another, Mr. Kleinberg worries. âYou study a power grid to learn about turbines, but you also learn things like peopleâs bedtimes, and how they commute,â he said. âThe part that is less clear is always the harms that happen.â
The trend may change venerable companies like G.E. in striking ways. A thousand feet from the cavernous Building 273, where Mr. Anderson builds tubines with 1,100 hourly workers, many of 20,000 sensor-rich wind turbines built by G.E. are managed by a handful of people, replacing scores of local employees.
Service contracts like these have been important to G.E. for decades, but the company thinks that automated, data-infused management will be far more profitable. Problems can be caught earlier, and there will be much more knowledge about how things behave in the real world, interacting with other people and things.
Each wind tower feeds data into the system 400 times a second. In turn, its blades are adjusted for demand from the grid, what the other towers on the wind farm are doing and the power supply coming from conventional turbines. Two years from now, the turbines will feature lasers that sense the wind 100 meters out, adjusting their blades for the next gust. In the last year, G.E. became the worldâs top seller of wind turbines thanks to the data technology.
Because wind is fickle, the energy goes to batteries that ! feed powe! r to the grid when the data says to. Those batteries are made with a new kind of nickel-salt composite, in full commercial use for less than a year, that is created next door to the wind center. The factory has just 250 hourly workers and 10,000 sensors, reading information four times a second. Each cell in every battery has its own history of the people, chemicals and processes contributing to its creation.
Six months ago, the $170 million factory had a reject rate from poorly made products of 22 percent, among the worldâs lowest for this kind of technology. Normally it takes years to get that yield from a new material. Using Big Data, the company has also expanded the diameter of its wind turbines by 23 percent using cheaper materials, testing the machines under 300 different situations.
Those results have G.E. rethinking 30 years of offshoring and outsourcing its manufacturing to cheaper suppliers in far-off countries. There is now data lust for every product, as each mass-produced object i also seen as something unique in its creation and use. More than 250,000 G.E. turbines, locomotives, jet engines and medical devices are in operation around the world, and controlling all that data improves the profitability of its service contracts and its future manufacturing.
âNow we want to look at the life of every specific machine, all the parts of manufacturing, how everything behaves,â said Mark M. Little, G.E.âs chief technology officer. âIâve spent a career trying to offload manufacturing. Now weâre talking about vertically integrating again.â
Mr. Little started in steam generators 35 years ago and now runs a center with 2,000 Ph.D.âs and advanced software engineers, about twice the number of Building 273âs manual laborers. At its height, G.E. Schenectady employed 45,000 people; 4,000 now work there. The company has over $100 billion in revenue from heavy industry, but the big push is into software that can better analyze and manipulate that world.
That ! push bega! n about two years ago, when Mr. Little and Jeffrey R. Immelt, G.E.âs chief executive, began to take note of the kind of efficiencies that companies like Google, Amazon, Yahoo and Facebook were able to realize on the Internet, using lots of data to overtake traditional competitors in publishing and media.
Much sensor data, like much of the information on the Internet, is so-called unstructured data, which means it does not easily fit into the kind of database grids on which most businesses have relied. To figure out efficiently what is on numerous Web pages in order to catalog things for Internet search or mechanically snoop on the content of e-mails to put the right ads next to them, these companies came up with new methods of dealing with all kinds of data, in fast feedback loops.
Sensors and data storage had become cheap and abundant, filling G.E.âs computers with lots of data it had not kept before. Mr. Immelt committed $1 billion toward seeing if G.E. could use those Internet skills in is business.
Two floors below Mr. Littleâs office, those skills are being applied to, of all things, the venerable locomotive. Using global positioning data, as well as sensors across tracks, in the engine and on cars, software can figure out the horsepower needed to run everything at just the right speed, using the mass of a train, the local topography, even the temperature inside tunnels. Other trains on tracks and in yards pour out similar amounts of data.
There are tens of millions of variables every second. At such a large scale even small changes matter. Running a national rail network one mile per hour better is worth $200 million a year. Improving the wait times in rail yards by 15 percent saves $500 million.
Near Cincinnati, where G.E. makes jet engines, complex calculations about the weight and makeup of materials are used in 3-D printers that make certain aircraft parts.
To help figure out airport delays, the company hired statisticians from Kaggle, which runs cont! ests to f! ind the best data scientists worldwide. They looked at 35 factors with hundreds of variables each, like weather, cloud cover, gate availability and regional traffic. The results cut average delays in landing to four minutes, from seven.
âPeople talk about Facebook being complex,â Mr. Little said. âThis is geometrically more complicated. But even a 1 percent difference in the industries weâre in is huge.â
G.E. might also want to look at its own internal operations. All of that offshoring and diversification have given the company more than 450 different and incompatible complex planning systems, the kind of software complexity that even Big Data cannot yet manage. So far, there are no plans to overhaul the complexity of that world.
Instead, G.E. has opened an in San Ramon, Calif., not far from Silicon Valley. Impatient with how much data it is already collecting, G.E. is working with Internet companies on new ways to run bigger power systems, how to manage hospital beds in a way tat keeps nurses happy or figuring out which trees to cut before a storm hits and knocks out power lines.
âWhat will all of this look like when 50 billion machines around the world get connected?â said William Ruh, the head of the San Ramon operation. âIt will certainly change the way we operate machines and how we operate the company. And it will change lots more besides.â
People will invent new kinds of analysis, he says, in a world of âdata fusionâ that combines advanced math and supercomputing in entirely new ways.
That, he says, may be the least of the ways data analysis will change us. âMy kids are already comfortable with technology that makes decisions for them, like what route to drive,â he said. âTo the next generation, these will be the tools of insight.
âThere will be changes in human behavior. People who are used to working from gut experience will ignore data that doesnât agree with their preconceptions and will miss the productivity gains.â!
P! eople will also learn to manage themselves more closely, he said: âEverything you say will be saved and annotated. Will that make us all supercareful, or will we grow immune to things people say? Who owns that record at work, and what will they use it for? We donât know, but that is where the world is taking us.â