By: Charles Bowen
I remember back in the pre-Watergate days addressing a journalism class at our local college and telling students — with all the certainty that only a brand new city editor could muster — that the jobs in their chosen field would always be safe. “After all,” I said smugly, “robots are never going to be able to cover a beat and write a story for tomorrow morning’s newspaper. Only we humans can write a lead.”
Well, at least that remark was correct for a longer span of time than some of my other pronouncements of the period. But that statement might not be correct much longer. The robot reporters are at the gate. Want to see how they’re doing at trying to eat your lunch?
Columbia Newsblaster is an experiment in artificial intelligence at Columbia University that tries to create fresh news leads by analyzing the dispatches it finds from assorted news organizations on the Web. Newsblaster (http://www.cs.columbia.edu/nlp/newsblaster) looks at reports from nearly a dozen resources, including the Los Angeles Times, The Washington Post, USA Today, ABC News, CNN, Reuters, CBS News, Canadian Broadcasting Corp., Yahoo!, Virtual New York, and Wired. The site uses artificial intelligence techniques to sift through news reports published online, then sort and summarize these dispatches into five different news categories: U.S., world, finance, entertainment, and sports.
To organize the material it finds, the software bases the summaries on “reflecting factors,” such as where a fact is mentioned in the published reports and how often it is repeated across reports dealing with the same event or subject. The summaries also are based on the news value of individual facts, such as how many were killed or injured in an incident or how much damage was done to property. The program then produces its leads for each story’s display on the Web site along with links to the original source material it used in producing the leads.
Significantly, Newsblaster doesn’t merely extract whole sentences from the original reports. Instead, it uses “natural language processing techniques” to produce its own copy. Kathy McKeown, the professor who heads the team of researchers at Columbia’s Department of Computer Science in the School of Engineering and Applied Science, says journalists really don’t have to be unsettled by this seeming encroachment on their turf. What Newsblaster does, she recently told The Christian Science Monitor, “is take what journalists have already written and provides a way to easily browse. We could not write the stories ourselves.” At least, not yet. But some of the leads the software provides are remarkably concise and lucid.
Other considerations for using Newsblaster in your own work:
1. While Newsblaster is primarily an artificial intelligence experiment, its interest is not limited to computer scientists. Online Journalism Review recently called it a tool for journalists, executives, “and even average news consumers to help them manage in an age of information overload.”
2. This project arises from a rather impressive background in artificial intelligence. Team leader McKeown has worked on “natural language processing” of news texts for almost 20 years with research funded by the Defense Advanced Research Projects Agency, an arm of the U.S. military.
3. If you write about Newsblaster in your news columns, you might alert students and teachers to its potential as a study aid. McKeown recently told the Monitor, “Right now, my 11-year-old daughter uses it when she has to do her homework on current events.”