April 16, 2008 7:15 AM PDT

On the road to the Semantic Web

The Semantic Web has been just around the corner for a few years. It turns out that bringing a semantic layer of metadata to the Internet is like climbing a mountain in flip-flops.

Tuesday night, Semantic Web mountain climbers Powerset, Radar Networks, and Metaweb participated in a salon at Powerset's San Francisco office, where I talked with them about their product plans.

Powerset gives wings to Wikipedia
I got a preview of Powerset's search engine, which is due to go into beta in the coming weeks, according to co-founder and CTO Barney Pell and as reported by TechCrunch.

Powerset differs from Google and other mainstream search engines in that it linguistically parses sentences, finding subjects, verbs, objects, synonyms, and other elements using a highly sophisticated, language-independent parser licensed from Xerox PARC).

Powerset then extracts and indexes concepts, relationships, and meanings, rather than keywords. (I wrote about Powerset when it first came out of stealth mode, in June 2007.)

Rather than trying to boil the search ocean, compete with Google, and deal with spam and 20 billion documents, Powerset has focused its initial efforts on giving wings to the 3 million pages of Wikipedia.

Hakia's semantic search engine also indexes Wikipedia and other sources. However, Powerset returns a more comprehensive dossier of results for queries, based on deep analysis of Wikipedia pages and other content, and also provides new ways to navigate and discover facts on the individual Wikipedia pages. More details to come when Powerset officially launches its public beta version.

Powerset plans to index the Web at some point (at a significant cost, in terms of servers and bandwidth). For now--or more precisely, when the company allows the public access to its technology--Wikipedia users will be the beneficiaries of a powerful semantic index and user experience.

True Knowledge
I also got a look at True Knowledge's search engine. Company CEO William Tunstall-Pedoe said the search engine is in private beta for now, with about 7,000 users.

Unlike Powerset and other search engines, Cambridge, England-based True Knowledge is building its own knowledge base. Users input facts, as in Wikipedia, but in a more structured manner. In addition, True Knowledge imports data from sources, including Wikipedia, in the form of discrete facts, such "Sacramento is the capital of California."

Queries, including those in natural language, are parsed for machine reading, and they access the repository of facts accumulated. True Knowledge can make inferences, such as in the following example.

(Credit: True Knowledge)

The capability to infer truths based on the data repository would be a welcome feature for Wikipedia, which doesn't have an automated method for dealing with contradictions.

Barney Pell (Powerset), William Tunstall-Pedoe (True Knowledge), Nova Spivack (Radar Networks), Paul Davison (Metaweb)

(Credit: Dan Farber/CNET News)

Metaweb
Another San Francisco Semantic Web start-up, Metaweb, was also a participant in the salon. The company's Freebase is more similar to True Knowledge than Powerset.

Freebase is an community-built database with a large corpus of open data sets, including Wikipedia and MusicBrainz. Powerset includes some Freebase-structured content in its index, and True Knowledge could add Freebase data to its knowledge repository.

Radar Networks' Twine
I also chatted with Nova Spivack, co-founder and CEO of Radar Networks. His company created Twine, an application combining bookmarking, blogging, and RSS reading, with an underlying semantic engine to tie the pieces of data together.

Spivack said Twine has about 7,000 users in private beta, as well as 40,000 standing in line for access. Half of the users have created private Twines, with corporations and closed communities of interest using the service for collaboration.

Major enhancements are planned for the summer and fall, including allowing for complete customization of the user interface. "We have only surfaced a bit of the platform so far. Twine as a platform will integrate with other applications, such as blogs, catalogs, social communities, and corporate sites," he told me.

"It's an enormous multiyear project," Spivack said. It's not like a Google beta or a 1.0 version masquerading as a beta." The same could be said of the other Semantic Web services in the room. It's going to be a very long beta cycle.

Recent posts from Outside the Lines
EIC Squared: Psystar vs. Apple, Cisco vs. Microsoft, Dell's cloud
Exploring Internet Explorer 8
Dell's designs on cloud computing
Welcome to the new CNET
Daily Debrief: Yahoo's winding road
Add a Comment (Log in or register) 1 comment
by thekohser April 16, 2008 9:07 PM PDT
If you want to see a Semantic Web version of Wikipedia, where any person, any company, any organization can create an article about themselves, see MyWikiBiz.com. The site is currently executing an inline table of airline accidents that shows the folly of Wikipedia's "category creep" problem. Why have categories like "Airline accidents in Kentucky" and "Airline accidents in 1983", when you can just organize all that data semantically in one table?
Reply to this comment
Add a comment
Comment SUBMIT

The posting of advertisements, profanity, or personal attacks is prohibited. Click here to review our Terms of Use.

Need help? » Feedback »
Powered by Jive Software
Comment reply

Submit Cancel
The posting of advertisements, profanity, or personal attacks is prohibited. Click here to review our Terms of Use.
Report offensive content:

If you believe this comment is offensive or violates the CNET's Site Terms of Use, you can report it below (this will not automatically remove the comment). Once reported, our staff will be notified and the comment will be reviewed.

Select type of offense:

Offensive: Sexually explicit or offensive language

Spam: Advertisements or commercial links

Disruptive posting: Flaming or offending other users

Illegal activities: Promote cracked software, or other illegal content

Comments (optional):

Report Cancel
E-mail this comment to a friend.

E-mail this to: (Separate multiple e-mail addresses with commas. Limited to 10 addresses.)

Your e-mail address:

Send me a copy of this message

Note: Your e-mail address is used only to let the recipient know who sent the e-mail and in case of transmission error. Neither your address nor the recipients's address will be used for any other purpose.

Add your own personal message: (Optional)

Send e-mail Cancel
advertisement

About Outside the Lines

Dan Farber is the editor in chief of CNET News. He has covered technology for more than two decades, and he previously served as editor in chief of ZDNet, PC Week and MacWeek. Outside the Lines explores the intersection of business and technology.

Add this feed to your online news reader

Outside the Lines topics

Subscribe to the EIC² podcast

Editors Dan Farber of News.com and Larry Dignan of ZDNet, square off in EIC² in this weekly podcast. The two editor in chiefs talk about the big tech stories of the day and provide insight and analysis.

View all EIC² podcast episode blog entries

Subscribe to this podcast using an RSS reader other than iTunes

Subscribe to this podcast using iTunes

Latest tech news headlines

Dell earnings down 17 percent

August 28, 2008 1:34 PM PDT

Google CEO: Internet spurred Obama's nomination

August 28, 2008 1:15 PM PDT

Apple applies for touch-screen Mac patent

August 28, 2008 12:34 PM PDT

Featured blogs

Beyond Binary by Ina Fried Coop's Corner by Charles Cooper Defense in Depth by Robert Vamosi Geek Gestalt by Daniel Terdiman Green Tech One More Thing by Tom Krazit Outside the Lines by Dan Farber The Iconoclast by Declan McCullagh The Social by Caroline McCarthy Underexposed by Stephen Shankland More CNET blogs »
advertisement

Inside CNET News

Scroll Left Scroll Right

News - Business Tech

Dell earnings down 17 percent

The PC maker's net income and earnings per share are below expectation. Dell blames conservative IT spending and the costs of acquiring more market share in Europe.

Gallery

Photos: Raising accessibility standards Photos: Raising accessibility standards

News - Apple

Apple applies for touch-screen Mac patent

A recent patent application filed on behalf of Apple describes technology for controlling a touch-screen Mac tablet with iPhone-like gestures and controls.

Outside the Lines

EIC Squared: Psystar vs. Apple, Cisco vs. Microsoft, Dell's cloud

On this episode of the EIC Squared podcast, CNET News' Dan Farber and ZDNet's Larry Dignan discuss the week's news.

Video

Democrats: Twitter, text, or telephone? Democrats: Twitter, text, or telephone?

News - Digital Media

Veoh decision setback for Viacom, but Google not off hook

In the Veoh video copyright case and in Viacom's lawsuit against YouTube, there's a key difference: whether the plaintiff sent take-down notices.

Video

Bigger blogger presence at DNC Bigger blogger presence at DNC

News - Politics and Law

Google CEO: Internet spurred Obama's nomination

Eric Schmidt fields questions at the Democratic convention about politics, online journalism, and privacy.

News - Cutting Edge

Rocket Racing League takes off with new engine, DKNY

The league, an aspiring Formula 1 for rocket racing, chooses a new liquid oxygen-alcohol engine from Armadillo Aerospace, a suborbital space company founded by Doom creator John Carmack.

Gallery

Images: The highs and lows of digital drama Images: The highs and lows of digital drama

Crave

Battle of the wireless headsets

We put the Creative Digital Wireless Gaming Headset vs. the Logitech ClearChat PC Wireless headset to see which you're better off with.

Green Tech

GE reshapes the future of wind power

How to make wind 10 percent of electricity generation? Funky-shaped turbine blades, high-tech materials, and smarter grid connections, says GE's head of wind research.

Copyright ©2008 CNET Networks, Inc., a CBS Company. All rights reserved. Privacy policy Terms of use

Visit other CBS Interactive sites:

[image]


You are viewing a mobilized version of this site...
View original page here

Mobilized by Mowser Mowser