Posts Tagged ‘radar-networks’

PostHeaderIcon Sneak Peak At T2, Twine’s Semantic Search Engine

Extracting meaning from the Web is huge project that is very difficult to do at large scale. Keyword search only skims the surface of meaning locked in Web pages. Various semantic search technologies try to go deeper by adding structured data to web pages so that the Web can be treated more like a database. But adding semantic metadata to the Web is laborious and time-consuming. Just look at Twine. It’s approach so far has been to add semantic data only to the Web pages members save to the service.

While it appeared like Twine was finally getting some traction earlier this year, it’s fallen by the wayside. Traffic is way down (see chart below), partly because it is no longer buying traffic with ads and partly because of changes to the way Google indexes the site. Bottom line is that is that beyond a hardcore following of about 250,000, Twine does not have broad appeal.

But CEO Nova Spivack and his team at Twine have been busy working on something else entirely, to the point that the current Twine service is pretty much on autopilot. In the video above, Spivack gives a sneak peak at what his team has been working on. Codenamed T2, it is complete departure from the navel-gazing approach of Twine 1.0. It is a big step towards creating a semantic search engine that might eventually scale across the Web—exactly the kind of swing for the fences type of idea we like to see at TechCrunch.

When T2 launches, hopefully by the end of the year, it will be a demonstration of what semantic search could be. T2 will have a semantic index of the top 50 to 100 sites across major categories such as food, health, sports, music, finance, television, politics, tech and movies. In those categories, T2 should provide really good guided search. If you search for “baseball” you will get a list of baseball players, along with categories on the side to refine the list such as by position or team name. When you type in “thai food,” you can select the Recipes tab and then filter by food site, rating, main ingredient, and so on. Or you can select the restaurant tab and drill down by city, hours of operation, etc.

You’ll find this type of guided search on Bing, with the categories changing based on the initial search term. But Twine does things differently.

What Twine has done, basically, is speed up the rate at which it can look at a raw Web page and create semantic metadata for it. Bing sometimes does this via natural language processing, through the technology it bought with Powerset. That takes a lot of computation. It also employs other methods. Twine’s approach is more to create a set of semantic tags for each page.

There are already standards for doing this, such as RDF and OWL, but most Webmasters don’t bother adding such tags to their sites. If they happen to be there, Twine can read them, but it can also make a good guess as to what is on the page and assign its own tags to the page. In order to try to make it easier for Web developers to tag their sites, Twine is also working on developer tools such as an Ontologies Editor. This lets anyone with domain expertise define the different concepts and tags which would characterize a page about a particular topic, such as a recipe or a baseball player or a car. For example, a recipe might be contain concepts such as ingredients, difficulty level, an author, and a a date. There are literally millions of potential properties that can be matched to different concepts. The collection of all of these together for a specific topic is an ontology.

There can literally be hundreds of thousands of ontologies for every conceivable topic. If Twine knows what ontology to apply to a given Web page, it can do a better job applying semantic tags to it and extracting data. Twine wants to create an open directory of these things, which will be like a SourceForge for ontologies where anyone can contribute and make them better. You can watch this video for more details.

All of this might seem a bit abstract, but if we could ever get to the point where the most important pages on the Web have semantic tags, it will be a lot easier for computers to know what they are about. And to the extent that data is locked in those pages, the Semantic Web will turn that data into something that can be computed. As these tags get applied to more and more information, they could eventually help filter stream data as well that everyone is increasingly drowning in.

Whether or not Twine will be the company to deliver any or all of this is a long shot, but it is definitely something worth swinging for.

Crunch Network: CrunchBase the free database of technology companies, people, and investors

TechCrunch50 Conference 2009: September 14-15, 2009, San Francisco





PostHeaderIcon Twine Is Taking Off, Now Bigger Than FriendFeed

It turns out that people are following more than just their friends online. Look at the comScore chart above comparing unique visitors in the U.S. to FriendFeed versus Twine. Yeah, I was shocked to see that Twine has more than three times as many unique monthly visitors as FriendFeed (714,000 vs. 188,000). On a worldwide basis, comScore shows FriendFeed still slightly ahead of Twine. ComScore doesn’t always do a great job with small sites, so I checked Compete, which shows Twine with 2.25 million monthly visitors in April versus 998,000 for FriendFeed (see embed below). Different numbers, same story.

While FriendFeed is organized around following feeds of your friends’ activities across the Web, Twine is organized around interest feeds. Essentially, Twine members create topic pages that others can follow. It requires more work than FriendFeed. You have to add items such as links,articles, videos, and notes. But once you do, Twine organizes them for you using an underlying semantic index and tagging technology combined with social inputs from the community. So in a sense it competes more with Mahalo or Squidoo in that it creates authoritative pages around topics, except that these pages are really constantly updated topic or interest feeds that anyone can add to. You can read more about the original concept here, which relaunched publicly in October, 2008. All the growth is from October.

I pinged Nova Spivack, CEO of Radar Networks, the company behind Twine, to ask what’s up. He says that both the Compete and comScore numbers are off, but the trend is right. The initial growth came simply from letting people in who had been on the waiting list. But even he is surprised by the growth rate. So far five million items have been bookmarked in Twine. There are now 200,000 registered users who have created Twines (its name for interest feeds) across 30,000 different interest groups. The rest of the traffic comes from people visiting those topic pages.

And it is not all SEO traffic. Spivack provides the following breakdown of traffic by source: 59 percent comes from people coming directly to Twine, 20 percent comes from search engines, and most of the rest comes from people who receive email notifications and daily digests tracking the interest feeds they’ve signed up for. About 2 percent of traffic comes from twitter, but that portion is “rising fast.”

Following interesting people is just a proxy for following your interests, and Twine lets you connect with like-minded people as well. It is the combination that is killer.

Crunch Network: CrunchBoard because it’s time for you to find a new Job2.0




Good Net Recommended