Google Talk: No one’s listening

July 25, 2006 by wes2

One gets the feeling that if AskJeeves launched an IM client they’d do better than Google. But why is Google doing so poorly and how poorly are they doing? From TechCrunch:

The user numbers coming out on Google Talk are staggeringly terrible. Comscore usage numbers show that nearly a year after launch Google is a distant, distant 4th after MSN, Yahoo and AIM. They hold a pitiful 1% of total instant messaging market share, with 3.4 million unique users in May 2006. See the Comscore chart below for more details (I wonder where Skype IM falls in those stats). Note that Comscore does not include Google Talk usage within Gmail itself (where it is embedded), but even factoring that in, the numbers are just awful.

Even factoring in the chatting that goes on within Gmail and which were not included in these numbers, GoogleTalk is a failure by most measures. However Google has been mum on the strategy of their IM client so one doesn’t know what priority, if any, Google placed on the success of their downloadable client.

Marketing and prioritization aside, why haven’t more people switched to Google? The most likely reason is the combination of inertia, switching costs and interoperability. Those that are comfortable with AIM see no reason to deal with the hassle of moving IM systems (despite AIM being bloated and ad heavy); Yahoo and MSN are now interoperable which has opened up new contacts for people on those systems. Since no one even talks about ICQ anymore we’ll just skip them. This leaves GoogleTalk.

The Web 2.0 Bookshelf

July 22, 2006 by wes2

Alex Iskold’s (Web 2.0 Journal) recommended books for those interested in building Web 2.0 applications. It’s a very interesting mix of books about Java, CSS and about the power of networks. A good resource for someone looking for a couple of relevant books to take on their summer vacation.

Web 3.0 Explained

June 15, 2006 by wes2

I've gotten in just enough "what is Web 3.0?" conversations to appreciate this definition (via Web 2.0 Journal):

 

Defining Web 3.0
The defining aspects of the Web 3.0 social experience may therefore be as follows:

· One, that you won’t need to “go” anywhere, except maybe to set up some initial parameters. Where your computer is, is where you are. Information comes to you based on tags and search criteria; you don’t have to go out there.

· Two, that there are no pages. Information comes in packets of discrete units. You merge or cross them, as you need to.

· Three, that there are no Web sites. Existing Web sites are no longer meant for human eyes. They act as indexes to the information, which is accessible via XML request. Exceptions to this will not be Web sites, but independent little islands of commerce or games.

· Four, that creating information is like writing an email or writing a document. Accessible to anyone with a computer.

· Finally, that being on the Web means not being on the Web at all. It is like being “on the telephone”, i.e. you have a telephone in your house.

I believe that some of these can be and should be part of current Web 2.0 design but the line between Web 1.0 and Web 2.0 was blurry (e.g., eBay) so why should it surprise us that the transition to 3.0 won't be as grey?

AOL Launches Buggy YouTube UnCut Video

May 17, 2006 by wes2

Having essentially created mainstream online community and communication back in the early 90’s AOL is finally and wisely exploiting the leverage of its immensely popular AIM service to compete against MySpace and YouTube (the subject of this posting) in their respective areas.

AOL’s foray in community video hosting is called UnCut Video and there is much to like and dislike about it. To be fair it is in beta (and real beta not the fake never-ending Google beta) though if it was me I would have termed it “pre-beta” or “alpha.” Users of today’s web have become used to the “beta” term meaning “we’re still flushing out the business model but the functionality works pretty well” and not “good luck getting this thing to work.”
UnCut Pros:

1. Nice clean interface and straight forward log-in.

2. Easy uploading but it requires an executable (see cons).

3. Easy log-in (and possibly unnecessary registration). If you are one of the 30 million or so people with an AIM account you can immediately log-in to UnCut Video as well using your AIM account. One less confirmation email and one less log-in to remember.

4. Flash videos–which everyone (including YouTube) agrees is the right choice.

5. AIM integration. While I did not try it, AIM is integrated directly into UnCut so if you can easily start a real-time dialog with the person who just uploaded a clip of their kid falling down in the snow.

UnCut Cons:

1. Very, very buggy.

When they say ‘beta’ they mean it. It is buggy. Lot’s of page errors, no search and you can post a video, watch it, and then not be able to find it again. I posted 2 videos but when I went to the MyVideo, they were nowhere to be found. I know that the second video is up but at least as of this writing it’s not showing up in MyVideo. There were even screen formatting issues but I didn’t see anything about a particular browser being required (I was using the latest

“We’re sorry! We are temporarily experiencing difficulty delivering this page”

2. Dowload of executable required.

Requires you to download the VideoEgg Publisher–a step not required by YouTube. The upside of downloading this small executable however is that you get ‘drag & drop’ functionality to the uploading process. (I also understand that VideoEgg technology may be driving much of UnCut).

The video uploading is really very easy and you can immediately watch it (notwithstanding you can immediately lose it as well but this is a bug AOL is sure to quickly fix).

3. Organization.

Hard to tell how much of this is because of the beta but finding videos is very hard. You can sort by overly broad categories (something I would imagine that hardly anyone ever uses on YouTube). Incredibly and unlike YouTube there are no tags (and of course no tag cloud).

4. Overly onorus content release:

First, as a point of comparison, this the YouTube content release:

Do not upload copyrighted material for which you don’t own the rights or have permission from the owner.

Do not upload copyrighted, obscene or any other material which violates YouTube’s Terms of Use. By clicking “Upload Video,” you are representing that this video does not violate YouTube’s Terms of Use and that you own all copyrights in this video or have express permission from the copyright owner(s) to upload it.

By contrast AOL’s release is several paragraphs that covers the expected copyright language (no surprise given the fact that AOL is part of major media company Time Warner) but also includes in the fine print a rather aggressive grant of rights to AOL from the uploader. Including:

By submitting a video through this feature (the “Content”), you hereby grant AOL LLC, its affiliates, subsidiaries, assigns, licensees and legal representatives (”AOL”) the irrevocable, perpetual, worldwide right to use, reproduce, edit, market, store, distribute, have distributed, publicly and privately display, communicate, publicly and privately perform, transmit, have transmitted, create derivative works based upon and promote the Content (as such may be edited and modified by AOL in its discretion), or any portion thereof, through the AOL brand service and any other other products or services as AOL may designate, and any other web sites not designated by AOL, in any medium now known or hereafter devised, online or offline and by any means of delivery, for editorial, commercial, promotional and all other purposes. You acknowledge and agree that the Content may be shown on any web site, whether within or outside the AOL service. AOL is not obligated to include the Content in this or any program, and AOL reserves the right to remove the Content for any reason at its sole discretion. In addition, AOL cannot guarantee that technical difficulties will not occur during the upload of the Content or that the Content will upload successfully.

While you retain ownership of all right, title, and interest in the Content itself, you agree that AOL owns all right, title, and interest in any compilation, collective work, or other derivative work created by AOL using or incorporating the Content. You represent and warrant that (i) the Content does not and shall not infringe on any copyright or any other third-party right nor violate any applicable law or regulation, (ii) you have the right to grant any and all necessary rights and licenses provided herein, including, without limitation, all necessary copyright and other related rights to the Content, free and clear of all claims and encumbrances without violating the rights of any person or entity, including any right to privacy or publicity.

AOL also has this indemnity clause:

You hereby hold AOL harmless from and against any third-party claim arising from use of the Content. You waive any right to privacy. You waive any right to inspect or approve uses of the Content or to be compensated for any such uses. You hereby represent and warrant that you are at least 18 years of age and that you have read this release and are fully familiar with its contents.

What others are saying:

Mashable: AOL is in a great position to capitalize on the AIM brand with web-based social software, but its latest ventures still have a long way to go.

TechCrunch: The only significant difference between AOL Uncut and YouTube is that YouTube supports tagging, whereas UnCut doesn’t.

tags:

Humorous Web 2.0 Quote of the Day

May 12, 2006 by wes2

A Web 2.0 site is "a startup that generates more RSS than revenue."

(source: Engadget)

tags:

Deconstructing Flickr’s “Interestingness!”

May 12, 2006 by wes2

flickr_logo_beta.gif

Since Flickr is one of the most well-known Web 2.0 sites it is worth taking a look at what they do if for no other reason than to use it as a predictor for what the legions of Flickr clones will soon try to copy. Today we are looking at Flickr's method for selecting what it terms "interesting" photographs–purportedly without the intervention of human editors. The results are generally pretty impressive adding to the question what variable they use to distinguish compelling visuals.

Thomas Hawk:

"More than just "interesting," "interestingness" could potentially be a way that Yahoo! reclaims a little piece of search from Google. Today image search at both Google and Yahoo! is largely broken. Do a search for "San Francisco" at both Google and Yahoo! Image Search and you will find a hodge podge of mostly mediocre images.

Says BuzzMachine:

"What’s great about this is that it exposes not the wisdom of the crowd but the taste of the crowd"

Now for the algorithm. Interestingness is described on Flickr as:

"…an amazing new Flickr Feature.

There are lots of things that make a photo 'interesting' (or not) in the Flickr. Where the clickthroughs are coming from; who comments on it and when; who marks it as a favorite; its tags and many more things which are constantly changing. Interestingness changes over time, as more and more fantastic photos and stories are added to Flickr."

Emphasis has been added.

By hinting at the existence of secret sauce, Flickr enters the "we're more than a pretty face" and "trust us we have amazing algorithms underneath" competitions along with people like Digg. If fact it's getting to the point that if you can't work the word algorithm into your "about us" you risk being called "web 1.0"!

Looking at Flickr's description of interestingness above you get a hint at what gets a photograph selected for this distinction and here is what we (and others) have been able to determine.

  • Views, internal and external to Flickr, of the photo
  • Number of comments on the photo, and also who comments on the photo
  • Tags applied to the photo
  • Flickr discussion groups in which the photo appears
  • Favorites, a.k.a Flickr bookmarking, of the photo
  • Time varying behavior of the above factors

Flickr mentions "who" twice in their one paragraph description of their process, but that is a much more difficult aspect to deconstruct. However one blogger noted an absence of the amaturish photos that seem so prevalent and wondered out loud how this might be so:

"One conclusion to draw might be that the professional and semi-professional photographers who make up a minority of Flickr's users are having a disproportionate influence on the metrics that go into Interestingness because they are more active. They make more comments, mark more photos as favorites, look at more pictures not by their current contacts, and therefore their activity has a greater weight in the algorithms that choose the Interesting photos."

Flickr appears to have tinkered with their algorithm (in Feb?) and introducing a penalty for those who appear to try and game the system by uploading to numerous Flickr groups. Aocording to one Flickr user:

"…some recent changes to the algorithm devalued the interestingness of photos submitted to too many groups. This had sparked controversy with a specific kind of Flickr user affectionately referred to as a 'group whore'."…"Group whores are users who send their photos to tons of different groups in a desperate attempt to garner attention (read: views, favs and comments) which in turn would hypothetically lead to a higher level of the coveted interestingness."

Here is a debate between flummoxed Flickr users and the Flickr founder over the issue.

Flickr is right, it is an amazing new feature.

Update: Thomas Hawk adds this hypothesis to Flickr's Interesting algorithm:

One major change that has also occured with regards to interestingness (in my guestimation of course) is that averaging has been introduced for more popular photographers to prevent them from overly dominating interestingness.

tags:

The “Del.icio.us Lesson,” Now don’t forget it!

May 11, 2006 by wes2

The "Del.icio.us Lesson" is very simple. So much so that it is far too often forgotten by Web 2.0 technologists. The lesson was defined by Josha Porter who states that "personal value precedes network value: that selfish use comes before shared use."

Even though we’re definitely benefiting from the value of networked software, we’re still not doing so unless the software is valuable to us on a personal level first.

In a later post, Porter continues:

What this means is that if we are to build networks of value, then each person on the network needs to find value for themselves before they can contribute value to the network. In the case of Del.icio.us, people find value saving their personal bookmarks first and foremost. All other usage is secondary.

As people use Del.icio.us more, and in order to gain more personal value, they use tags to be able to find their bookmarks later. Tagging isn’t even the primary function of Del.icio.us. Most of the tagging done on Del.icio.us is done secondarily, and for personal use.

The social value of tags on Del.icio.us is only a happy side-effect. Even though most of the ink spilled about Del.icio.us is about the social value, it’s really not the reason why people use it.

For all of the folks designing Del.icio.us-killers or Del.icio.us-clones (or perhaps Killer Clones?) it is extremely important for them to remember this lesson before they blog endlessly in their founder's blogs in the "about us" sections of their closed beta web sites about the social aspects of their software or the incredible network effect that's going to change their users' worlds.

Porter continues further explaining how tags are different than meta keywords. Tags provide personal benefit, keywords provide only social benefit. Unfortunately people don't really enjoy tagging for the sake of tagging. Sure bloggers would add keywords to their posts because they're hoping it will help drive traffic (in fact bloggers would pretty much do anything to drive traffic). However, for the rank and file user the social benefit of tags pales in comparison to the immediate personal benefit of easily finding sites and information that user had selected for follow-up. As Porter points out, the social benefits and the network effect is just a nice bonus. A bonus of course which has garnered most of the attention from a Web 2.0-hungry world.

Here is a link to a graphic of Del.icio.us stats. As you can imagine it goes to the right and up and up and up.

tags:

Tagging 2.0: Would a ‘rose’ tagged by any other name smell as sweet?

May 10, 2006 by wes2

A Survey of Tagging Trends to Answer the Question "What's all of the fuss about?"

When one digs (or is it diggs?) into tagging, what started out as a really simple way to identify resources quickly gets complicated (anyone up for "deduplication" or "morphological analyses?)." With the hundreds of millions of resources being tagged and the purpose of tagging being the ability to quickly and easily find them, tagging has become an enormous area of focus for entrepeneurs, academics, and researchers, the same way that Internet search grew from basic site cataloging and indexing of a decade ago to today's mind-boggling search/ranking algorithms. But we are getting ahead of ourselves, let's get back to what tagging is.

I. Tagging Defined

"Tagging" is a lightweight and flexible approach to classifying information that allows users to apply whatever terms they think are appropriate to describe or recall an asset without the burden of selecting a category from a known taxonomy. It is an extremely important aspect of 2.0 thinking since it puts the power of classification into users' hands. In short it is a tool set that is both created by the community and beneficial to the community.

II. Tagging Vs. Traditional Bookmarking

Computer users have long stored the URLs of useful web resources locally in a browser client (so-called "bookmarking"). Using bookmarks involves scanning the hierarchical lists. These bookmarks are accessible only through the browser of the computer originally used to store them. There are only limited methods for sharing bookmarks (and even moving them to a new computer of the same user can be a hassle).

Tagging differs from this traditional bookmarking in several very critical ways. First, tags can be annotated with identifying tags, or keywords, selected by the individual bookmarking the resource as meaningful. The use of tagging does not impose mutually exclusive categorization schemes that hierarchical structures or faceted metadata do. People can retrieve bookmarks by tag (or title or comment) without having to search down long folder paths or even which folder they put it in. Moreover, since bookmarks are typically stored in a central repository, social bookmark collections are accessible from any browser and any machine.

III. Social Tagging

Social tags (also called "folksonomies") are users to publicly tag and share content. On sites with social tagging, users can categorize information both for themselves as well as browse (and often add to) the information categorized by others. There is therefore at once both personal and public aspects to collaborative tagging systems. Furthermore, social tagging is inherently open-ended and can respond almost immediately to changes and innovations in the way users think about content. Think of it as 'open source keywords.'

Social tagging also allows users to follow tags that interest them to find other users with interests or viewpoints similar to theirs (another social aspect). The front page of Del.icio.us shows the most recently added bookmarks (including the tags given to them,who created them, and how many other people have that bookmark in common). There is also a “popular” page, which shows the same information for the URLs that are currently the most popular. One can also see any other user’s personal page and even tag it. By looking at other users’ personal pages as well as the “popular” tags page, users can get a sense of what other people find interesting and hook-up with those with similar interests..

Anyone who uses a service like Del.icio.us knows that some tags will be useful for many people (e.g., tagging a picture of a cat as "cat") but other will only be applicable to that individual (tagging a web page as "remembertoprint" or "sendtobob"). If a service has enough people the popular tags will generally overwhelm the individual tags. Interestingly, even personal tags can benefit other users. For example, "if many users find something funny, there is a reasonable likelihood someone else would also find it to be so, and may want to explore it."

IV. Clouds on the Horizon

The good news about tagging is that it's rather simple and it is very useful to find tagged pictures, videos, and other resources with a couple of keywords. However, once tagged, going back and looking for things in the 'tag space' has a number of hard limitations.

First, different people will apply different tags to the same resource (and not just the 'individual' vs. 'many' tags described above). This is okay if you are only searching for things that you filed yourself but that is often not the case. While it's true that this variability can be compensated when a large enough number of users have applied tags, this isn't always possible. Second, from a user perspective coming up with tags from scratch might be more work than they want to do.

Despite these difficulties some in the tagging community adamantly resist any form of taxonomy being imposed on users and the inherent biases that would result. After all Web 2.0 is about listening to users and letting them guide their experiences as much as possible. (You don't want to say, "You can tag this anyway you want as long as you do it the way I want you to."). That said, even tagging pioneers are rethinking their tagging practices to address this issue. For example, Del.icio.us has recently introduce the concept of "bundles" - acknowledging the organization problems of scaling a tagging model.

From Del.icio.us:

What are bundles?

Bundles are a way to group together common tags. For instance, if you have the tags "design", "painting", and "moma", you may want to group these together into a bundle called "art".

Similarly, Flickr has its "clusters" (again grouping related tags). While these can in fact improve the tagging experience, many consider it to be only a partial solution.

One compromise approach is the use of collaborative tagging techniques which suggest tags for resource based on what other users use for a particular object addressing both the vocabulary divergence problem as well as the task of having to come up with tags from scratch. In addition, "virtual users" could be employed and which automatically generate content-based tags and at least address the cold start problem–especially for content without broad appeal. (We all know that it's easier to edit something that start with a blank piece of paper.)

V. Tag Spam?

Unfortunately as with many things on the web, spamming is a problem and threat to the integrity of tagging systems and people are already implementing ways to combat it. One method looks at the users who applying the tag:

In order to combat tag spam, we introduce an authority score (or reputation score) for each user. The authority score measures how well each user has tagged in the past. This can be modeled as a voting problem. Each time, a user votes correctly (consistent with the majority of other users), the user gets a higher authority score; the user gets a lower score with more bad votes.

VI. The Future of Tagging

The ultimate answer, at least according to researchers at UC Berkeley and Yahoo could be "a revised, probabilistic model using seed ontologies to induce faceted ontology," which I believe is a fancy way of saying that users shouldn't have to choose between pure tagging or completely closed taxonomic models.

In fact I learned today that there is already a word for a theoretical compromise between collaboration and vocabulary, collabulary, think of it as a mashup of the two. Wikipedia proposed this example: "If two users define an object as being 'white' and one user defines an object as being 'cream' then a relevance can be defined as "more white than cream".

Note the majority of the papers accessed for this post were accessed from the Tagging Workshop at WWW 2006. For those interested in delving deeper into tags this is an excellent place to start and all of the papers are currently available in PDF.

Additional Recommended Reading: Why People Tag,
tags:

Humor: Build your own Web 2.0 site

May 7, 2006 by wes2

Click here if you don't want to go thru all of the work creating a functioning Web 2.0 company and would like somone else to do it for you.

And we're not the only people having fun with 2.0.

tags:

Ajax Clarified

May 5, 2006 by wes2

Ajax does not equal Web 2.0 but since they are mentioned so often in the same breath it is worth defining what Ajax is and is not.

From the Mozilla Developer Center:

Asynchronous JavaScript and XML (AJAX) is not a technology in itself, but is a term that describes a "new" approach to using a number of existing technologies together, including: HTML or XHTML, Cascading Style Sheets, JavaScript, The Document Object Model, XML, XSLT, and the XMLHttpRequest object. When these technologies are combined in the AJAX model, web applications are able to make quick, incremental updates to the user interface without reloading the entire browser page. This makes the application faster and more responsive to user actions.

I happen to like Ajax a lot but utilizing the Ajax itself does not make a site Web 2.0. Got it?

tags: