[
continued from parts one and two;
abstract]
Foundations of digital identityHow will identity in an integrated social net be tangibly defined and described? What will be the mechanisms that facilitate the flow of attributes* of that identity across the net? And how might these factors promote a strong yet fluid phase-patterned evolution of human communities** within the net?
*I use "attributes" to mean any digital artifact within the net that is attributable to an identity (of an individual or community), including digital media, personal information and other descriptive meta-data.
**See
part one for a discussion of phase behaviour in human communities.
These are the questions with which I ended
part two of this essay. Perhaps a useful starting point for their discussion is to explore the aptitudes and limitations of existing technologies and services with regards to the definition, description and federation of identity attributes across the social net, and the implications of those attributes as regards the promotion or inhibition of healthy phase behaviour in online community structures. Four main topics spring to mind here:
RSS syndication, open
APIs, web search and free-tagging. Each of these technologies or services provides at least a partial counter-balance to the ad hoc and fragmentary tendency in social applications' ontologies discussed in part two, serving as identity "glue" with which those applications and the identities of their users may be provisionally patched together.
RSS
RSS (and, to a lesser extent, its sibling syndication format
Atom), has become a de facto standard for the open syndication of net-based content
[explanation here], providing a simple "mould" with which blocks of data may be built, then shared and combined across diverse applications,
much like Lego bricks may be built into an infinite number of models. This modular re-usability has led to RSS becoming the content syndication format of choice for a swathe of social net applications such as blogs, newsreaders, photo sharing services and social bookmarking services. One of the biggest single shifts in the way people consume
web content is being facilitated by RSS: within the rapidly-expanding sphere of RSS-enabled web sites, people no longer have to surf to each one to manually check for updates, but can aggregate their favourite content streams in one or several places of their choice. The personalised newspaper really has arrived.
However, two drawbacks to RSS as a medium for identity attributes are evident. Firstly RSS data is public data, accessible by anyone via a URL, like an HTML web page. RSS therefore doesn't support identity-based access filtering—you can't syndicate your digital creations with RSS and be selective about who can see or hear those creations (as you might want to do with family photos or a personal message, for example).
Secondly, one of RSS's biggest strengths is also its weakness: the descriptive (meta-data) fields within standard RSS formats (there are actually four versions of RSS in use—0.91, 0.92, 1.0 and 2.0) have been arbitrarily limited within the standards to a few simple items such as creator's name and date of publication. This simplicity is a great advantage in the sense that it precludes conflict between its users over which descriptive fields should be included in the format—in other words, over how to describe the identity of data. But, conversely, the lack of an integration of a rich ontology within RSS effectively hobbles it as a medium for the communication of identity attributes. To be human is surely to be in a state of constantly constructing and communicating meaning for one's life and the people and things which flow through it. A medium for identity attributes must allow people to codify the meaning—for them—of their creations.
RSS excels at dissolving the geo-specific metaphor of traditional web surfing, where we have to visit a particular place on the web each time we wish to know what is new there. With RSS, we can participate as observers at least in the public aspect of multiple communities' and individuals' spaces from the comfort of our own newsreader. Perhaps RSS's key contribution to the social net's evolution is in facilitating this fundamental shift of its governing spacial metaphor. However, RSS's lack of privacy controls or semantic depth would seem to limit its role in shaping net communities in and of itself.
Open APIs
Whereas RSS requires people and social applications to make their content available for syndication by all comers, APIs allow social applications to open up their data store more selectively, via information-processing services, to third-party developers, who can then build added-value applications on top of these "web services". The API-facilitated data-sharing ecosystems amongst and around
Flickr,
del.icio.us and blog services such as
Blogger (amongst other examples) demonstrate the value of open APIs in extending the reach and functionality of social applications. I can post entries to all the main blog services and also links to del.icio.us from my desktop blog application,
Ecto; I can have
my entries in 43 Things (a social goal-setting application) piped into my blog if I choose; I pull my latest Flickr photos into my blog sidebar with a few lines of JavaScript that call the appropriate Flickr web service via their API.
Moreover, the integration facilitated by open APIs works nicely in tandem with RSS syndication: for instance, I can take an RSS feed from my del.icio.us bookmarks, or a topic-defined subset (see the section on "Free-tagging" below) of those bookmarks, and display it in my blog sidebar by using JavaScript to call the
Feed2JS API (see "about Japan" and "del.icio.us bookmarks" in the sidebar of my main blog page for examples). The combination of RSS and open APIs is going quite some way to giving me control over my data: control over from where and to where I process and publish it and control over how I aggregate it.
So does RSS plus open APIs equal an organically-integrated social net? Well, no—at least, not in the way the two are currently being utilised. We have already seen how RSS is mostly agnostic to identity, and completely so as regards the consumers of RSS data. When data enters an RSS channel, it is effectively stripped of all but the bare bones of its identity context. The open APIs discussed above, in contrast, all impose upon the data-exchange process the data schema (structure) of the application that sits behind them. Ecto must know Blogger's schema and how to map that schema onto Ecto's own schema so as to understand how to request relevant data and then how to process it when it is transmitted.
But as one data schema can be mapped onto another, is a situation of ontological fragmentation across social applications really so problematic? If we consider the issue with regard to phase behaviour, it becomes clear that the phase-lock between the schemas of social applications exchanging data via APIs is strong but very brittle: the map that expresses the data transformation between the two schemas must be manually updated every time the schema on either side of the connection changes if it is to remain useful. This might be feasible for a single relationship, but would rapidly become unmanageable in the complex webs of application interconnectivity that an integrated social net will require. Moreover, each application requires a separate login, so if I want to have Ecto (for example) exchange my data with Blogger, Amazon and del.icio.us via their APIs, I must entrust Ecto with the logins to all three. This makes seamless data-sharing across applications problematic from a security point of view. It seems that we must find ways to supplement openness in APIs with some kind of semantic flexibility and a way of federating identity authentification if APIs are to facilitate a truly organic evolution of the social net.
Web search
Web search services such as
Google allow their users to search for web pages containing a particular keyword or phrase. Such services deploy a great deal of computing power to index web pages and the hyperlinks that join them, abstracting a ranking for each page based on criteria such as how many inbound links it receives and the page rank of the linking pages. What emerges, in theory at least, is a picture of which subset of the pages containing a given keyword or phrase the global online populace as a whole rate most highly.
Google's massive popularity attests to both its utility and its simplicity: I type a natural language phrase, press a button, and within microseconds Google serves up several million authority-ranked results from across the web. In many cases I find an acceptable match to what I'm looking for within the first page of results.
Technorati, a specialist search service for blogs, similarly indexes the authority of blogs and blog posts according to which and how many other blogs are linking to them, and utilises the RSS output of many blogs to do this in a far more timely way than Google (Technorati's index is updated every few minutes rather than every few weeks). This effectively allows me to track discussions by topic as they leap across the
Blogosphere (especially as I can subscribe to an RSS feed of the search results).
So how might web search services facilitate organic integration in the social net, and what might their limitations be in this regard? Just as RSS transcends the limitations of place within the web (by allowing us to aggregate and view content from diverse sources in our applications of choice), so web search does for the process of content discovery, granting us one-pointed access to information from across the web. And in ranking the authority of a web page according to the predominantly human-generated hyperlinks that reference it (albeit along with the machine-generated spam links that are a constant threat to these services' integrity), web search services are able to put search results in at least a provisional social context. This social aspect to web search is particularly evident with Technorati, in that the links between blogs that it tracks are often the connections between individuals' voices within conversations that are distributed between their blogs.
But how useful is web search as a tool for community building? A hyperlink connects two web pages in a purely mechanical sense, not as an integral aspect of a mutually-agreed relationship between the pages' creators. This means that anyone can express a relationship (albeit a unidirectional relationship) to anyone else's page on the web at any time. Web search services allow people to reduce this highly fluid ecosystem of public relationship to a set of keyphrase-indexed, linear rankings of global popularity. These rankings in turn tend to amplify the popularity of the top results, whose ranking then further increases. This feedback cycle of hyperlinks and web search has been statistically shown to lead to a
power-law governed, "winner-takes-most" distribution pattern in both
the blogosphere and the
wider web—a powerful phase-locking behaviour that emerges from the web's radical fluidity. This pattern of social interaction as redolent of the vicarious relationship between Hollywood stars and their fans around the globe as it is of healthy communities of mutual value and respect (although the radically-lower barrier to entry allows the web to greatly ameliorate its inherent inequities).
Another limitation of web search is its exclusive reliance on algorithmic analysis of unstructured data in indexing web pages. Web search works on the presumption that it can deduce the topics of a web page simply by indexing the occurence of single words and their combinations anywhere within that page. Yet we have all experienced the frustration of Google failing to intuit our meaning from the search phrase we type. Human language takes its meaning from its rich structure, both internal (grammar and syntax) and external (culturally-specific association), and can surely only be properly understood through the medium of these structures.
Web search excels at tracking the aspects of our identities that are both public and predominantly independent of structured semantic or community context. If I am looking for good ideas on a particular topic or information about certain mass-produced products, for example, I use web search, confident that I will find something at least half useful. But what if I want to find out all the photos that my family have uploaded to the net over the last year? The video rental shops in my local area that locals rate highest? Or which Classical composers my friends like, taking a precise meaning of "Classical" as music composed after the Baroque and before the Romantic era? I need to somehow limit the scope of my search to the content that is attributable to people who participate within the relevant community, be it my friends, my family or my neighbours. Or, to put it another way, I need to be able to search within and through the context of the communities in which I myself participate—and have that search include all the potentially diverse manifestations of that community across locations and applications on the net.
If web search is to evolve to facilitate semantic and community-contextualised search functionalities, it seems that it must learn to traverse the semantic and community structures that people express within the net; it must adapt to peoples' personal ontologies (see also the section on APIs above); it must respect the boundaries of privacy communities place around themselves, providing a view of the network dependent as much on the identity of the searcher as that of the searched.
Free-tagging
One of the
hottest topics in current social software discussions across the blogosphere is free-tagging (additionally known as tagsonomy and folksonomy, the suffix deriving from "taxonomy"). Flickr, del.icio.us and now a rapidly-increasing number of social net applications are allowing people to categorise web content with multiple, freely-chosen keywords. Technorati has also
begun tracking special tag hyperlinks, effectively allowing people to have their own web pages show up in the appropriate tag space within Technorati.
Free-tagging sidesteps the issue of ontological fragmentation inherent in a structured approach to data management by avoiding structure altogether: each tag has an existence independent of other tags, yet it may be combined with those other tags in infinite permutations. Unlike with a traditional directory such as
Meetup.com's where information is shoehorned into a rigid one-shape-fits-all (which may be altered only by the site's administrator), the fluidity of free-tagging allows each person to assemble the keyword "particles" of their personal descriptive ontology according to their individual point of view on a given content element (a photo, for example). This flexibility of free-tagging proves to be extremely useful when it comes to retrieving the information—say a web bookmark—I have stored previously, as I only need to remember one of the keywords that I categorised that information with to relocate it, rather than having to remember a complex route through a directory tree structure.
The purely personal benefits of free-tagging are valuable, but it is as a social tool that it really shines. As any given tag can be used by anyone, it acts as a social pivot-point for the topic or topics it points to. So, for example,
I can find all the photos posted to Flickr and tagged as "London", then, if I like one in particular, leave a comment for its author. And my own choice of tags may be influenced by others' tagging behaviour—so I might tag my photo of my local area as "
rotherhithe" rather than "
canadawater"—even though I think of myself living in both Rotherhithe and Canada Water—because the former clearly has a stronger community following within Flickr and so would be more likely to help me connect with other local people. Free-tagging allows each person their ontological freedom, yet simultaneously incentivises community convergence—or phase lock—onto common ground.
Free-tagging also facilitates the integration of related content across social net applications: Technorati tag pages, for example, aggregate content from Flickr (photos), del.icio.us and Furl (web bookmarks) and web pages with tag hyperlinks into a single location. While this content integration is analogous in some ways to that facilitated by web search, it differs from the latter in two important ways. Firstly, free-tagging is active and explicit semantic categorisation, whereas web search aims to derive web pages' implicit semantic meaning, a process in which the page's author remains passive. Free-tagging would seem the more likely to reflect an author's real opinion. Secondly, free-tagging has the potential to integrate the view points of both audience and author across a single semantic strata (tags), whereas web search construes topic according to the content of the web page, relegating the audience's role to that of arbitrator of the page's relative authority.
Free-tagging combines nicely with RSS, allowing people to lock into semantically-filtered content streams: del.icio.us and Flickr both provide RSS feeds of content categorised with individual tags and tag combinations, and allow further filtering by author. So I can feed my interest in
emergent democracy, for instance, by following in my newsreader the
RSS feed of the
"emergent_democracy"-tagged del.cio.us bookmarks of Joi Ito,
arguably one of the best-informed people in this field. And unlike web search, free-tags allow me to follow a topic from the point of view of an expert single person or group: Joi may divine in a web page—such as
this one—a relevance to emergent democracy, even if the author himself doesn't explicitly make that connection. Free-tagging can also be used by the author to channel a subset of their content to another person, simply
by using that person's name as a tag. Finally, applications that utilise free-tagging, such as del.icio.us and Flickr, contextualise the user-generated tags within the application's formalised schema (see Open APIs). This allows the applications to combine (though not integrate) the advantages of fluid and structured approaches to data management—to append a structured context (creator, date of creation, content referenced etc.) to unstructured tags.
So what limitations might free-tagging have in its potential to facilitate healthy phase processes in online communities? As with RSS, flat-tagging's strength—it's flatness!—is arguably its greatest weakness. While flat-tagging's avoidance of taxonomical structure makes it an excellent tool for integrating approximate semantic meaning across diverse contexts within the social net, this radical fluidity also prevents flat-tagging from acting as a vehicle for information about the structured social and semantic context of each keyword "particle". And without the possibility of structured context, tags can only ever tell a partial story about the meaning, to their author, of the content they describe—about that content's position within its author's world relative to all the other things, people and communities with which they have a relationship. And while social net applications that utilise flat-tagging contextualise free-form tag data within the application's structured data schema, the formal rigidity of such schemas severely limits their usefulness for facilitating integrated data exchange between ontologically-diverse applications (as I demonstrated in my discussion of Open APIs). Much as we saw with web search's lack of structural granularity, free-tagging's radical fluidity would seem to limit its utility as a tool for community evolution.
ConclusionThere remains a dichotomy between the formalised data schemas of social net applications (exposed through their open APIs) and RSS on one hand and the fluid, globally-emergent data patterning of web search and free-form tagging on the other. Functional combinations of fluid and structured modalities (APIs with RSS, web search with RSS, free-tagging with APIs and so on) can provide provisionally useful solutions. However, the lack of organic integration between these mutually-polarised approaches prevents even their combination from facilitating an exchange between and within individuals and communities that shifts seamlessly between phase-lock and phase-shift—between strong structural stability and free-form fluidity.
This final part of my essay has focused on applications and technologies that are becoming or have already become mainstream components of the social net. It seems to me that in order to understand the best ways forward, a clear understanding of the potential and limitations of current approaches will be very useful. However, there are already a number of initiatives in the Digital Identity space that are taking practical steps to address many of the issues this essay has explored—persistent identity, social data-exchange and federated authentification. Amongst them are
Liberty Alliance,
Sxip,
Shibboleth,
FOAF,
Ping ID,
Social Physics,
LID and
Identity Commons. It is beyond the scope of this essay to discuss their relative merits, but it is heartening to see that the vigorous conversation that has sprung up between
the blogs of some of the most prominent players (in a network-wide sense!) is exhibiting the very quality of integrated diversity that is crucial for the development of an integrated social net that can support organic community phase processes.
In a way, I have posited a set of problems for the integration of the social net in the second and third parts of this essay, and done no more than hint at their possible solutions. What might an integration of fluid and structured approaches to data and identity management actually look like, after all? Perhaps if we can first clearly define these problems, their solutions
will emerge. Above all, it seems self-evident to me that discussion of the social net must penetrate to the heart of human identity if it is to give rise to strategies that will enable us to develop that social net in a way that supports the organic phase processes that seem to govern our interactions with each other and the world around us.
topics:
identity community phase_behaviour RSS API tagsTechnorati Tags: API, community, identity, phase_behaviour, RSS, tags, technorati-tags-introduction.html