[GEEKY] How DataRSS might work

Editors Note (that’s me, Pito): I’ve decided to change the name of this thing to “Decentralized Data Discovery – DDD” because I learned from more than one person that calling it Data RSS was misleading and confusing. I need to go back and update the papers and blog posts.

I’ve just finished writing part 3 of my series about DataRSS. Part 1 gives the background and justification for the concept, and Part 2 worked through a semi-believable scenario where having DataRSS would be a good thing.

In part 3 I try to get into more technical detail. I hope that you take the time to read it because that’s the only way I will get technical feedback on it. The reason I wrote the first two parts is that realistically I expect to lose 99.5% of you guys once you open up part 3. That’s why this post is labeled [GEEKY]. Here’s some of what I cover in part 3:

“Data RSS is a simple protocol and a simple data format. It can be implemented in any programming language.

Importantly, the Publisher and Accessor software need not know (can not know) what language the counterparties software is written in. ” (from DataRSS: Technical Overview)

and

“DataRSS is used between two parties, the Publisher, who ‘owns’ some data, and the Accessor, who wants to use that data. Publisher and Accessor are organizations with people in them. The Publisher wants to offer a technical means to allow an application program simple and standardized access to their data.

The Accessor wants to write an application program that accesses and does something useful with data coming from any Publisher. Accessor and Publisher don’t know each other. ” (from DataRSS: Technical Overview)

Delicious isn’t it? One final tease, I also have worked out some detailed examples of how DataRSS might work with the New York Times API, with the Sunlight Foundation API and with the Follow The Money API.

0 thoughts on “[GEEKY] How DataRSS might work

  1. Sort of (loosely) related, I’ve been exploring the use of Many Eyes Wikified to pull in live CSV data from Google Spreadsheets et al, and it struck me that in the consumer it owuld be good to be able to have an equivalent to the browser status bar padlock and current domain for https sites that displayed at least the domain from which the data was being pulled. Then eg in a visualisation widget, you would at least see in the ‘source status’ bar the provenance for the data.(I appreciate this is more a use case than data format comment, but this just happens to be the comment box I’m in at the mo!)

    Like

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s