data page 37 - Dave's Blog

Reuters Wants The World To Be Tagged - ReadWriteWeb

2008 Feb 8, 3:24

FTA: "...Using a mix of natural language processing, AI techniques, and a massive databases, Reuters' solution extracts important bits of information from raw HTML pages. People, Companies, Places, and Events are really at the heart of many business artic..."

Tags: via:sambrook api reuters news tagging semantic semantic-web web

Web Q&A: XPath, XML Notepad, Data Islands, Case Sensitivity, XSL, and More -- MSDN Magazine, September 2001

2008 Feb 7, 2:36

To sum up the last Q&A, the one I was interested in: "Is there any way to escape the characters " and ' in an XPath expression...". And their answer is no. Lame. I thought XPath folk would have defined this.

Tags: microsoft msdn xpath xml article
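On the XPath quoting question above: XPath 1.0 string literals have no escape syntax, but a string containing both quote characters can still be expressed by gluing pieces together with the core concat() function. The sketch below is a common workaround rather than anything from the MSDN column, and the helper name xpath_literal is made up for illustration.

```python
def xpath_literal(text: str) -> str:
    """Return an XPath 1.0 expression that evaluates to the string `text`.

    Hypothetical helper: XPath 1.0 literals cannot escape quotes, so a string
    that contains both ' and " has to be assembled with concat().
    """
    if "'" not in text:
        return f"'{text}'"          # safe to wrap in single quotes
    if '"' not in text:
        return f'"{text}"'          # safe to wrap in double quotes

    # Both quote kinds are present: split on " and put a literal " (wrapped
    # in single quotes) between the double-quoted pieces.
    pieces = []
    for index, part in enumerate(text.split('"')):
        if index > 0:
            pieces.append("'\"'")
        if part:
            pieces.append(f'"{part}"')
    return "concat(" + ", ".join(pieces) + ")"
```

The result can be dropped into a larger expression, for example "//book[title = " + xpath_literal(title) + "]", in any XPath 1.0 engine that implements concat().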

making_coins [Zotero Developer Documentation]

2008 Jan 29, 7:28

A standard URI scheme for describing books.

Tags: metadata microformats openurl coins uri

Crossing Four Way Stops Fast and Searching Closed Caption MCE Videos: More Stolen Thoughts

2008 Jan 22, 9:56

More ideas stolen from me in the same vein as my stolen OpenID thoughts.

Fast Pedestrian Crossing on Four Way Stops. In college I didn't have a car, and every weekend I played poker with friends who lived nearby, so I would end up waiting to cross from one corner of a traffic-lit four-way stop to the opposite corner. Waiting there in the cold gave me plenty of time to consider the fastest way of getting there. My plan was to hit the pedestrian crossing button for both directions and travel on the first one available. This only seems like a bad choice if the pedestrian crossing signal travels clockwise or counterclockwise around the four-way stop. In those two cases it's better to take the later of the two pedestrian signal crossings, but I have yet to see those two patterns at a real-life traffic stop. I decided recently to see if my plan was actually sound and looked up info on traffic signals, but the info didn't say much other than "it's complicated" and "it depends" (I'm paraphrasing). Then I found some guy's analysis of this problem. So I'm done with this and I'll continue pressing both buttons and crossing on the first pedestrian signal. Incidentally, on one such night when I was waiting to cross this intersection, I heard a loud multi-click sound and realized that the woman in the SUV waiting to cross the intersection next to me had just locked her doors. I guess my thinking-about-crossing-the-street face is intimidating.

Windows Searching Windows Media Center Recorded TV's Closed Captions. An Ars Technica article on a fancy DVR described one of the DVR's features: full-text search over the subtitles of the recorded TV shows. I thought implementing this for Windows Media Center recorded TV shows and Windows Search would be an interesting project for learning about video files and extending Windows Search. As it turns out, though, some guy, Stephen Toub, already implemented Windows Search over MCE closed captions. Stephen Toub's article is very long and worth reading; it describes some other very interesting related projects, including 'summarizing video files'.

Tags: stolen-thoughts windows search mce windows traffic closed captions four-way-stop windows-media-center

HIMAGELIST Stream Size

2008 Jan 12, 2:26

If you're like me you need to serialize an object that contains an HIMAGELIST via IMarshal for COM's marshalling. I could use ImageList_Write to actually write the HIMAGELIST to a stream for IMarshal::MarshalInterface, but I needed to know the size of the data that I was going to write for IMarshal::GetMarshalSizeMax. I thought I'd use HIMAGELIST_QueryInterface to get an IPersistStream pointer which works, but alas its implementation of IPersistStream::GetMaxSize just returns E_NOTIMPL. Ultimately I called ImageList_Write on a special stream that ignores the data passed to it and just records how much data is written to it. In this fashion I could get the size the HIMAGELIST would require when written to a stream.

Tags: technical himagelist boring serialize imarshal com
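The counting-stream trick above is not specific to COM. The real thing would be a C++ IStream implementation handed to ImageList_Write; what follows is only a Python analogue of the same idea, a write-only stream that throws the data away and keeps a running byte count. The names CountingSink and serialized_size are invented for this sketch.

```python
import io


class CountingSink(io.RawIOBase):
    """Write-only stream that discards its input and records the byte count."""

    def __init__(self):
        super().__init__()
        self.size = 0

    def writable(self):
        return True

    def write(self, data):
        # Accept any bytes-like object; nothing is stored, only measured.
        count = memoryview(data).nbytes
        self.size += count
        return count


def serialized_size(write_fn):
    """Run a serializer against a CountingSink and return the bytes it wrote.

    `write_fn` stands in for ImageList_Write: any callable that takes a
    writable stream and writes the object to it.
    """
    sink = CountingSink()
    write_fn(sink)
    return sink.size
```

The cost is that the object gets serialized twice, once to measure and once for real, which is the same trade-off the HIMAGELIST approach makes.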

Main Page - Gutenberg

2008 Jan 5, 10:39

Project Gutenberg is a directory of public domain literature.

Tags: archive books fiction database directory literature writing free gutenberg public-domain

Summary of data visualisation utilities measured by functionality - Swivel

2008 Jan 3, 12:30

This just in from Swivel: Swivel is the best!

Tags: swivel data visualization website

My Stuff - Swivel

2008 Jan 3, 12:29

This is me on Swivel, a site like Many Eyes for social sharing of data tables and data table visualizations.

Tags: proldfile me data database visualization social

GPS Luddites - the English countryside rebels against satnav | Decentralized Information Group (DIG) Breadcrumbs

2008 Jan 2, 2:13

FTA: "Seems that a number of villages in the English countryside are being overrun by errant trans-European trucks which are regularly misdirected by their GPS satnav systems onto roads that were better suited for horse-drawn carriages than big, long-dist..."

Tags: gps humor article metadata blog england

OpenAerialMap Is Ready For Your Data

2007 Dec 7, 9:45

FTA: "OpenAerialMap is a site for collecting, hosting, and mapping freely available aerial imagery."

Tags: aerial geo map opensource photography social data via:felix42

Windows Media Center and Zune Integration Hack

2007 Nov 28, 1:23

One of the new Zune features that had me the most excited was the claimed improved Windows Media Center integration, which unfortunately turned out to simply mean support for the Win MCE video format (with an exception for HD). I wanted to be able to pick shows recorded by my Win MCE and have the Zune automatically sync up the latest episodes. However, with the improved podcast support in the Zune software one can easily create a ridiculous hack to accomplish this.

The new Zune software has podcast support which does everything I'd want to do with a Win MCE recorded TV series, so the goal is to shoehorn a TV series into a Zune podcast. An overview of the steps: create an XSLT that converts Win MCE data to a podcast, run the XSLT as a scheduled task every few hours per TV series, set up a Web server pointed at the resulting podcasts and the Win MCE Recorded TV directory, and subscribe to the resulting podcasts in the Zune software.
  1. Reading through the Win MCE data stored as an XML file in "C:\ProgramData\Microsoft\eHome\Recording\Recordings.xml" and the spec for podcasts I created an XSLT to convert a series from Win MCE data to a podcast.
  2. I added a new task to the Scheduled Tasks to run my XSLT using my xsltproc.js script. The task runs a handful of commands that look something like the following:

    C:\windows\system32\wscript.exe C:\users\dave\bin\xsltproc.js C:\Users\Dave\Documents\trunk\development\mce-zune\mce-to-podcast.xslt C:\ProgramData\Microsoft\eHome\Recording\Recordings.xml --param title "The Daily Show With Jon Stewart" --param max 4 --param baseURI "http://groucho/" --param thisRelURI "tds.xml" -o "D:\recorded tv\tds.xml"

    For each TV series I run a command like the above and that outputs a podcast for that series into my "D:\Recorded TV\" directory.
  3. Zune only allows http URIs for its podcasts so I installed a web server on my Win MCE server. I'm running Vista Ultimate so it was quick and easy for me to install IIS7 but any Web server will do. Then I pointed it at "D:\Recorded TV\".
  4. Once all the above was done I just subscribed to the resulting podcasts via my Web server and voilà! Since I'm forced to use a Web server I can even run the Zune software on a machine other than my Win MCE server. You can see a screen-shot above of my Zune software showing my Colbert Report podcast.
Tags: technical xml mce hack windows media center zune windows xslt podcast
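For anyone who would rather not write XSLT, the conversion step in the hack above can be sketched in Python. This is not the mce-to-podcast.xslt from the post and it does not parse Recordings.xml, whose schema isn't shown here. The Recording fields, the build_podcast name, and the default media type are all assumptions. Only the four parameters (title, max, baseURI, thisRelURI) mirror the command line in step 2.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from urllib.parse import quote, urljoin


@dataclass
class Recording:
    """Hypothetical stand-in for one entry pulled out of the Win MCE data."""
    series: str       # series title, e.g. "The Daily Show With Jon Stewart"
    episode: str      # episode title shown in the podcast list
    filename: str     # file name inside the Recorded TV directory
    recorded: str     # ISO 8601 timestamp, so plain string sorting works
    size_bytes: int   # file size, needed for the enclosure's length attribute


def build_podcast(recordings, title, max_items, base_uri, this_rel_uri,
                  media_type="application/octet-stream"):
    """Return an RSS 2.0 feed (as text) for the newest episodes of one series.

    media_type is a placeholder; use whatever type the player actually expects.
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = urljoin(base_uri, this_rel_uri)
    ET.SubElement(channel, "description").text = f"Recorded TV: {title}"

    # Keep only this series, newest first, capped at max_items.
    matching = sorted((r for r in recordings if r.series == title),
                      key=lambda r: r.recorded, reverse=True)
    for rec in matching[:max_items]:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = rec.episode
        ET.SubElement(item, "enclosure",
                      url=urljoin(base_uri, quote(rec.filename)),
                      length=str(rec.size_bytes),
                      type=media_type)
    return ET.tostring(rss, encoding="unicode")
```

Writing the returned text to a file such as "D:\recorded tv\tds.xml" on a schedule would play the same role as the scheduled xsltproc.js task.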

Zune.net's User Card Service

2007 Nov 19, 4:15

Replace the zunetag parameter in the URI with the desired Zune username to find the user's Zune social data such as Favorites, Recent Tracks, etc.

Tags: zune microsoft xml
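Swapping the zunetag parameter can be done by hand, but a small helper avoids quoting mistakes. This is a sketch only: the service URI in the usage note is a made-up placeholder, not the real endpoint, and with_zunetag is a name invented here.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_zunetag(uri: str, username: str) -> str:
    """Return `uri` with its zunetag query parameter set to `username`."""
    parts = urlsplit(uri)
    # Drop any existing zunetag value, keep every other parameter as-is.
    query = [(key, value)
             for key, value in parse_qsl(parts.query, keep_blank_values=True)
             if key != "zunetag"]
    query.append(("zunetag", username))
    return urlunsplit(parts._replace(query=urlencode(query)))
```

For example, with_zunetag("http://example.invalid/usercard?zunetag=old", "Some User") yields the same URI with zunetag=Some+User, with the username percent-encoded as a query value.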

Zune Software Update

2007 Nov 19, 3:47

I really appreciate that the first-gen Zunes get the new Zunes' firmware and software. I like the updated Zune software personally because it's faster and simpler, has better podcast support, and the whole social thing is on their website now. So, I guess I like the software because it has new features that should have been there in the first place.

The social thing is like a Zune social network. It uses your Xbox Live friends to seed your Zune friends list, lets you do the expected social network stuff, lets you preview songs, and, unlike first-gen Zunes, which required face-to-face time with other Zune owners, allows you to send songs to people. It also lets you display your recently played tracks and your favorite tracks, similar to what Last.FM has, via a Zune Card. I like the Zune Card from a technical perspective because it separates the Zune Card view, written in Flash, from the User Card data, which is in XML. I hope they intend to keep the XML available via this UserCard Service because I think there's potential to easily do cool things.

Tags: microsoft technical music zune social

Bunny Yawns

2007 Nov 13, 8:47
Cadbury yawns.
From: David Risney
Views: 1165
5 ratings
Time: 00:10
Tags: video

Bunny Hears Something

2007 Nov 13, 8:45
Cadbury hears something, slowly.
From: David Risney
Views: 94
1 rating
Time: 00:15
Tags: video

RFC 3548 The Base16, Base32, and Base64 Data Encodings

2007 Nov 5, 4:33

Syntax of base64

Tags: base64 encoding syntax reference rfc standard
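As a quick illustration of the three encodings the RFC covers, here is what Python's standard base64 module produces for a short input. The sample strings are chosen here for illustration; the point is the alphabets and the "=" padding.

```python
import base64

data = b"foobar"

# Base64: 6 bits per character, output padded with "=" to a multiple of 4.
b64 = base64.b64encode(data).decode("ascii")           # "Zm9vYmFy"
b64_padded = base64.b64encode(b"foob").decode("ascii")  # "Zm9vYg=="

# Base32: 5 bits per character, output padded to a multiple of 8.
b32 = base64.b32encode(data).decode("ascii")           # "MZXW6YTBOI======"

# Base16 is plain uppercase hex, two characters per byte.
b16 = base64.b16encode(data).decode("ascii")           # "666F6F626172"

# The URL and filename safe alphabet swaps "+" and "/" for "-" and "_".
standard = base64.b64encode(b"\xfb\xff").decode("ascii")          # "+/8="
url_safe = base64.urlsafe_b64encode(b"\xfb\xff").decode("ascii")  # "-_8="
```

Decoding is the mirror image, for example base64.b64decode("Zm9vYmFy") returns b"foobar".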

Many Eyes

2007 Oct 31, 10:41

Upload your data and visualize it in various interesting ways using Java applets from IBM.

Tags: visualization java applet ibm analysis graph

Google's timeline view

2007 Oct 30, 2:44

Google has new views of search data: graphed over time and over a map.

Tags: google ui view time timeline

Latest Earthquakes: Feeds & Data

2007 Oct 21, 5:52

Earthquake info from the USGS in RSS form.

Tags: data gis earthquake rss science usgs government

Hixie's Natural Log: A low-bandwidth, high-latency, high-cost, and unreliable data channel (Cached)

2007 Oct 16, 12:22

FTA: "Look at all those zero cents... there are data bits there, lying unused! It struck me that with every single restaurant transaction I could set the cents field to some number under my control, thus allowing me to communicate with myself at a l..."

Tags: blog humor hack food article encoding restaurant via:ericlaw
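To make the cents-field channel concrete, here is one way a message could be spread across transactions. This is an illustrative scheme made up here, not necessarily the encoding Hixie describes: the message is treated as one big number and written in base 100, so each transaction's cents value (00 to 99) carries one digit.

```python
def encode_cents(message: bytes) -> list[int]:
    """Turn a message into a list of cents values, one per transaction."""
    # A leading 0x01 marker byte keeps leading zero bytes from being lost.
    number = int.from_bytes(b"\x01" + message, "big")
    digits = []
    while number:
        number, cents = divmod(number, 100)
        digits.append(cents)
    return digits[::-1]  # most significant digit first


def decode_cents(cents_values: list[int]) -> bytes:
    """Rebuild the message from cents values in transaction order."""
    number = 0
    for cents in cents_values:
        if not 0 <= cents <= 99:
            raise ValueError(f"not a valid cents value: {cents}")
        number = number * 100 + cents
    raw = number.to_bytes((number.bit_length() + 7) // 8, "big")
    return raw[1:]  # strip the marker byte
```

Each transaction carries a little over six and a half bits this way, so even a short note takes many dinners, which fits the "low-bandwidth, high-latency, high-cost" billing in the title.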
Creative Commons License. Some rights reserved.