Goodwill for anybody.
RaftPeople, as I write this, it is 2008. Costs for simple, even big, databases, are dirt cheap. The cost to write a program to post data on a web site periodically is dirt cheap. The cost to administer a web site is dirt cheap. Web site technology is pretty mature and data access is easy and dirt cheap. Every two-bit company in the world has a friggin’ web site, it’s so friggin’ cheap. Putting data on the web for public access is such a trivial exercise that hobbyists do it on weekends just for the fun of it even if the data is of limited use to the average Joe.
I myself have about 10GB of video posted to my $100/year website. Since I get 25GB of storage for that price, the 10GB is essentially free. I could post another 10GB tomorrow with no increase in cost. And I only use 5% of my allocated maximum monthly bandwidth for customer downloads.
Let’s give you an example. My county has a GPS website with aerial photos and maps of the entire area keyed to tax data. For the entire county, every land parcel, every owner, every inch. It is kept current within 2 months. Since it has graphics (the photos and maps), it is a little larger than a text-only database would be. It is available to the world, free, and is maintained as a part-time job by one dude in the IT department. It is hosted on the government servers. My WAG is the storage is around 10GB or less. (The complete tax data alone, uncompressed, is only 20MB. I know because I have a copy of the entire database in my home computer.)
I honestly don’t know what the cost is to the County (it was a software package they purchased), but our County is notoriously tight with money (good thing, it comes from the taxpayers) and this software was around for many years before they jumped on it. They are certainly not early adopters with money to throw around.
Why do they do this? For the convenience of anyone who needs or wants to use the data. They don’t care who. There is no cost to them to prepare the tax database, as it is maintained for tax reasons already. The photos are seriously downgraded versions of a project done 10 years ago for primarily geological reasons. The maps are part of geographic records that have been converted to digital. Nothing is posted to the web site that isn’t already part of a database somewhere else.
And it reduces the telephone inquiries, since all the data is available online. We Realtors used to make frequent calls to the Real Property Listing Dept. or even pay hundreds of dollars per month for access to a clunky, teletype, non-graphic database. No more. And the trend is towards more online data…they are scanning surveys and deeds into the system and gradually making those available, although there might be some charges for those because the state mandates it.
I doubt if 1% of the county’s taxpayers even know this site exists, let alone actually use it (it’s not good thru dialup, which most residents have). Yet the county pays for it because the cost/benefit ratio is so advantageous.
I haven’t tried every county in every state, not even in this one, but when I have occasion to lookup similar data elsewhere, it seems other governments are doing much the same thing. We’re not unique, but typical.
So I see no reason why it can’t be done for a UPC number vs. description project. Heck, give me the 600,000 records in CSV form and I’ll post it to a corner of my website in the next few hours as HTML. I’m sure somebody will find it useful.