Pages tagged datasets:

Amazon Web Services Blog: New AWS Public Data Sets - Economics, DBpedia, Freebase, and Wikipedia
http://aws.typepad.com/aws/2009/02/new-aws-public-data-sets-economics-dbpedia-freebase-and-wikipedia.html

We have just released four additional AWS public data sets, and have updated another one. In the Economics category, we have added a set of transportation databases from the US Bureau of Transportation Statistics. Data and statistics are provided for aviation, maritime, highway, transit, rail, pipeline, bike & pedestrian, and other modes of transportation, all in CSV format. I was able to locate employment data for our hometown airline and found out that they employed 9,322 full-time and 1,122 part-time employees as of the end of 2007. In the Encyclopedic category, we have added access to the DBpedia Knowledge Base, the Freebase Data Dump, and the Wikipedia Extraction, or WEX.
amazon
Data Store: Facts you can use | Data Store | guardian.co.uk
http://www.guardian.co.uk/data-store
The data store for the guardian newspaper
datasets
Apps for America 2: The Data.gov Challenge
http://sunlightlabs.com/contests/appsforamerica2/
Apps for America is a special contest we're putting on this year to celebrate the release of Data.gov! We're doing it alongside Google, O'Reilly Media, and TechWeb and the winners will be announced at the Gov 2.0 Expo Showcase in Washington, DC at the end of the Summer. Why we're doing it Just as the federal government begins to provide data in Web developer-friendly formats, we're organizing Apps for America 2: The Data.gov Challenge to demonstrate that when government makes data available, it makes itself more accountable and creates more trust and opportunity in its actions. The contest submissions will also show the creativity of developers in designing compelling applications that provide easy access and understanding for the public, while also showing how open data can save the government tens of millions of dollars by engaging the development community in application development at far cheaper rates than traditional government contractors.
DataSF - DataSF - Liberating City Data
http://www.datasf.org/
Why can't every city have this?
City of SF opens site containing datasets
"DataSF is a clearinghouse of datasets available from the City & County of San Francisco. While there is plenty of room for improvement, our goal in releasing this site is: 1) improve access to data, 2) help our community create innovative apps, 3) understand what datasets you'd like to see, 4) get feedback on the quality of our datasets."
"DataSF is a clearinghouse of datasets available from the City & County of San Francisco. While there is plenty of room for improvement, our goal in releasing this site is: (1) improve access to data (2) help our community create innovative apps (3) understand what datasets you'd like to see (4) get feedback on the quality of our datasets."
AggData | AggData
http://www.aggdata.com/
The goal of AggData is to play a small part in making this sought-out data more accessible, portable and reliable.
great source for aggregated data
AggData is short for aggregate data, which means a set of data that is collected together in one place. On this site, the AggData will come in the form of a list of records, where each record has details about a specific object in the group.
data aggregated by web scraping
another free data library.
30 Resources to Find the Data You Need | FlowingData
http://flowingdata.com/2009/10/01/30-resources-to-find-the-data-you-need/
Let's say you have this idea for a visualization or application, or you're just curious about some trend. But you have a problem. You can't find the data, and without the data, you can't even start. This is a guide and a list of sources for where you can find that data you're looking for. There's a lot out there. Universities Being a graduate student, I always look to the library for books and resources. Many libraries are amping up their technology and have some expansive data archives. Many statistics departments also tend to keep a list of data somewhere.
data.australia.gov.au – beta
http://data.australia.gov.au/
data.australia.gov.au is the home of Australian government public information datasets. We encourage you to make government information even more useful by mashing-up the data to create something new and exciting! Make sure you pay attention to the licence attached to the datasets you are interested in using.
data.australia.gov.au is the home of Australian government public information datasets. Like Data.gov, it has a wide variety of downloadable government data on topics such as crime, weather, and public lands--as well as some very Australian topics, such as the location and attributes of barbecues on public lands.
the home of Australian government public information datasets. We encourage you to make government information even more useful by mashing-up the data to create something new and exciting! Make sure you pay attention to the licence attached to the datasets you are interested in using. Each licence should make clear what you can and can’t do with the data. If you’re unsure, please contact the contributing agency.
data.australia.gov.au is the home of Australian government public information datasets. We encourage you to make government information even more useful by mashing-up the data to create something new and exciting! Make sure you pay attention to the licence attached to the datasets you are interested in using. Each licence should make clear what you can and can’t do with the data. If you’re unsure, please contact the contributing agency.
New York Times - Linked Open Data
http://data.nytimes.com/
For the last 150 years, The New York Times has maintained one of the most authoritative news vocabularies ever developed. In 2009, we began to publish this vocabulary as linked open data. The Data The New York Times has published 5,000 people subject headings as linked open data under a CC BY license. We provide both RDF documents and a human-friendly HTML versions.
People subject headings for New York Times
data.nytimes.com For the last 150 years, The New York Times has maintained one of the most authoritative news vocabularies ever developed. In 2009, we began to publish this vocabulary as linked open data. The Data The New York Times has published 5,000 people subject headings as linked open data under a CC BY license. We provide both RDF documents and a human-friendly HTML versions.
The New York Times has published 5,000 people subject headings as linked open data under a CC BY license. We provide both RDF documents and a human-friendly HTML versions.
data.nytimes.com For the last 150 years, The New York Times has maintained one of the most authoritative news vocabularies ever developed. In 2009, we began to publish this vocabulary as linked open data. The Data The New York Times has published 5,000 people subject headings as linked open data under a CC BY license. We provide both RDF documents and a human-friendly HTML versions.
pskomoroch's dataset Bookmarks on Delicious
http://delicious.com/pskomoroch/dataset
Resource list of public datasets
Data Sets | GroupLens Research
http://www.grouplens.org/taxonomy/term/14
Unlocking innovation | data.gov.uk
http://data.gov.uk/
UK government stats online
UK government opens up its data - using Drupal!
Google - public data
http://www.google.com/publicdata/directory
Limited data but nice and clear interface
Datasets and visualization
The Google Public Data Explorer makes large datasets easy to explore, visualize and communicate. As the charts and maps animate over time, the changes in the world become easier to understand. You don't have to be a data expert to navigate between different views, make your own comparisons, and share your findings.
Graficación animada desde diferentes fuentes de info pública
Stack Overflow Creative Commons Data Dump - Blog - Stack Overflow
http://blog.stackoverflow.com/2009/06/stack-overflow-creative-commons-data-dump/
Awesome, Stack Overflow release all of their public web data under a CC license.