uppgifter is Swedish for data.
It is also the title of a semester-long project I did for a class on Data Manipulation and Analysis. The first half of the course focused on the collection and manipulation of data. The topic was open and our three person team chose to start with data from the National UFO Reporting Center (NUFORC). What began as a lighthearted approach to a serious process ended with some very interesting results.
We wrote Python scripts to collect twelve years of structured and unstructured data from multiple sources. Data was scraped from NUFORC, downloaded from the Census Bureau and USGS, accessed using third-party APIs, and then munged and massaged into a final structure for analysis.
The second half of the course focused on the exploratory analysis of the data using R. Our analysis revealed basic statistics–UFO sightings are more likely during the summer months–as well as more compelling information–lower education and income levels did not predict UFO sightings.
See some pretty pictures below or read the full report generated from R.