Data: Big & Noisy!
http://www.komli.com/aunz/content/blog/big-data%E2%80%99s-argyle-principle
Tweet Sentiment Visualization Project: http://www.csc.ncsu.edu/faculty/healey/tweet_viz/tweet_app/
Reviewers thought Nate Silver said...
- Big data can solve everything
- There is no need to collect research data, just use what’s out there
- The future is predictable, if we have enough data
What he really said...
- Consider the limitations of Big Data
- Do not expect to predict everything using Big Data
- Combine insights from humans and machines
http://www.visioncritical.com/3-tips-big-data-nate-silver-s-signal-and-noise/
2012: HBR calls data scientist the "sexiest job of the 21st century"
2013: "Big Data" gets an entry in the OED
http://sils.unc.edu/sites/default/files/publications/Presentations/Kilgour_Lecture_SILS_Apr10_FINAL.pdf
Taylor, R. S. (1962), The process of asking questions. Amer. Doc., 13: 391–396. doi: 10.1002/asi.5090130405
http://www.gartner.com/newsroom/id/2575515
Scale
or size matters
- Books in the Library of Congress represent 235 terabytes of data
- A 1994 conference presentation made the first mention of the petabyte and said "the problems which confront the meteorologist today, will worry the arts and humanities in 10 years time."
- 1 petabyte ≈ 4 Libraries of Congress
- The Large Hadron Collider generates 1 petabyte of data every second
- The Square Kilometre Array will generate 4 petabytes of data every second
- 2013 = zettabyte (1 zettabyte = 250 billion DVDs)
- By 2020, the digital universe will equal 35 zettabytes (or 44 times as big as 2009)
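The figures in the list above can be sanity-checked with a little arithmetic. The sketch below assumes decimal (SI) units, which the slide does not specify, and the variable names are illustrative only.

```python
# Rough check of the scale figures, assuming decimal (SI) units.
TB = 10**12   # terabyte in bytes
PB = 10**15   # petabyte in bytes
ZB = 10**21   # zettabyte in bytes

# Library of Congress book collection, per the slide: 235 terabytes.
loc_bytes = 235 * TB

# How many Libraries of Congress fit in one petabyte?
locs_per_petabyte = PB / loc_bytes

# Implied capacity of one DVD if 1 zettabyte = 250 billion DVDs.
dvd_bytes = ZB // (250 * 10**9)

# Implied size of the 2009 digital universe if 35 ZB is 44 times as big.
implied_2009_zb = 35 / 44

print(f"Libraries of Congress per petabyte: {locs_per_petabyte:.2f}")
print(f"Implied DVD capacity: {dvd_bytes / 10**9:.0f} GB")
print(f"Implied 2009 digital universe: {implied_2009_zb:.2f} ZB")
```

Under these assumptions a petabyte holds a little over four Libraries of Congress, each DVD comes out at about 4 GB, and the 2009 digital universe at roughly 0.8 zettabytes, so the slide's round numbers are mutually consistent.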
Andrew Prescott: Big Data in the Arts and Humanities
http://www.cendi.gov/presentations/12_11_12_Wilde_Nature_of_Research.pdf
http://www.humanconnectomeproject.org/
https://www.sciencenews.org/article/big-data-studies-come-replication-challenges
http://spectrum.ieee.org/robotics/artificial-intelligence/machinelearning-maestro-michael-jordan-on-the-delusions-of-big-data-and-other-huge-engineering-efforts
Fast forward to 2015
https://chronicle.com/article/Big-Data-Big-Obstacles/151421/
https://www.sciencenews.org/article/big-data-studies-come-replication-challenges
http://www.gartner.com/newsroom/id/2819918
https://jakevdp.github.io/blog/2014/08/22/hacking-academia/
https://medium.com/thelist/moving-from-data-science-to-data-literacy-a2f181ba4167
Big Data
Humanities, Sciences, Librarians
Digital Humanities
DH Projects
- Textual Comparisons
- Textual Visualizations
- TEI
- Media
http://www-personal.umich.edu/~eklanche/digdemog/index.html
https://medium.com/the-physics-arxiv-blog/when-a-machine-learning-algorithm-studied-fine-art-paintings-it-saw-things-art-historians-had-never-b8e4e7bf7d3e
Ref: arxiv.org/abs/1408.3218 : Toward Automated Discovery of Artistic Influence
http://www.industrytap.com/big-data-the-hottest-sector-in-it/23697
Librarians add...
Value
Veracity
http://datapub.cdlib.org/
Data Management
Data and Librarians
Research Data Services
- Data Management and DMPs
- Teaching data best practices
- Institutional Repositories
- Open data
Introduction to data management
"Steal this idea: A library instructors' guide to educating students in data management skills" by Lisa Johnston and Jon Jeffryes, C&RL News, Sept 2014: 431
Managing your data from the University of Minnesota
https://www.lib.umn.edu/datamanagement
Create data that you and others can understand
http://maps.repository66.org/
DCC 2009
http://data.library.virginia.edu/data-management/
Research Data Management
- Data Management
- Data Curation
- Data Information Literacy
- Data Visualization
Data Visualization
http://cdnlarge.tableausoftware.com/sites/default/files/pages/the_5_most_influential_data_visualizations_of_all_time_0.pdf
Sources for learning
Duke Library LibGuide
http://guides.library.duke.edu/content.php?pid=355157&sid=2904817
Esri ArcGIS
LITA
Data Science - University of Washington
IA MOOC
https://www.insidehighered.com/views/2015/01/28/report-101-innovations-scholarly-communication
https://jakevdp.github.io/blog/2014/08/22/hacking-academia/
Presentation LSC874 Feb 2015
By kmhoffman56