Data.

dafaq.

Domain specific knowledge is more important than raw skill for developers.

Data analysis is the meta domain that will control the future.

Data analysis is now a core technology skill.

(   ergo   )

(   and   )

Software Developers

BLS

2016 Median Pay - $102,280 per year
  Number of Jobs, 2014 - 1.1 million

Job Outlook, 2014-24 - 17% (Much faster than average)

 

 

 

Statisticians

BLS

2016 Median Pay - $80,500 per year
  Number of Jobs, 2014 - 30,000

Job Outlook, 2014-24 - 34% (Much faster than average)

stitchdata.com/resources/reports/the-state-of-data-engineering

"  The number of data engineers more than doubled from 2013-2015.  "

"   42% of data engineers graduated from a Software Engineering role.   "

"   Today, there are 6,500 people on LinkedIn who call themselves data engineers.


In San Francisco alone, there are 6,600 job listings for this same title.    "

Data Engineer

at

 (    Cool Tech Company   )

SQL, Python, Java, Hadoop**, and Linux

 "A lot of companies separate the data platform team from the data science team. I took the job at Huffington Post because they gave me the authorization to remove that wall.

-
Maggie Xiong, Director of Data Engineering at The Huffington Post

"Think about the relationship between designers and front-end developers, One comes up with the ideas, the other implements. And it can cause a lot of tension."
-
Ryan Orban, Galvanize CTO

ML > NoSql

Um... cool, what now?

Mean, median, standard deviation.

Regressions / model complexity.

Random Forests, Neural Nets

Statistics for Hackers
https://www.youtube.com/watch?v=Iq9DzN6mvYA
--

 

Bayesians vs Frequentists

https://www.youtube.com/watch?v=KhAUfqhLakw
 

Intro to Deep Learning

https://www.youtube.com/watch?v=nCPf8zDJ0d0


Data Engineering and Data Science: Bridging the Gap
https://www.youtube.com/watch?v=-K9SjrWpeys

Python libs/tools

http://pandas.pydata.org/

https://www.scipy.org/

http://jupyter.org/

https://www.tensorflow.org/

Other

https://d3js.org/

https://www.npmjs.com/package/simple-statistics

https://www.npmjs.com/package/lumenize

https://www.scala-lang.org/

https://stackoverflow.com/questions/383920/what-is-the-best-library-for-java-to-grid-cluster-enable-your-application

Sources

https://www.bls.gov/ooh/computer-and-information-technology/home.htm

 

https://www.stitchdata.com/resources/reports/the-state-of-data-engineering/?thanks=true

 

Data Dork

By Vincent Buscarello

Data Dork

  • 448