Data Life

Favio Vázquez

Data Scientist

@faviovaz

¿Quién soy?

  • Venezolano
  • Licenciado en Física e Ingeniero en Computación
  • Maestría en la PCF-UNAM (Cosmología)
  • Data Scientist (Auto-Proclamado)
  • Sr. Data Scientist @ Raken Data Group
  • Chief Data Scientist @ Iron
  • Profesor Afi Escuela y Escuela Bolsa Mexicana
  • Editor Instituto Internacional de Ciencia de Datos

¿Quién soy?

  • Profesor de Ciencia de Datos en Business Science
  • Muy activo en LinkedIn ;)
  • Editor International Journal of Business Analytics and Intelligence
  • Escritor en Towards Data Science, Heartbeat,  Becoming Human y Planeta Chatbot

Releases 1.3.0, 1.4.0, 1.4.1 1.5.0, 2.2.0 and 2.3.0

¿Quién soy?

Colaborador de Spark GitHub y StackOverflow

Alter Favio

Alter Favio

Alter Favio

Alter Favio

Alter Favio

What is Spark?

Is a fast and general engine for large-scale data processing.

Unified Engine

High level APIs with space for optimization 

  • Expresses all the workflow with a single API
  • Connects existing libraries and storage systems

My Spark Contributions

  • SPARK-7238. Upgrade protobuf-java (com.google.protobuf) version from 2.4.1 to 2.5.0
  • SPARK-7249. Updated Hadoop dependencies due to inconsistency in the versions
  • SPARK-7671. Fix wrong URLs in MLlib Data Types Documentation
  • SPARK-8274. Fix wrong URLs and Doc in MLlib Frequent Pattern Mining Documentation
  • SPARK-21976. Fix wrong doc about Mean Absolute Error

 

My Spark Contributions

  • SPARK-7249. Updated Hadoop dependencies due to inconsistency in the versions

 

Updated Hadoop dependencies due to inconsistency in the versions. Now the global properties are the ones used by the hadoop-2.2 profile, and the profile was set to empty but kept for backwards compatibility reasons.

Started Apr 29, 15

Merged May 14, 15

168 comments

8 people involved

20 commits

8 files changed

My (other) Spark Contributions

  • Mailing list.
  • https://stackoverflow.com/users/4984225/favio-vázquez
  • https://www.linkedin.com/in/faviovazquez/

 

Questions?

Favio Vázquez

Cosmologist & Data Scientist

@faviovaz

Made with Slides.com