Data Carpentry
Social Sciences and Humanities Using R

OpenRefine

Annika Rockenberger

Why OpenRefine?

  • Identify and amend messy data
  • Capture all actions applied to your data
  • Reverse any action
  • No modification of raw data
  • Apply tidy cleaning actions to other data sets
  • Local application, not cloud service
  • Graphical user interface

Features

  • Open Source with a BSD-3 license (redistribution and use in source and binary forms, with or without modification, are permitted)
  • Works with large files (100.000 rows)
  • Keeps data private unless you want to
    share it!

Today's Session

  1. Facets
  2. Data formats and transforming data formats
  3. Filtering and sorting
  4. Transforming data with GREL
    (General Refine Expression Language)
  5. Using scripts to apply operations on
    other files
  6. Exporting project and data

DC:SSH Open Refine Lesson

By Annika Rockenberger

DC:SSH Open Refine Lesson

Slides for the OpenRefine lesson during the Data Carpentry: Social Sciences and Humanities Using R workshop. University of Oslo, Digital Scholarship Center, Nov 24th & 25th, 2022.

  • 67