Public Data

Freedom of Information Act (FOIA) Request

Since 1967, the Freedom of Information Act (FOIA) has provided the public the right to request access to records from any federal agency.

 

Federal agencies are required to disclose any information requested under the FOIA unless it falls under one of nine exemptions which protect interests such as personal privacy, national security, and law enforcement.

FOIA Request Exemptions

Exemption 1: Information that is classified to protect national security.

Exemption 2: Information related solely to the internal personnel rules and practices of an agency.

Exemption 3: Information that is prohibited from disclosure by another federal law.

Exemption 4: Trade secrets or commercial or financial information that is confidential or privileged.

FOIA Request Exemptions

Exemption 5: Privileged communications within or between agencies, including:

  1. Deliberative Process Privilege
  2. Attorney-Work Product Privilege
  3. Attorney-Client Privilege

Exemption 6: Information that, if disclosed, would invade another individual's personal privacy.

FOIA Request Exemptions

Exemption 7: Information compiled for law enforcement purposes that:

  • 7(A). Could reasonably be expected to interfere with enforcement proceedings
  • 7(B). Would deprive a person of a right to a fair trial or an impartial adjudication
  • 7(C). Could reasonably be expected to constitute an unwarranted invasion of personal privacy
  • 7(D). Could reasonably be expected to disclose the identity of a confidential source
  • 7(E). Would disclose techniques and procedures for law enforcement investigations or prosecutions
  • 7(F). Could reasonably be expected to endanger the life or physical safety of any individual

Exemption 8: Information that concerns the supervision of financial institutions.

FOIA Request Exemptions

Exemption 9: Geological information on wells.

How to Make a FOIA Request

https://www.foia.gov/how-to.html

FOIA Request Example

@JasonLeopold

Sr investigative reporter @BuzzFeedNews investigations team. 

src: https://twitter.com/JasonLeopold/status/814504328362741760

Public Data

Data that is freely available for anyone to use and republish as they wish

 

eg Government (taxpayer-funded) data available in the public domain such as data.gov or Public Genome Data used in scientific research 

Private Data

Private, or protected data, that is privately owned by individuals or companies

Case Study 1

What's up with RealFeel® being a registered trademark?

Where does AccuWeather's Data come from?

Where does AccuWeather's Data come from?

The National Weather Service, which provides large amounts of the data that AccuWeather repackages and sells for profit, also provides that same information for free by placing it in the public domain:

National Weather Service GIS Weather Portal http://www.nws.noaa.gov/gis/

Why can't I find an official National Weather Service app?

S.786 - National Weather Service Duties Act of 2005

https://www.congress.gov/bill/109th-congress/senate-bill/00786

  1. Who sponsored the bill?
  2. What was the purpose of the bill?
  3. What would have been the end result of the bill?
  4. Why do you think the bill was written?
  5. Did the bill become law?

Further reading

https://www.washingtonpost.com/blogs/capital-weather-gang/post/national-weather-service-hold-on-mobile-apps-stirs-controversy/2012/01/08/gIQAjWKUlP_blog.html

http://www.nwseo.org/Four%20Winds/11_12_28_Sobien_Lubchenco_Letter.pdf

https://web.archive.org/web/20111020154327/http://newsmeat.com/ceo_political_donations/Joel_Myers.php

Case Study 2

Algorithm

An algorithm is process or set of rules to be followed in calculations or other problem-solving operations.

We must also ask: an algorithm built by whom, an algorithm designed by whom, an algorithm trained by whom?

Bias

Reading: When Algorithms Discriminate

https://www.nytimes.com/2015/07/10/upshot/when-algorithms-discriminate.html

Activity: Understanding Bias in Algorithms

  1. With a partner, choose and research an algorithm such as
    1. the algorithm used for credit scores based upon Facebook friends or the algorithm for Facebook's newsfeed
    2. an algorithm used to decide which ads to show you
    3. Netflix's recommendation algorithm
    4. Google's vs Bing's search algorithm
    5. a predictive policing algorithm
  2. Be able to answer the following questions:
    1. Is the algorithm biased?  What supporting evidence exists?
    2. Who wrote the algorithm?  Who was it written for?
    3. What are the unintended results of the algorithm, if any?
    4. Could you show the class the algorithm in action?

How Data is funded

How Government

Data is Funded & Collected

  • Contracts, Grants, Loans (https://www.usaspending.gov/)
  • Under the terms of the 2013 Federal Open Data Policy, newly-generated government data is required to be made available in open, machine-readable formats, while continuing to ensure privacy and security. (https://www.gsa.gov/portal/category/25729)

https://www.whitehouse.gov/sites/default/files/omb/memoranda/2013/m-13-13.pdf

(https://obamawhitehouse.archives.gov/ or https://archive.org/web/)

How Government

Data is Made Available

  • Wordpress: https://wordpress.com/
  • Open-source data portal platform: https://ckan.org/
  • data.gov source code: https://github.com/GSA/data.gov/

Data Design & UX

Web Pages

  • Static Content
  • Limited Interativity

Web Apps

  • Data-centric
  • Dynamic
  • User Actions
  • Flexibility

Designing for data is challenging

Sources of data

  • Big Data
    • Large machine-generated quantitative data sets
    • Site analytics, shopping history, social media activity, and website click maps
  • Thick Data
    • Qualitative data
    • Insights from user interviews and focus groups, and quantitative data like online surveys

Data as storytelling device

  • Crafting a compelling narrative
    • Help users find relevant patterns and trends
    • Steer users through large amounts of information
  • Prioritizing which data is shown
    • Reducing information to only shown what is most relevant rather than showing everything which results in fighting for the user’s attention

Know Your Users

  • Determine who your target audience is
  • Know and trust their abilities
  • Establish the right level of complexity for your target audience
  • Design for both returning visitors​ and first timers

Tips from a User Experience Strategist

  • 1. Talk to users until you're bored
  • 2. Obsess over efficiency of frequent use cases
  • 3. Solve first-time use last, and test it hard

https://storify.com/miniver/why-you-should-save-ux-design-for-first-time-use-f

Establish Clear Paths to Discovery

  • Overview
  • Zoom
  • Filter
  • Details on Demand
  • Relate
  • History
  • Extract

Visual Information Seeking Mantra

Overview first, zoom and filter, then details-on-demand

Data Visualization

Designing for Visualizations: Best Practices

  1. Use fewer than 6 colors
  2. Use opposing colors for comparisons
  3. Use opposing cool colors for backgrounds, warm colors for data
  4. Always include a title and label your axes (context is important)
  5. Legends should be larger and not rely on color alone
  6. Use as many tick marks as necessary without losing readibility (fall back to zoom + tooltips if more discrete ticks are necessary

5 Categories of Organizing Information

  • Chronologically
  • Alphabetically
  • Geographically
  • Categorically
  • Hierarchically

Infographics

  1. Chronological narratives
  2. Narratives that require simple illustrations or multiple charts to tell the full story
  3. Cohesiveness in color 
  4. Particular attention to hierarchy of text and each element
  5. Consistency in color and style

Visualizations

  • Pie chart
  • Donut chart
  • Line graph
  • Bar graph
  • Scatter plot
  • Tree map
  • Node diagram
  • Area
  • Heat map
  • Bubble
  • Force-directed graph

http://peoplemaps.org/

http://davetroy.com/posts/the-math-behind-peoplemaps

Which Visualization Type Works Best for the Data at Hand?

Project (due march 20)

  1. Choose a dataset (suggested datasets available on the class site)
  2. Export as .json (or tab-separated values .tsv / comma-separated values .csv if necessary) (Note: To look at the data so you can design from it, you can import a CSV into excel/numbers and create rough visualizations as a reference)

Project (due march 20)

Design a 1-page Data Visualization/Infographic Site

  1. One visualization/infographic based upon the dataset of your choosing
  2. Compelling title and ~250 words narrating the data (do not need to be contiguous: think labels/legend/captions/tooltips/button text/explanatory welcome message).
  3. Separate, linked page with 250-500 word methodology behind infographic (see example)
  4. What is timely?  Provocative?  Controversial? What do others need enlightening on in 2017?
  5. Bonus: Interactivity

Public Data

By brigittewarner

Public Data

  • 1,205