Library Carpentry: Introduction to Data: Data Intro Cheat Sheet

Key Points

Introduction to Library Carpentry
  • Don’t be scared to ask for help.

Jargon Busting
  • It helps to share what you know and don’t know about software development and data science jargon.

Foundations
  • Data structures should be consistent and predictable.

  • Consider adding semantic elements or data identifiers to the names of data directories.

  • Fit and adapt your data structure to your work.

  • Apply naming conventions to directories and file names to identify them, to create associations between data elements, and to assist with the long-term readability and comprehension of your data structures.
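
As a rough illustration of the naming-convention point, the sketch below builds file names from a date, a project label and a short description. The pattern (YYYY-MM-DD_project_description.ext) and the function name build_name are assumptions made for this example, not a convention prescribed by the lesson; adapt them to your own work.

```python
import re
from datetime import date


def build_name(day: date, project: str, description: str, ext: str) -> str:
    """Return a file name in a hypothetical YYYY-MM-DD_project_description.ext pattern."""

    def slug(text: str) -> str:
        # Lower-case the text and replace runs of other characters with a hyphen.
        return re.sub(r"[^a-z0-9]+", "-", text.lower()).strip("-")

    # ISO 8601 dates sort chronologically when file names are sorted alphabetically.
    return f"{day.isoformat()}_{slug(project)}_{slug(description)}.{ext.lstrip('.').lower()}"


print(build_name(date(2014, 4, 30), "Oral History", "Interview Notes", "txt"))
# prints 2014-04-30_oral-history_interview-notes.txt
```

Putting the date first and avoiding spaces keeps names predictable for both people and programs.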

Regular Expressions
Introduction to Data - Multiple Choice Quiz
  • Regular expressions reference guide

Introduction to Data - Multiple Choice Quiz (answers)
  • Regular expressions answer sheet

Multiple Choice Quiz
  • Regular expressions answers

Exercises
  • Regular expressions answers

Why automate?
  • The main reasons to automate or take a computational approach can guide you as you decide whether to automate a task.

File naming & formatting
  • Data structures should be consistent and predictable.

  • Consider adding semantic elements or data identifiers to the names of data directories.

  • Fit and adapt your data structure to your work.

  • Apply naming conventions to directories and file names to identify them, to create associations between data elements, and to assist with the long-term readability and comprehension of your data structures.
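
One way to keep names consistent and predictable is to check them against the convention with a regular expression. The sketch below is a minimal example that assumes a hypothetical YYYY-MM-DD_project_description.ext pattern; NAME_PATTERN, nonconforming and the sample file names are invented for illustration.

```python
import re

# Hypothetical convention: YYYY-MM-DD_project_description.ext, lower case, no spaces.
NAME_PATTERN = re.compile(r"\d{4}-\d{2}-\d{2}_[a-z0-9-]+_[a-z0-9-]+\.[a-z0-9]+")


def nonconforming(names):
    """Return the names that do not follow the assumed convention."""
    return [name for name in names if not NAME_PATTERN.fullmatch(name)]


sample = [
    "2014-04-30_oral-history_interview-notes.txt",
    "Final notes (2).DOCX",
    "2014-11-19_catalogue_export.csv",
]
print(nonconforming(sample))
# prints ['Final notes (2).DOCX']
```

The pattern only checks the shape of the name; it does not confirm that the date is a real calendar date.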

References

James Baker, “Preserving Your Research Data”, Programming Historian (30 April 2014), http://programminghistorian.org/lessons/preserving-your-research-data.html. The sub-sections ‘Plain text formats are your friend’ and ‘Naming files sensible things is good for you and for your computers’ are reworked from this lesson.

Owen Stephens, “Working with Data using OpenRefine”, Overdue Ideas (19 November 2014), http://www.meanboyfriend.com/overdue_ideas/2014/11/working-with-data-using-openrefine/. The section on ‘Regular Expressions’ is reworked from this lesson developed by Owen Stephens on behalf of the British Library.

Andromeda Yelton, “Coding for Librarians: Learning by Example”, Library Technology Reports 51:3 (April 2015), doi: 10.5860/ltr.51n3.

Fiona Tweedie, “Why Code?”, The Research Bazaar (October 2014), http://melbourne.resbaz.edu.au/post/95320810834/why-code.