data:image/s3,"s3://crabby-images/24c1d/24c1d5c8cb9f9b23a868a8c34a86739b82233062" alt=""
How to share nicely!
Using Git, GitHub and DOIs to make your work reproducible
Ann Gledson, Research Software Engineer
The University of Manchester
data:image/s3,"s3://crabby-images/48a1c/48a1cc24741d02637e85ed8d52cfec873199f91f" alt=""
data:image/s3,"s3://crabby-images/24c1d/24c1d5c8cb9f9b23a868a8c34a86739b82233062" alt=""
Overview
data:image/s3,"s3://crabby-images/48a1c/48a1cc24741d02637e85ed8d52cfec873199f91f" alt=""
- Git / Github
- Gitflow workflow
- Structuring your repository to be sharable
-
Versioning / release process
- DOI: Link Github repo release with Zenodo
Git vs GitHub
- Git
- Distributed version control system
- Installed locally
- High quality version control system
- Excellent branching model
- Tags (snapshots of code for version release)
- GitHub (or GitLab, Bitbucket)
- Repository hosting service for Git
- Cloud-based
- Share code with others
- Resolve conflicts
- Releases (based on Git tags)
Basic use (using git and remote)
- Create a "repository" (project) with a git hosting tool (GitHub/Bitbucket)
- Copy (or clone) the repository to your local machine
- Add a file to your local repo and "commit" (save) the changes
- "Push" your changes to your 'main' branch
- Make a (remote) change to your file with GitHub/Bitbucket and commit
- "Pull" the changes to your local machine
- Create a "branch" (version), make a change, commit the change
- Open a "pull request" (propose changes to the 'main' branch)
- "Merge" your branch to the 'main' branch
Basic use: Local and Remote
data:image/s3,"s3://crabby-images/8ca3b/8ca3bb46cc7d1e81585ff295edddddae5e01dc09" alt=""
Git - from local perspective
data:image/s3,"s3://crabby-images/681b1/681b1c49922f28017e2c37d876c9db42def7b8c6" alt=""
Git / Github
- Great tutorial: http://gcapes.github.io/git-course/
data:image/s3,"s3://crabby-images/10a28/10a2885814992ce3e0c176d215a7e25595e4b04e" alt=""
Gitflow Workflow
Gitflow branch types
- 'Main' (aka 'master') branch is public release code – each version on master is tagged with a semantic version number (https://semver.org/). Code in master works.
- Release branch (if present) is temporary. Preparation for version release. (Or use develop)
- Commits on develop are stable and should work but some unanticipated unexpected behaviour may be present.
- Feature branches are temporary*. Commits on these may be broken/unstable.
Making your code shareable
data:image/s3,"s3://crabby-images/375f0/375f05f40f807ecfc6bf809e3b125fccc47bb303" alt=""
- Tailor-made workflow works well if:
- Contributors are working in the same context, same goals
- Communicate frequently
- Or each individual is working on their own branch/version
data:image/s3,"s3://crabby-images/7ecd8/7ecd87cca92af6a8d374bbe61eb6432fcf61b277" alt=""
data:image/s3,"s3://crabby-images/cddb9/cddb9515611929c90f69d02e9c4c1215009e2699" alt=""
data:image/s3,"s3://crabby-images/70a2f/70a2f5c34f900e89314869c425241748ded97678" alt=""
- Falls down if:
- Code shared with external users
- Change in context / purpose
- Package users expect a single, main branch
- Extensibility logic is in the code, not branches
- Versions: whole code, working snapshots
- Data analysis: build in parameterisation
data:image/s3,"s3://crabby-images/8934e/8934e26469caa1d031c81cd6c64223bcbd1e6933" alt=""
data:image/s3,"s3://crabby-images/c8596/c85960f5c9e7f28a1794e61fca2e07f88f416655" alt=""
data:image/s3,"s3://crabby-images/1ba9f/1ba9f25e45a59ce3bc9cb0dfad8e2020b8da1f77" alt=""
data:image/s3,"s3://crabby-images/92e4a/92e4a140c0a5428602419183924f86bed1d269cd" alt=""
data:image/s3,"s3://crabby-images/7a7cf/7a7cf0ceb85ccf5049e02dd99c68fd381f69123a" alt=""
data:image/s3,"s3://crabby-images/84db0/84db012b64c325286d208053e53c0ab752ac9c39" alt=""
data:image/s3,"s3://crabby-images/38355/38355d7df193cf585d5388a9d8a9a69069010318" alt=""
data:image/s3,"s3://crabby-images/896b5/896b5020a2342001d159f494097f57a5f7da8091" alt=""
- Why create releases?:
- code updating / improving
- external dependancies
- planned and tested snapshots / versions
- clearly ordered labels: orientate users
- Data analysis: The code that created a particular set of results.
- GitHub facilitates this...
Gitflow releases
GitHub releases
Gitflow release process
- Test run on develop branch – before making a release, make sure that all tests are passing
- (Can be run as a Continuous integration (CI) job
see: GitHub 'Actions')
- (Can be run as a Continuous integration (CI) job
- (optional) Create an issue that describes what changes have been made since last release version. Make sure everyone agrees that behaviour is as expected. Link to passing tests.
- Merge develop into master and tag with the version number.
Version numbering
- Version number format: MAJOR.MINOR.PATCH
- Increment:
- MAJOR version: when you make incompatible API changes
- MINOR version: when you add functionality in a backwards compatible manner
- PATCH version: when you make backwards compatible bug fixes
Sharing your research: DOI
Linking GitHub version to DOI
- https://guides.github.com/activities/citable-code/
Links
-
Git/GitHub Tutorial: http://gcapes.github.io/git-course/
-
Gitflow Workflow: https://www.atlassian.com/git/tutorials/comparing-workflows/gitflow-workflow
-
Semantic version numbers: https://semver.org/
-
Zenodo (DOIs) https://zenodo.org/
-
Linking GitHub to DOI: https://guides.github.com/activities/citable-code/
-
GitHub Actions: https://docs.github.com/en/actions
Git-workflow-and-sharing
By Ann Gledson
Git-workflow-and-sharing
- 1,241