title | author | date | geometry | output |
---|---|---|---|---|
Advanced scientific computing: producing better code |
Bio 5989, Washington University in St. Louis |
Sep 2, 2021 |
margin=1in |
pdf_document |
Many scientists are self-taught coders and struggle to learn the skills that best support collaboration, rigor and reproducibility, good design, extensibility, and wide community adoption. Like so many other aspects of being a scientist, these are all skills that can be taught and learned via practice.
This nanocourse focuses primarily on tools and habits that make it easier to succeed in writing robust, flexible, and widely-adopted code. Process-oriented topics include Git and GitHub, test-driven development, the workflow of continuous integration, and documentation and software releases. More conceptual topics include principles of good design and the mechanisms for writing high-performance code. Students will be introduced to all these issues via the Julia programming language, although some lessons will apply broadly.
This course is aimed at people who are comfortable in at least one programming language; it is not suitable for programming beginners. Participants should plan to allocate several hours each week for homework assignments.
All sessions will run on Mondays and Wednesdays from 10am-11:30am. There is a two week gap between the first session and the next to allow participants some time to start getting comfortable with Julia.
- Oct. 4: Introduction to the course: "why Julia?" and a brief tour: lecture materials, video, and homework1 and homework2. The two week gap between the first lecture and the remainder of the course is to allow participants to learn and become comfortable with Julia.
- Oct. 18: Open source culture, Julia packages, git, and GitHub: lecture materials, video, and homework.
- Oct. 20: Testing & principles of design: lecture materials, video, and homework.
- Oct. 25: Continuous integration, documentation, package versioning, and releases: lecture materials, video, and homework
- Oct. 27: High performance computing on your laptop I: inference, compilation, and performance measurement: lecture materials, video, and homework as a "template repository" for GitHub Classroom.
- Nov. 3: High performance computing on your laptop II: algorithms, memory, and parallelism: lecture materials and video (see the previous session for applicable homework).
Each lecture is coupled to a homework assignment, which is due at the beginning of the next class (10am). The final lecture's homework will be due Nov. 8, 10am. Students taking the course for credit must complete at least 4 of the 6 and do reasonably well. Depending on class size, either the instructor, assistants, auto-, or peer-grading will be used. If instructors/assistants grade assignments, it will be only for enrolled students.
As is presumably obvious, people auditing the class are not required to do the homeworks. But if you are not already familiar with the content and want to learn, the assignments are highly recommended. It is very hard to learn these things "passively."
It is likely that we'll use a system such as Github classroom, but at present this is still under exploration.