Welcome

These are the materials for the one-day workshop “Building data science workflows in Python using Posit tools”, held on Monday, 18 September 2023 at posit::conf 2023!

In this Python-focused workshop, we will discuss ways to improve your data science workflows. During the course, we will review packages for data validation, alerting, modeling, and more. We’ll use Posit’s open source and professional tools to string all the pieces together for an efficient workflow. We’ll discuss environments, managing deployed content, working with databases, and interoperability across data products.

Is this workshop for me?

This workshop is for you if you…

  • Build finished data products starting from raw data and are looking to improve your workflow
  • Are looking to expand your knowledge of Posit open source and professional tools
  • Want to improve interoperability between data products in your work or on your team
  • Have experience developing in Python (an analogous course with an R focus is also offered)

Prework

This workshop requires that you bring your own laptop. Before the workshop, please create a Posit Connect account:

  • Visit https://connect.conf23workflows.training.posit.co.
  • Click the “Sign Up” button at the top right.
  • Sign up with your personal email.
  • Make your username the prefix of your personal email. For example:
    • Email: edwardes.s@gmail.com
    • Username: edwardes.s
  • Check your email to confirm your account. The email will come from “conf23workflows@training.rstudio.com”; check your junk folder if you don’t see it.

We will be using Discord as our main communication method! To make the process go smoothly:

  • Please sign up for an account if you don’t already have one.
  • Make sure your display name is the one you used to register for the conference.
  • In your “About Me,” put the name of your workshop: “Data Science Workflows with Posit Tools — Python Focus”

Closer to the start of the conference, we will invite you to the posit::conf Discord server. Once you’ve accepted the invite, we will add you to the channel(s) for your conf workshop(s).

If you have questions before the workshop, please ask in the GitHub Discussions section of our repository.

Schedule

Time           Activity
09:00 - 10:30  Session 1
10:30 - 11:00  Coffee break
11:00 - 12:30  Session 2
12:30 - 13:30  Lunch break
13:30 - 15:00  Session 3
15:00 - 15:30  Coffee break
15:30 - 17:00  Session 4

Instructors

Gagandeep Singh

Gagandeep Singh is a Sr. Solutions Engineer at Posit PBC based in Toronto, Canada. He is a former software engineer and data scientist who has worked in a variety of cross-technology teams before joining Posit.

Sam Edwardes

Sam Edwardes is a Solutions Engineer at Posit PBC based in Vancouver, British Columbia, Canada. As a Solutions Engineer, he helps customers effectively use Posit’s professional products with open source Python and R tools. He is passionate about Python, R, and all things open source!

Acknowledgments

This website, including the slides, is made with Quarto. Please submit an issue on the GitHub repo for this workshop if you find something that could be fixed or improved.

Notice

The sample data science project used for this workshop provides applications using data that has been modified for use from its original source, www.cityofchicago.org, the official website of the City of Chicago. The City of Chicago makes no claims as to the content, accuracy, timeliness, or completeness of any of the data provided at this site. The data provided at this site is subject to change at any time. It is understood that the data provided at this site is being used at one’s own risk.

Reuse and licensing