PyMCon Events

Upcoming Events

Dynamic data are all around us. Changepoint models allow us to know when changes happen in these data and what they look like. Probabilistic modelling allows us to elegantly build customizable changepoint models for different data types, as well as provide us with uncertainty estimates for the position and magnitude of the change (both indispensable quantities for decision-making and hypothesis testing). This tutorial will briefly cover building changepoint models for multivariate data using PyMC but will primarily focus on the ways in which this “basic” model can be extended.

This tutorial is targeted towards academic researchers, data scientists, and anyone interested in being able to easily build bespoke models which provide uncertainty estimates for inferred statistics. This talk will attempt to be accessible to beginners but leans towards more intermediate users interested in changepoint modelling. Previous experience with PyMC, and a background in statistical modelling is assumed. No libraries other than PyMC and the basic scientific stack (numpy, scipy, matplotlib) will be used.

The tutorial aims to be hands-on, will discuss some theory to provide context for the models discussed, and will be heavy on understanding code to construct the “guts” of the models (in particular, selection of distributions for modelling the emissions and changepoint locations, and the details of the tensor manipulation to put everything together).

Abuzar Mahmood

Past Events

Causal analysis is rapidly gaining popularity, but why? Machine learning methods might help us predict what’s going to happen with great accuracy, but what’s the value of that if it doesn’t tell us what to do to achieve a desirable outcome? Without a causal understanding of the world, it’s often impossible to identify which actions lead to a desired outcome.

Causal analysis is often embedded in a frequentist framework, which comes with some well-documented baggage. In this talk, Thomas will present how we can super-charge PyMC for Bayesian Causal Analysis by using a powerful new feature: the do operator.

Thomas Wiecki

Async Talk Live Q&A Discourse Post

We are holding office hours to help YOU apply to PyMCon. Whether you have a finished talk and are looking for feedback, or just have an idea, we’ll help you get moving.

You can participate in two ways:

- For asynchronous discussions, join our discourse thread and share your amazing ideas: https://discourse.pymc.io/t/12689
- For synchronous discussions, join our Meetup event: https://www.meetup.com/pymc-online-meetup/events/295283022/

Ravin Kumar

Discourse Post

Inspired by the asset-pricing literature, the Soccer-Factor-Model (SFM) is an attempt to determine a soccer player’s alpha, i.e. his/her inherent skill. In this application, skill is defined as a player’s probability to score a goal after accounting for factors that are actually to be attributed to his team’s out- or under-performance vis-à-vis the opponent.

In that way, the SFM answers the question, of whether a player’s observed goals are an over- or understatement of his true skill.

Maximilian Goebel

Interview video Live Talk Discourse Post

Many people have heard of (or at least used) automatic differentiation, a set of routines that allow programs to compute partial derivatives from numerical functions. This innovation allowed a large class of users to benefit from gradient-based algorithms without having to understand or manually implement their own gradient functions.

PyMC grabbed this idea and applied it to a completely novel context: automatic probability. With it, we can automatically derive and evaluate probability expressions from user-specified random functions. Similar to automatic differentiation, this allows users to create rich generative models and exploit probability-based algorithms (e.g., NUTS) without having to manually implement their own probability functions.

In this talk, we present this new foundation of PyMC and how it can (and is already) used to construct specialized classes of models, from arbitrary variable transformations to mixture distributions, censoring processes, and highly structured timeseries.

The goal is to understand what makes PyMC a true probabilistic programming language and how you can exploit it!

Ricardo Vieira

Interview video Async Talk Live Q&A Discourse Post

May 24th 2023

Protecting Voting Rights With PyMC

An Overview of a Beta-Binomial Hierarchical Model and its Legal Implications
Online

Voting, elections, and democracy are hot topics. In the United States, one of the most important laws in this domain, the Voting Rights Act of 1965 (VRA), calls for fairness in the design of election systems so that minorities have an equal opportunity to participate. But elections are complicated phenomena. How do we know when that opportunity has been taken away?

This talk describes how that question is answered with a PyMC implementation of a beta-binomial hierarchical model. In a narrower legal context, the qualitative question of opportunity is inferred by the degree to which an electorate is polarized along racial lines. As the thinking goes, if a minority group has drastically different preferences than the majority, then the minority is exposed and vulnerable to partisan actors who might implement policy designed to weaken the political power of that group. Gerrymandering is a popular example.

The model produces parameter estimates that speak directly to this legal question. Designed in the early 2000s, the model has matured to the point that legal doctrine has coalesced around the quality of its estimates; it forms the backbone of a critically important civil rights law. The talk will discuss the Python implementation and how the posterior is interpreted to inform litigation decisions.

Todd Hendricks

Interview video Async Talk Live Q&A Discourse Post

April 17th 2023

The Bayesian Statistics Toolbox: Building a robust, replicable Bayesian workflow for the behavioral and neural sciences

Building a robust, replicable Bayesian workflow for the behavioral & neural sciences
Online

Are you seriously interested in Bayesian statistics but find yourself relying on familiar frequentist tools when it comes time to present your data at a conference or in a manuscript? If this statement rings true, you are not alone, and this talk is for you! In this presentation, I will go over some new tools we’ve developed to help make running Bayesian versions of some of the most common statistical tests used in the behavioral and neural sciences intuitive and transparent.

Hyosub Kim

Interview video Async Talk Live Q&A Discourse Post

PyMC has now multiple options to boost its performance (JAX support, training on GPUs, etc). The library is widely known for being easy to learn and for its great documentation, but it’s not always seen as a performant tool. The goal of the blog post is to present a benchmark where we can show that PyMC can work with large datasets and different approaches to do so. The blog post will be accompanied with reproducible code so that we can add/update metrics when there are substantial changes in PyMC or other libraries. Users will be able to compare their own models using the code provided in the blog repository.

Sandra Yojana Meneses

Interview video Event video Discourse Post

In this talk, Bill will introduce a PyMC Hilbert Space Gaussian Process (HSGP) implementation and show via case studies how it fills a few key gaps in the PyMC GP library: fast GPs as model subcomponents, and fast GPs with non-Gaussian likelihoods. I’ll also cover tips and tricks for applying HSGPs effectively in practice.

Bill Engels

Interview video Event video Discourse Post

The talk aims to get users quickly up and performing GPs, especially multi-output GPs using PyMC. Several examples with time-series datasets are used to illustrate different GPs features. This presentation will allow users to leverage GPs to analyze their data effectively.

Danh Phan

Interview video Event video Discourse Post

This talk will attempt to answer the question what is a Data Generating Process and why does it matter? While we will begin our discussion with a bit of theory, don’t worry about this being too technical or inaccessible if you’re new to Bayesian Statistics. Our primary goal is to focus on the second half of the question and give you tools to use for real-world applications.

Dante Gates

Interview video Event video Discourse Post

PyMCon Events

Upcoming Events

Nov 20th 2023

The Only Constant is Change

Past Events

Sept 28th 2023

Bayesian Causal Modeling

August 21st 2023

Let’s Work on Your Talk Idea

July 17th 2023

A Soccer-Factor-Model

June 28th 2023

Automatic Probability with PyMC

May 24th 2023

Protecting Voting Rights With PyMC

April 17th 2023

The Bayesian Statistics Toolbox: Building a robust, replicable Bayesian workflow for the behavioral and neural sciences

Mar 28 2023

Scalable Bayesian Modelling: A practical comparison

Mar 15 2023

HSGPs in PyMC: A fast Gaussian process approximation that you can actually use

Feb 21 2023, 22:00 UTC

An Introduction To Multi-Output Gaussian Processes Using PyMC

Feb 9 2023, 21:00 UTC (4pm ET)

The Power of Bayes in Industry: Your Business Model is Your Data Generating Process