Past Events

How much can one learn a PDE from its solution?

Data Science Seminar

Yimin Zhong (Auburn University)

Abstract

In this work we study a few basic questions for PDE learning from observed solution data. Using various types of PDEs, we show 1) how the approximate dimension (richness) of the data space spanned by all snapshots along a solution trajectory depends on the differential operator and initial data, and 2) identifiability of a differential operator from solution data on local patches. Then we propose a consistent and sparse local regression method (CaSLR) for general PDE identification. Our method is data driven and requires minimal amount of local measurements in space and time from a single solution trajectory by enforcing global consistency and sparsity.

Navigating Interdisciplinary Research as a Mathematician

Industrial Problems Seminar

Julie Mitchell (Oak Ridge National Laboratory)

Abstract

Being effective in industrial and team science settings requires the ability to work across disciplines. In this talk, I will reflect on how to be successful working across disciplines and what types of opportunities exist for mathematicians working at national laboratories. I will also reflect on past projects I’ve pursued, which include high-performance computing and machine learning approaches to the understanding of macromolecular structure and binding.

Exploiting geometric structure in matrix-valued optimization

Data Science Seminar

Melanie Weber (Harvard University)

Abstract

Matrix-valued optimization tasks arise in many machine learning applications. Often, exploiting non-Euclidean structure in such problems can give rise to algorithms that are computationally superior to standard nonlinear programming approaches. In this talk, we consider the problem of optimizing a function on a (Riemannian) manifold subject to convex constraints. Several classical problems can be phrased as constrained optimization on matrix manifolds. This includes barycenter problems, as well as the computation of Brascamp-Lieb constants. The latter is of central importance in many areas of mathematics and computer science through connections to maximum likelihood estimators in Gaussian models, Tyler’s M-estimator of scatter matrices and operator scaling. We introduce Riemannian Frank-Wolfe methods, a class of first-order methods for solving constrained optimization problems on manifolds and present a global, non-asymptotic convergence analysis. We further discuss a class of CCCP-style algorithms for Riemannian “difference of convex” functions and explore connections to constrained optimization. We complement our discussion with applications to the two problems described above. Based on joint work with Suvrit Sra.

What makes an algorithm industrial strength?

Industrial Problems Seminar

Thomas Grandine (University of Washington)

Abstract

In this talk, I will discuss the details of two algorithms for parametrizing planar curves in an industrial design context. The first algorithm, developed in an academic setting by world class researchers, solves the problem posed by the researchers in a very satisfying and elegant way. Yet that algorithm, elegant though it may be, turns out to be ineffective in a real world engineering environment. The second algorithm is an extension of the first that eliminates the issues that caused it to be inadequate for industrial use.

Information Gamma calculus: Convexity analysis for stochastic differential equations

Data Science Seminar

Wuchen Li (University of South Carolina)

Abstract

We study the Lyapunov convergence analysis for degenerate and non-reversible stochastic differential equations (SDEs). We apply the Lyapunov method to the Fokker-Planck equation, in which the Lyapunov functional is chosen as a weighted relative Fisher information functional. We derive a structure condition and formulate the Lypapunov constant explicitly. Given the positive Lypapunov constant, we prove the exponential convergence result for the probability density function towards its invariant distribution in the L1 norm. Several examples are presented: underdamped Langevin dynamics with variable diffusion matrices, quantum SDEs in Lie groups (Heisenberg group, displacement group, and Martinet sub-Riemannian structure), three oscillator chain models with nearest-neighbor couplings, and underdamped mean field Langevin dynamics (weakly self-consistent Vlasov-Fokker-Planck equations).

Sampling diffusion models in the era of generative AI

Industrial Problems Seminar

Morteza Mardani (NVIDIA Corporation)

Abstract

In the rapidly evolving landscape of AI, a transformative shift from content retrieval to content generation is underway. Central to this transformation are diffusion models, wielding remarkable power in visual data generation. My talk touches upon the nexus of generative AI and NVIDIA's influential role therein. I will then navigate through diffusion models, elucidating how they establish the bedrock for leveraging foundational models. An important question arises: how to integrate the rich prior of foundation models in a plug-and-play fashion for solving downstream tasks such as inverse problems and parametric models? Through the lens of variational sampling, I present an optimization framework for sampling diffusion models that only needs diffusion score evaluation. Not only does it provide controllable generation, but the framework also establishes a connection with the well-known regularization by denoising (RED) framework, unveiling its extensive implications for text-to-image/3D generation.

Computable Phenotypes for Long-COVID in EHR data

Industrial Problems Seminar

Miles Crosskey (CoVar Applied Technologies)

Abstract

Long COVID, a condition characterized by persistent symptoms following COVID-19 infection, poses challenges in identification due to its diverse manifestations and novelty. Leveraging the N3C Enclave's electronic health record (EHR) data, we devised a machine learning method to construct a computable phenotype for Long COVID. This approach enables the identification of individuals with this condition through EHR data. Our model demonstrates a sensitivity of 72.7% and a specificity of 96.3%, maintaining consistent performance on held-out sites. This technique contributes to a better understanding of Long COVID's prevalence and impact.

Large data limit of the MBO scheme for data clustering

Data Science Seminar

Jona Lelmi (University of California, Los Angeles)

Abstract

The MBO scheme is a highly performant scheme used for data clustering. Given some data, one constructs the similarity graph associated to the data points. The goal is to split the data into meaningful clusters. The algorithm produces the clusters by alternating between diffusion on the graph and pointwise thresholding. In this talk I will present the first theoretical studies of the scheme in the large data limit. We will see how the final state of the algorithm is asymptotically related to minimal surfaces in the data manifold and how the dynamics of the scheme is asymptotically related to the trajectory of steepest descent for surfaces, which is mean curvature flow. The tools employed are variational methods and viscosity solutions techniques. Based on joint work with Tim Laux (U Bonn).

Scalable AI for autonomous driving and robotics

Industrial Problems Seminar

Michael Viscardi (Helm.ai)

Abstract

Helm.ai develops scalable AI software for autonomous driving, robotics, and other applications. This talk will give an overview of our technology and results.

Math-to-Industry Boot Camp VIII

Organizers:

Thomas Hoft, University of St. Thomas
Daniel Spirn, University of Minnesota, Twin Cities

The Math-to-Industry Boot Camp is an intense six-week session designed to provide graduate students with training and experience that is valuable for employment outside of academia. The program is targeted at Ph.D. students in pure and applied mathematics. The boot camp consists of courses in the basics of programming, data analysis, and mathematical modeling. Students work in teams on projects and are provided with training in resume and interview preparation as well as teamwork.

There are two group projects during the session: a small-scale project designed to introduce the concept of solving open-ended problems and working in teams, and a "capstone project" that is posed by industrial scientists. Recent industrial sponsors included Cargill, Securian Financial and CH Robinson. Weekly seminars by speakers from many industry sectors provide the students with opportunities to learn about a variety of possible future careers.

Capstone Projects

Evaluating the real-world safety and robustness of deep learning models

Charles Godfrey, Pacific Northwest National Laboratory
Henry Kvinge, Pacific Northwest National Laboratory

Team: Kean Fallon, Iowa State University; Aidan Lorenz, Vanderbilt University; Jessie Loucks-Tavitas, University of Washington; Sandra Annie Tsiorintsoa, Clemson University; Benjamin Warren, Texas A&M University

Abstract: Deep learning has shown remarkable capabilities in a range of important tasks, but at the same time has also been shown to be brittle in many ways, especially in real-world deployed environments. This can range from a language model that can be triggered to insult users to a vision model that doesn’t recognize machine parts in certain lighting conditions. Understanding how a model will behave at deployment is a serious problem in real-world AI and requires a mixture of mathematical and out-of-the-box thinking. In this capstone project, participants will be asked to evaluate models delivered to a client by 3rd party vendors from the perspective of overall robustness. This will include (i) evaluating how well a model performs outside of its test set and (ii) particular failure modes of the model that should be avoided. The final project deliverable will include a short report recommending for or against proceeding with use of the model in the client’s application.

Interpolating the Implied Volatility Surface

Chris Bemis, X Cubed Capital Management

Team: Qinying Chen, University of Delaware; Nellie Garcia, University of Minnesota; Emily Gullerud, University of Minnesota; Shaoyu Huang, University of Pittsburgh; Pascal Kingsley (PK) Kataboh, University of Delaware; Matthew Williams, Colorado State University

Abstract: Financial markets price volatility in underlying securities primarily through what are called options. These options are defined by reference to their payoffs and the date at which they expire, along with other features such as prevailing rates, the underlying security price, and so on. The result is that markets reference a surface of implied volatilities based on market prices.

In this project, we will fit such surfaces in financially meaningful ways; especially focusing on the preclusion of arbitrage opportunities in the resulting interpolation. These methods are critical in creating assessments of constant expiry volatility time series amongst many other applications. They also sometimes suffer from a lack of stability in parameter estimation as new surfaces are fit.

We will use real (and noisy) data with the goal of efficiently creating stable volatility surface interpolations and time series of constant expiry volatility.

Multimodal Search in eCommerce

Christopher Miller, eBay

Team: Tanuj Gupta, Texas A&M University; Meagan Kenney, University of Minnesota; Pavel Kovalev, Indiana University; Chiara Mattamira, University of Tennessee; Jeremy Shahan, Louisiana State University; Hannah Solomon, Texas A&M University

Abstract: In classical search, most or all user interfaces are text-based. Users submit queries made up of strings, and possibly assert filters (also text-based) to limit the result set. When a user does not like their results, they can “requery” with slightly different terms to produce better results. This process continues until the user is satisfied or gives up.

In visual search, users submit images for their queries. The query images might come from the internet, other eCommerce sites, or from the users’ own library. They expect to see results that look similar to their query image. If the user does not like their results, now they’re stuck: they cannot simply tweak an image the way they can tweak a text-based query.

This is the problem we will resolve with a two-phase multimodal search. The first phase is regular visual search. In the second phase, users can add text to their query to augment their search results. For example, they submit a photo of a yellow dress they have at home, but add the text “green dress” to get results that are green, but otherwise similar to the dress they already have. This enables users to iteratively improve their search results just like they would in classical search.

An Excess Demand Model of Home Price Appreciation

Christopher Jones, US Bank
Matt Mansell, US Bank
Leo Digiosia, US Bank

Team: Ismail Aboumal, California Institute of Technology; Daniela Beckelhymer, University of Minnesota; Jarrad Botchway, Missouri University of Science and Technology; Jordan Pellett, University of Tennessee; Marshall Smith, University of Minnesota

Abstract: National home prices in the U.S. are tracked by one of a few indices: the Case-Shiller and FHFA home price indices being the most popular. Home price appreciation is an important metric tracked by commercial banks. Because bank originators have the ability to hold mortgages on their balance sheets, refinance activity such as cash-out and rate/term refinance contribute to the interest rate and macroeconomic risk of these assets. Traditionally, home prices are forecasted using a form of econometric regression where multiple correlated variables are used in a model. However, these models often lead decision-makers in a bank lacking in terms of interpretability or insights into the mortgage market. In this project, we will explore home prices from the point of view of a differential equation so that we can obtain forecasted values of home price appreciation on a variety of time scales. We will explore the conceptual soundness of a model of excess demand and quantify uncertainty around parameter estimation and shape optimization. We will create a story around this model to explain past events and potential future scenarios.