Accepted Contributions | SynS & ML Workshop @ ICML 2023

Each in-person presentation is assigned to Poster Session in AM or PM; please let us know if you would like to switch the assignment.

Updated on Thursday because the initial assignment did not consider the time slot preference we asked. Sorry for confusion.

Accepted Scientific Models

virtual

(M1) pressio-demoapps

Francesco Rizzi, Patrick Blonigan, Eric Parish, John Tencer, Jorio Cocola, Marcin Wrobel

Abstract Video

Abstract: pressio-demoapps is a collection of 1D, 2D and 3D problems of varying complexity (from linear advection, to reaction-diffusion and compressible Euler). It has native support for sample meshes and exact Jacobians. The code was originally started as part of the Pressio project to create a suite of benchmark problems to test ROMs and hyper-reduction techniques, but it is being developed to be self-contained, and it can be used for variety of purposes. For example, one can just use it for doing "standard" simulations, or just use the Python meshing scripts, or leverage the sample mesh capability to study function approximations. One of the objectives is to provide a simple and reliable testbed. This work has the following features: support for both C++ and Python, cell-centered finite volume discretization with various numerical schemes and exact Jacobians, built-in support for sample mesh in 1D, 2D and 3D for varying stencil sizes, and focus on providing self-contained and well-defined problems.
virtual

(M2) Hybrid Non-Linear Advection-Diffusion-Sorption Simulator

Vinicius Santana, Erbet Costa, Carine Rebello, Ana Mafalda Ribeiro, Chris Rackauckas, Idelfonso Nogueira

Abstract Video

Abstract: The simulator under consideration represents the transport of chemical species in porous media, where diffusion, advection, and sorption play central roles as primary transport mechanisms. Rooted in principles of mass conservation and constitutive laws, the model also integrates a parameterized universal approximator capable of learning mass transfer kinetics between phases from data. This model finds substantial applicability in a range of practical engineering scenarios, including the design of fixed bed reactors [https://doi.org/10.1002/aic.14701] and chromatographic separation processes [https://doi.org/10.1016/j.compchemeng.2019.06.010], and the study of subsurface soil contamination [http://arxiv.org/abs/2104.06010]. The continuous nature of transport equations results in partial differential equations (PDE), which necessitate the use of suitable numerical integration methods for solving or simulating the system. To accommodate this, the simulator was constructed to operate (both calibration and simulation) using the Julia Language, specifically harnessing the power of the SciML ecosystem’s libraries: OrdinaryDiffEq.jl, DiffEqFlux.jl, and SciMLSensitivity.jl. We employ the first two libraries to numerically integrate the discretized PDEs, while we use the last library to compute sensitivity information crucial for model calibration.
Motivation for combining the model with ML: The transport of chemicals in porous media involving sorption is typically modeled assuming that solid-fluid mass transfer follows first-order kinetics, akin to Newton’s law of cooling. However, there are instances where this assumption proves inaccurate, resulting in poor predictive performance of the model [https://doi.org/10.1016/j.cej.2018.07.119]. Due to the high costs associated with designing experiments to identify the true kinetic law in such systems, only a few studies in the literature have attempted to do so. Considering these limitations, this work proposes an alternative approach: substituting the solid-fluid phase mass transfer kinetics with an artificial neural network (ANN) within the partial differential equation framework. Breakthrough data is utilized to calibrate the parameters of the ANN. Following the training of the ANN, symbolic and sparse regression techniques are employed to derive a polynomial-like function that has similar predictive capabilities as the ANN. This refined model aims to enhance the accuracy and reliability of predictions in chemical transport through porous media.
PM

(M3) ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry

Chris Beeler, Sriram Ganapathi Subramanian, Kyle Sprague, Nouha Chatti, Colin Bellinger, Mitchell Shahen, Nicholas Paquin, Mark Baula, Amanuel Dawit, Zihan Yang, Xinkai Li, Mark Crowley, Isaac Tamblyn

Abstract

Abstract: (arXiv paper: https://arxiv.org/abs/2305.14177) The ChemGymRL Open Source Library enables the use of Reinforcement Learning (RL) algorithms to train agents towards the target of operating individual chemistry benches given specific material targets. The environment can be thought of as a virtual chemistry laboratory consisting of different stations (or benches) where a variety of tasks can be completed. The laboratory consists of three basic elements: vessels, shelves, and benches. Vessels contain materials, in pure or mixed form, with each vessel tracking the hidden internal state of their contents. Whether an agent can determine this state, through measurement or reasoning, is up to the design of each bench and the user’s goals. A shelf can hold any vessels not currently in use, as well as the resultants (or output vessels) of previous experiments. Benches are sub-environments which enact various physical or chemical processes on the vessels. Each bench recreates a simplified version of one task in a material design pipeline and has an observation and action space specific to the task at hand. ChemGymRL is designed in a modular fashion so that new benches can be added or modified with minimal difficulty or changes to the source code. A bench must be able to receive a set of initial experimental supplies, possibly including vessels, and return the results of the intended experiment, also including modified vessels. The details and methods of how the benches interact with the vessels between these two points are completely up to the user, including the goal of the bench. In this initial version of ChemGymRL we have implemented some core benches, which we describe in the following sections and which will allow us to demonstrate an example workflow.
Motivation for combining the model with ML: The goal of ChemGymRL is to simulate enough complexity of real-world chemistry experiments to allow meaningful exploration of algorithms for learning policies to control bench-specific agents, while keeping it simple enough that episodes can be rapidly generated during the RL algorithm development process. The environment supports the training of RL agents by associating positive and negative rewards based on the procedure and outcomes of actions taken by the agents. The aim is for ChemGymRL to help bridge the gap between autonomous laboratories and digital chemistry. This will have impacts for producing new materials, chemicals, and drugs. It will also require many technologies including search, feedback and control, and optimization, and artificial intelligence algorithms that can deal with the unique challenges of material design. This simulation environment encapsulates some of those challenges while maintaining as much realism as possible, and extensibility to allow open-source improvement of the simulations going forward. The framework raises interesting computational and modeling challenges for the Reinforcement Learning paradigm that are not always all present in other frameworks such as costs of observation, observations of various level of detail, and hierarchical planning challenges.
PM

(M4) The Computational Crystallography Toolbox

Vidya Ganapati, Daniel Tchon, Aaron S. Brewster, Nicholas K. Sauter

Abstract

Abstract: The Computational Crystallography Toolbox (CCTBX) is open-source software that allows for processing of crystallographic data, including from serial femtosecond crystallography (SFX), for macromolecular structure determination. We aim to use the modules in CCTBX to determine the oxidation state of individual metal atoms in a macromolecule. Changes in oxidation state are reflected in small shifts of the atom’s X-ray absorption edge. These energy shifts can be extracted from the diffraction images recorded in serial femtosecond crystallography, given knowledge of a forward physics model. However, as the diffraction changes only slightly due to the absorption edge shift, inaccuracies in the forward physics model make it extremely challenging to observe the oxidation state. In this work, we describe the potential impact of using self-supervised deep learning to correct the scientific model in CCTBX and provide uncertainty quantification. We provide code for forward model simulation and data analysis, built from CCTBX modules, at https://github.com/gigantocypris/SPREAD, which can be integrated with machine learning. We describe open questions in algorithm development to help spur advances through dialog between crystallographers and machine learning researchers. New methods could help elucidate charge transfer processes in many reactions, including key events in photosynthesis. We further describe CCTBX and the potential for applying machine learning in a paper at https://github.com/gigantocypris/SPREAD/blob/main/PAPER.pdf.
virtual

(M5) ADEPT - Automatic Differentiation Enabled Plasma Transport

Archis Joglekar, Alexander Thomas

Abstract

Abstract: Fusion and astrophysical plasmas are often modeled as charged fluids. To understand their dynamical behavior, the Euler partial-differential-equations for a charged fluid can be solved as an initial value problem or as an externally driven system. However, the fluid equations do not always capture the full richness of the plasma dynamics, for example, in scenarios where microphysics governs macroscopic behavior. Here, we present ADEPT, an Automatic Differentiation Enabled Plasma Transport code written in JAX that has been tested to reproduce known physics. ADEPT provides the user with the ability to train deep models for missing microphysics that improves the solvers ability to reproduce experimental data and/or first-principles simulations. Other applications include the ability to learn improved numerical methods, to perform parameter estimation and parameter discovery [1], and to perform sensitivity analyses. The GitHub repo includes the source code, installation and testing instructions, and an ab-initio simulation generated dataset on which we have trained a microphysics model [2].
[1] - A. S. Joglekar and A. G. R. Thomas - Unsupervised Discovery of Nonlinear Plasma Physics using Differentiable Kinetic Simulations - Journal of Plasma Physics - Dec 2022
[2] - A. S. Joglekar and A. G. R. Thomas - IoP Machine Learning Science & Technology - In Preparation
Motivation for combining the model with ML: Modeling plasma dynamics accurately is a notorious problem in modeling multiscale systems where missing physics can be accounted for by using analytically derived or phenenologically produced models. However, a plasma physics simulator that is capable of learning from experimental or ab-initio data can also account for the missing physics by using automatic differentiation to train models within the simulator. This is a timely application of machine learning to this simulator because of the ease with which a general numerical program can be expressed in modern deep learning frameworks. By using JAX, ADEPT is capable of exploiting GPUs as well as the rest of the ecosystem like Diffrax, Equinox, and jaxopt.

Accepted Papers

AM

(2) Predictive Modeling of Engine-out Emissions using a Combination of Computational Fluid Dynamics and Machine Learning

Alok Warey, Jian Gao, Ronald Grover Jr
PM

(3) Optimization or Architecture: What Matters in Non-Linear Filtering?

Ido Greenberg, Netanel Yannay, Shie Mannor
virtual

(4) Improving the Lipschitz stability in Spectral Transformer through Nearest Neighbour Coupling

Abhishek Kumar Sinha

Video
AM

(6) Neural Polytopes

Koji Hashimoto, Tomoya Naito, Hisashi Naito
AM

(7) Convolutional Neural network for local stabilization parameter prediction for Singularly Perturbed PDEs

Sangeeta Yadav
AM

(8) Meta-Learning Deep Kernels for Latent Force Inference

Jacob Moss, Felix Opolka, Jeremy England, Pietro Lio
AM

(9) OL-Transformer: A Fast and Universal Surrogate Simulator for Optical Multilayer Thin Film Structures

Taigao Ma, Haozhu Wang, L. Jay Guo

Video
AM

(12) Physics-based deep learning framework to learn and forecast cardiac electrophysiology dynamics

Victoriya Kashtanova, Maxime Sermesant, Patrick Gallinari
AM

(14) Speeding up Fourier Neural Operators via Mixed Precision

Renbo Tu, Colin White, Jean Kossaifi, Kamyar Azizzadenesheli, Gennady Pekhimenko, Anima Anandkumar

Video
virtual

(16) A Machine Learning Pressure Emulator for Hydrogen Embrittlement

Minh Chau, João Lucas Sousa Almeida, Elie Alhajjar, Alberto Costa Nogueira Jr

Video
AM

(17) Repurposing Density Functional Theory to Suit Deep Learning

Alexander Mathiasen, Hatem Helal, Paul Balanca, Kerstin Klaeser, Josef Dean, Carlo Luschi, Dominique Beaini, Andrew William Fitzgibbon, Dominic Masters
virtual

(18) Reinstating Continuous Climate Patterns From Small and Discretized Data

Xihaier Luo, Xiaoning Qian, Nathan Urban, Byung-Jun Yoon

Video
AM

(19) Predicting the stabilization quantity with neural networks for Singularly Perturbed Partial Differential Equations

Sangeeta Yadav
AM

(20) Learning Green’s Function Efficiently Using Low-Rank Approximations

Kishan Wimalawarne, Taiji Suzuki, Sophie Langer
AM

(24) Combining Thermodynamics-based Model of the Centrifugal Compressors and Active Machine Learning for Enhanced Industrial Design Optimization

Shadi Ghiasi, Guido Pazzi, Concettina Del Grosso, Giovanni De Magistris, Giacomo Veneri
AM

(27) Simulation-based Inference with the Generalized Kullback-Leibler Divergence

Benjamin Kurt Miller, Marco Federici, Christoph Weniger, Patrick Forré
AM

(28) Infinite-Fidelity Surrogate Learning via High-order Gaussian Processes

Shibo Li, Li Shi, Shandian Zhe
AM

(31) Understanding the Efficacy of U-Net & Vision Transformer for Groundwater Numerical Modelling

Maria Luisa Taccari, Oded Ovadia, He Wang, Xiaohui Chen, Adar Kahana, Peter Jimack

Video
AM

(32) Physics-Constrained Random Forests for Turbulence Model Uncertainty Estimation

Marcel Matha
AM

(34) ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback

Shengchao Liu, Jiongxiao Wang, Yijin Yang, Chengpeng Wang, Ling Liu, Hongyu Guo, Chaowei Xiao
AM

(35) Using machine learning and 3D geophysical modelling for mineral exploration

Gerrit Olivier
AM

(37) An A-adaptive Loop Unrolled Architecture for Solving Inverse Problems with Forward Model Mismatch

Peimeng Guan, Naveed Iqbal, Mark A. Davenport, Mudassir Masood
AM

(38) Unbinned Profiled Unfolding

Jay Chan, Benjamin Nachman
AM

(39) Evaluating the diversity and utility of materials proposed by generative models

Alexander New, Michael Pekala, Elizabeth A Pogue, Nam Q Le, Janna Domenico, Christine D. Piatko, Christopher D Stiles
virtual

(40) Physics-Informed Neural Operator for Coupled Forward-Backward Partial Differential Equations

Xu Chen, Yongjie FU, Shuo Liu, Xuan Di

Video
AM

(43) Exploring the Existence of Atmospheric Blocking’s Precursor Patterns with Physics-Informed Explainable AI

Anh N Nhu, Lei Wang
AM

(44) Predicting Properties of Amorphous Solids with Graph Network Potentials

Muratahan Aykol, Jennifer N. Wei, Simon Batzner, Amil Merchant, Ekin Dogus Cubuk
PM

(45) Good Lattice Accelerates Physics-Informed Neural Networks

Takashi Matsubara, Takaharu Yaguchi
PM

(46) Task-Linear Deep Representation of Physical Systems

Matthieu Blanke, Marc Lelarge
PM

(47) Learning to Optimize Non-Convex Sum-Rate Maximization Problems

Qingyu Song, Guochen Liu, Hong Xu
AM

(48) ClimaX: A Foundation Model for Weather and Climate

Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K Gupta, Aditya Grover
PM

(49) Synergizing Deep Reinforcement Learning and Biological Pursuit Behavioral Rule for Robust and Interpretable Navigation

Kazushi Tsutsui, Kazuya Takeda, Keisuke Fujii
PM

(51) Generating observation guided ensembles for data assimilation with denoising diffusion probabilistic model

Yuuichi Asahi, Yuta Hasegawa, Naoyuki Onodera, Takashi Shimokawabe, Hayato Shiba, Yasuhiro Idomura
virtual

(52) Open Source Infrastructure for Differentiable Density Functional Theory

Advika Vidhyadhiraja, Arun Pa Thiagarajan, Shang Zhu, Venkatasubraman Viswanathan, Bharath Ramsundar

Video
AM

(54) Integrating processed-based models and machine learning for crop yield prediction

Michiel Kallenberg, Bernardo Maestrini, Ron Bree, Paul Ravensbergen, Christos Pylianidis, Frits Evert, Ioannis N. Athanasiadis
virtual

(55) INFINITY: Neural Field Modeling for Reynolds-Averaged Navier-Stokes Equations

Louis Serrano, Léon Migus, Yuan Yin, Jocelyn Ahmed Mazari, Jean-Noël Vittaut, Patrick Gallinari

Video
AM

(56) Neural Modulation Fields for Conditional Cone Beam Neural Tomography

Samuele Papa, David M Knigge, Riccardo Valperga, Nikita Moriakov, Miltiadis Kofinas, Jan-jakob Sonke, Efstratios Gavves
AM

(57) Diffusion model based data generation for partial differential equations

Rucha Apte, Sheel Nidhan, Rishikesh Ranade, Jay Pathak
virtual

(58) Understanding Energy-Based Modeling of Proteins via an Empirically Motivated Minimal Ground Truth Model

Peter William Fields, Vudtiwat Ngampruetikorn, Rama Ranganathan, David J. Schwab, Stephanie Palmer

Video
PM

(59) Adaptive Bias Correction for Improved Subseasonal Forecasting

Soukayna Mouatadid, Paulo Orenstein, Genevieve Elaine Flaspohler, Judah Cohen, Miruna Oprescu, Ernest Fraenkel, Lester Mackey
PM

(61) Titanium 3D Microstructure for Physics-based Generative Models: A Dataset and Primer

Devendra Kumar Jangid, Neal R Brodnik, McLean P Echlin, Samantha Daly, Tresa Pollock, B.S. Manjunath
virtual

(65) CAAFE: Combining Large Language Models with Tabular Predictors for Semi-Automated Data Science

Noah Hollmann, Samuel Müller, Frank Hutter

Video
PM

(66) Hybrid Diffusions for Stable Molecular Structure Generation via Explicit Energy-based Model

Youngwoo Cho, Seunghoon Yi, Soo Kyung Kim, Hongkee Yoon, Joonseok Lee
AM

(67) Accelerating Molecular Graph Neural Networks via Knowledge Distillation

Filip Ekström Kelvinius, Dimitar Georgiev, Artur Toshev, Johannes Gasteiger
AM

(68) How important are specialized transforms in Neural Operators?

Ritam Majumdar, Shirish Karande, Lovekesh Vig

Video
PM

(73) What if We Enrich day-ahead Solar Irradiance Time Series Forecasting with Spatio-Temporal Context?

Oussama Boussif, Ghait Boukachab, Dan Assouline, Stefano Massaroli, Tianle Yuan, Loubna Benabbou, Yoshua Bengio
PM

(75) NuCLR: Nuclear Co-Learned Representations

Niklas Nolte, Ouail Kitouni, Mike Williams, Sokratis Trifinopoulos, Subhash Kantamneni
AM

(76) Coupling Self-Attention Generative Adversarial Network and Bayesian Inversion for Carbon Storage System

Jichao Bao, Jonghyun Lee, Hongkyu Yoon
PM

(77) Learning from Topology: Cosmological Parameter Estimation from the Large-scale Structure

Jacky H. T. Yip, Adam Rouhiainen, Gary Shiu
PM

(78) Reliable coarse-grained turbulent simulations through combined offline learning and neural emulation

Christian Pedersen, Laure Zanna, Joan Bruna, Pavel Perezhogin
PM

(80) Estimation of Physical Coefficients for CO_2 Sequestration using Deep Generative Priors based Inverse Modeling Framework

Jiawei Shen, Jonghyun Lee, Hongkyu Yoon
virtual

(81) How to Select Physics-Informed Neural Networks in the Absence of Ground Truth: A Pareto Front-Based Strategy

Zhao Wei, Jian Cheng Wong, Nicholas Wei Yong Sung, Abhishek Gupta, Chin Chun Ooi, Pao-Hsiung Chiu, My Ha Dao, Yew-Soon Ong

Video
virtual

(83) Multi-Objective PSO-PINN

Caio Davi, Ulisses Braga-Neto

Video
PM

(84) RANS-PINN based Simulation Surrogates for Predicting Turbulent Flows

Shinjan Ghosh, Amit Chakraborty, Georgia Olympia Brikis, Biswadip Dey
PM

(85) A language-based recommendation system for material discovery

Jiaxing Qu, Yuxuan Richard Xie, Elif Ertekin
PM

(88) Knowledge-Guided Additive Modeling For Supervised Regression

Yann Claes, Van Anh Huynh-Thu, Pierre Geurts