Climate Change AI Workshop Papers

On this page, we show accepted works from all our workshops on "Tackling Climate Change with Machine Learning."

Venue Title Subject Areas
NeurIPS 2024 Time Series Viewmakers for Robust Disruption Prediction in Nuclear Fusion (Papers Track)
Abstract and authors: (click to expand)

Abstract: Tokamaks, as a leading technology in the quest for nuclear fusion energy, play a pivotal role in the fight against climate change. For tokamaks to become a viable solution for clean energy however, they must effectively detect and manage disruptions — plasma instabilities that can cause significant damages, impairing the reliability and efficiency required for their widespread adoption as a clean energy source. Machine learning (ML) models have shown promise in predicting disruptions for specific tokamaks, but they often struggle in generalizing to the diverse characteristics and dynamics of different machines. This limits the effectiveness of ML models across different tokamak designs and operating conditions, which is a critical barrier to scaling fusion technology. Given the success of data augmentation in improving model robustness and generalizability in other fields, this study explores the use of a novel time series viewmaker network to generate diverse augmentations or "views" of training data. Our results show that incorporating views during training improves AUC and F2 scores on DisruptionBench tasks compared to standard or no augmentations. This approach represents a promising step towards developing more broadly applicable ML models for disruption avoidance, which is essential for advancing fusion technology and, ultimately, addressing climate change through reliable and sustainable energy production.

Authors: Dhruva Chayapathy (Alpharetta High School); Tavis Siebert (UC Berkeley); Lucas Spangher (Google); Cristina Rea (MIT Plasma Science and Fusion Center)

Power & Energy Time-series Analysis
NeurIPS 2024 Explainable Meta Bayesian Optimization with Human Feedback for Scientific Applications like Fusion Energy (Papers Track)
Abstract and authors: (click to expand)

Abstract: We introduce Meta Bayesian Optimization with Human Feedback (MBO-HF), which integrates Meta-Learning and expert preferences to enhance BO. MBO-HF employs Transformer Neural Processes (TNPs) to create a meta-learned surrogate model and a human-informed acquisition function (AF) to suggest and explain proposed candidate experiments. MBO-HF outperforms current methods in optimizing various scientific experiments and benchmarks in simulation, including the energy yield of the inertial confinement fusion (ICF), practical molecular optimization (PMO), and critical temperature maximization for superconducting materials.

Authors: Ricardo Luna Gutierrez (Hewlett Packard Enterprise); Sahand Ghorbanpour (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Rahman Ejaz (University of Rochester); Varchas Gopalaswamy (University of Rochester); Riccardo Betti (University of Rochester); Avisek Naug (Hewlett Packard Enterprise); Desik Rengarajan (Hewlett Packard Labs); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Paolo Faraboschi (Hewlett Packard Enterprise); Soumyendu Sarkar (Hewlett Packard Enterprise)

Power & Energy Reinforcement Learning
NeurIPS 2024 Enhancing Reinforcement Learning-Based Control of Wave Energy Converters Using Predictive Wave Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: Ocean wave energy is a reliable form of clean, renewable energy that has been under-explored compared to solar and wind. Wave Energy Converters (WEC) are devices that convert wave energy to electricity. To achieve a competitive Levelized Cost of Energy (LCOE), WECs require complex controllers to maximize the absorbed energy. Traditional engineering controllers, like spring-damper, cannot anticipate incoming waves, missing vital information that could lead to higher energy capture. Reinforcement Learning (RL) based controllers can instead optimize for long-term gains by being informed about the future waves. Prior works have utilized incoming wave information, achieving significant gains in energy capture. However, this has only been done via simulated waves (perfect prediction), making them impractical in real-life deployment. In this work, we develop a neural network based model for wave prediction. While prior works use auto-regressive techniques, we predict waves using information available on-device like position, acceleration, etc. We show that replacing the simulated waves with the wave predictor model can still maintain the gain in energy capture achieved by the RL controller in simulations.

Authors: Vineet Gundecha (Hewlett Packard Enterpise); Arie Paap (Carnegie Clean Energy); mathieu Cocho (Carnegie Clean Energy); Sahand Ghorbanpour (Hewlett Packard Enterprise); Alexandre Pichard (Carnegie Clean Energy); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Soumyendu Sarkar (Hewlett Packard Enterprise)

Oceans & Marine Systems Power & Energy Reinforcement Learning Time-series Analysis
NeurIPS 2024 Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping (Papers Track)
Abstract and authors: (click to expand)

Abstract: The transition to renewable energy, particularly solar, is key to mitigating climate change. Google's Solar API aids this transition by estimating solar potential from aerial imagery, but its impact is constrained by geographical coverage. This paper proposes expanding the API's reach using satellite imagery, enabling global solar potential assessment. We tackle challenges involved in building a Digital Surface Model (DSM) and roof instance segmentation from lower resolution and single oblique views using deep learning models. Our models, trained on aligned satellite and aerial datasets, produce 25cm DSMs and roof segments. With ~1m DSM MAE on buildings, ~5deg roof pitch error and ~56% IOU on roof segmentation, they significantly enhance the Solar API's potential to promote solar adoption.

Authors: Vishal Batchu (Google Research); Alex Wilson (Google); Betty Peng (Google); Carl Elkin (Google); Umangi Jain (University of Toronto); Christopher Arsdale (Google Research); Ross Goroshin (Google); Varun Gulshan (Google Research)

Earth Observation & Monitoring Buildings Cities & Urban Planning Power & Energy Computer Vision & Remote Sensing
NeurIPS 2024 Temporal sequence transformer to advance long-term streamflow prediction (Papers Track) Best ML Innovation
Abstract and authors: (click to expand)

Abstract: Accurate streamflow prediction is crucial for understanding climate change impacts on water resources and for effective management of extreme hydrological events. While Long Short-Term Memory (LSTM) networks have been the dominant data-driven approach for streamflow forecasting, recent advancements in transformer architectures for time series tasks have shown promise in outperforming traditional LSTM models. This study introduces a transformer-based model that integrates historical streamflow data with climatic variables to enhance streamflow prediction accuracy. We evaluated our transformer model against a benchmark LSTM across five diverse basins in the United States. Results demonstrate that the transformer architecture consistently outperforms the LSTM model across all evaluation metrics, highlighting its potential as a more effective tool for hydrological forecasting. This research contributes to the ongoing development of advanced AI techniques for improved water resource management and climate change adaptation strategies.

Authors: Ruhaan Singh (Farragut High School); Dan Lu (Oak Ridge National Laboratory); Kshitij Tayal (Oak Ridge National Laboratory)

Time-series Analysis Climate Science & Modeling
NeurIPS 2024 Mapping Air Pollution Sources with Sequential Transformer Chaining: A Case Study in South Asia (Papers Track)
Abstract and authors: (click to expand)

Abstract: This study presents a comprehensive framework for detecting pollution sources, specifically factory and brick kiln chimneys, in major South Asian cities using a combination of remote sensing data and advanced deep learning techniques. We first identify hotspots of Acute Respiratory Infections (ARI) by correlating health data with air pollutant concentrations, including sulfur dioxide (SO_2), nitrogen dioxide (NO_2), and carbon monoxide (CO). For these identified hotspots, both low-resolution and high-resolution satellite imagery are acquired. Our approach employs a sequential process, beginning with a Vision Transformer model that utilizes high resolution satellite imagery and a broad range of text inputs with a lower confidence threshold to initially filter the data. This is followed by the application of the Remote CLIP model, which is run twice in succession using satellite imagery paired with different text inputs to refine the detection further. This sequential tranformer chaining filter out 99% of irrelevant data from high-resolution imagery. The final step involves manual annotation on the remaining 1% of the data, ensuring high accuracy and minimizing errors. Additionally, a novel multispectral chimney index is developed for detecting chimneys in low-resolution imagery. The study introduces a unique, annotated chimney detection dataset capturing diverse chimney types, which improves detection accuracy. The results provide actionable insights for public health interventions and support regulatory measures aimed at achieving the United Nations' Sustainable Development Goal 3 on health and well-being. We plan to make the dataset and code publicly available following the acceptance of this paper.

Authors: Hafiz Muhammad Abubakar (Beaconhouse National University); Raahim Arbaz (Beaconhouse National University); Hasnain Ahmad (Beaconhouse National University); Mubasher Nazir (Solve Agri Pak Private Limited); Usman Nazir (Beaconhouse National University)

Climate Justice Health
NeurIPS 2024 Icy Waters: Developing a Test-Suite to Benchmark Sea Ice Concentration Forecasting (Papers Track)
Abstract and authors: (click to expand)

Abstract: Artificial intelligence (AI) for Climate Change efforts have made significant progress in forecasting atmospheric weather patterns and events. Despite this, translating these gains in the context of phenomenon on earth surface, e.g. sea-ice concentration, has been limited because of differences in how these physical processes evolve. Sea ice concentration is one of the key indicators of climate change and is also critical for a number of different applications and indigenous peoples. Consequently, there is an acute need to develop a baseline of a diverse set of modern machine learning techniques within the Arctic. Our work aims to fill this gap, with the goal of both informing current research, as well as pointing out limitations with certain architectures. We achieve this by providing baselines for a number of different convolutional LSTMs, transformer based, and neural operator based machine learning methods.

Authors: Kiernan McGuigan (University of Waterloo); Sirisha Rambhatla (University of Waterloo); K Andrea Scott (University of Waterloo)

Oceans & Marine Systems Time-series Analysis
NeurIPS 2024 A Deep Learning Approach to the Automated Segmentation of Bird Vocalizations from Weakly Labeled Crowd-sourced Audio (Papers Track)
Abstract and authors: (click to expand)

Abstract: Ecologists interested in monitoring the effects caused by climate change are increasingly turning to passive acoustic monitoring, the practice of placing autonomous audio recording units in ecosystems to monitor species richness and occupancy via species calls. However, identifying species calls in large datasets by hand is an expensive task, leading to a reliance on machine learning models. Due to a lack of annotated datasets of soundscape recordings, these models are often trained on large databases of community created focal recordings. A challenge of training on such data is that clips are given a "weak label," a single label that represents the whole clip. This includes segments that only have background noise but are labeled as calls in the training data, reducing model performance. Heuristic methods exist to convert clip-level labels to "strong" call-specific labels, where the label tightly bounds the temporal length of the call and better identifies bird vocalizations. Our work improves on the current weakly to strongly labeled method used on the training data for BirdNET, the current most popular model for audio species classification. We utilize an existing RNN-CNN hybrid, resulting in a precision improvement of 12% (going to 90% precision) against our new strongly hand-labeled dataset of Peruvian bird species.

Authors: Jacob Ayers (Engineers for Exploration at UCSD); Sean Perry (University of California San Diego); Samantha Prestrelski (UC San Diego); Tianqi Zhang (Engineers for Exploration); Ludwig von Schoenfeldt (University of California San Diego); Mugen Blue (UC Merced); Gabriel Steinberg (Demining Research Community); Mathias Tobler (San Diego Zoo Wildlife Alliance); Ian Ingram (San Diego Zoo Wildlife Alliance); Curt Schurgers (UC San Diego); Ryan Kastner (University of California San Diego)

Earth Observation & Monitoring Ecosystems & Biodiversity
NeurIPS 2024 Enabling Adoption of Regenerative Agriculture through Soil Carbon Copilots (Papers Track)
Abstract and authors: (click to expand)

Abstract: Mitigating climate change requires transforming agriculture to minimize environmental impact and build climate resilience. Regenerative agricultural practices enhance soil organic carbon (SOC) levels, thus improving soil health and sequestering carbon. A challenge to increasing regenerative agriculture practices is cheaply measuring SOC over time and then understanding how SOC is affected by regenerative agricultural practices and other environmental factors and farm management practices. To address this challenge, we introduce an AI-driven Soil Organic Carbon Copilot that automates the ingestion of complex multi-resolution, multi-modal data to provide large-scale insights into soil health and regenerative practices. Our data includes extreme weather event data (e.g., drought conditions and wildfire incidents), farm management data (e.g., cropland information and tillage predictions), and SOC predictions. We find that integrating public data and specialized models enables large-scale, localized analysis for sustainable agriculture. In comparisons of agricultural activities and practices across California counties, we find evidence that diverse agricultural activity may mitigate the negative effects of tillage; and that while extreme weather conditions heavily affect SOC, composting may mitigate SOC loss. Finally, implementing role-specific personas empowers agronomists, farm consultants, policymakers, and other stakeholders to implement evidence-based strategies that promote sustainable agriculture and build climate resilience.

Authors: Margaret Capetz (UCLA); Swati Sharma (Microsoft Research); Peder Olsen (Microsoft); RAFAEL PADILHA (Microsoft Research); Jessica Wolk (Microsoft); Emre Kiciman (Microsoft Research); Ranveer Chandra (Microsoft Research)

Carbon Capture & Sequestration Natural Language Processing
NeurIPS 2024 Carbon-Aware Spatio-Temporal Workload Distribution in Cloud Data Center Clusters Using Reinforcement Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Reducing the environmental impact of cloud computing requires efficient workload distribution across geographically dispersed Data Center Clusters (DCCs). In this paper, we introduce Green-DCC, which proposes Reinforcement Learning-based hierarchical controller techniques to dynamically optimize temporal and geographical workload distribution between data centers that belong to the same DCC. The environment models non-uniform external weather, carbon intensity, computing resources, cooling capabilities, and dynamic bandwidth costs, which provide constraints and interdependencies. We adapted and evaluated various reinforcement learning approaches, comparing their aggregate carbon emissions across the DCC, demonstrating Green-DCC's effectiveness for controlling and testing advanced data center control algorithms for sustainability.

Authors: Soumyendu Sarkar (Hewlett Packard Enterprise); Antonio Guillen-Perez (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Avisek Naug (Hewlett Packard Enterprise); Ricardo Luna Gutierrez (Hewlett Packard Enterprise); Sajad Mousavi (Hewlett Packard Enterprise); Paolo Faraboschi (Hewlett Packard Enterprise); Cullen Bash (Hewlett Packard Enterprise)

Power & Energy Reinforcement Learning
NeurIPS 2024 InvestESG: A Multi-agent Reinforcement Learning Benchmark for Studying Climate Investment as a Social Dilemma (Papers Track)
Abstract and authors: (click to expand)

Abstract: InvestESG is a novel multi-agent reinforcement learning (MARL) benchmark designed to study the impact of Environmental, Social, and Governance (ESG) disclosure mandates on corporate climate investments. Supported by both PyTorch and JAX implementation, the benchmark models an intertemporal social dilemma where companies balance short-term profit losses from climate mitigation efforts and long-term benefits from reducing climate risk, while ESG-conscious investors attempt to influence corporate behavior through their investment decisions, in a scalable and hardware-accelerated manner. Companies allocate capital across mitigation, greenwashing, and resilience, with varying strategies influencing climate outcomes and investor preferences. Our experiments show that without ESG-conscious investors with sufficient capital, corporate mitigation efforts remain limited under the disclosure mandate. However, when a critical mass of investors prioritizes ESG, corporate cooperation increases, which in turn reduces climate risks and enhances long-term financial stability. Additionally, providing more information about global climate risks encourages companies to invest more in mitigation, even without investor involvement. Our findings align with empirical research using real-world data, highlighting MARL's potential to inform policy by providing insights into large-scale socio-economic challenges through efficient testing of alternative policy and market designs.

Authors: Xiaoxuan Hou (University of Washington); Jiayi Yuan (University of Washington); Natasha Jaques (University of Washington)

Public Policy Climate Finance & Economics Reinforcement Learning
NeurIPS 2024 Exploring Vision Transformers for Early Detection of Climate Change Signals (Papers Track)
Abstract and authors: (click to expand)

Abstract: This study evaluates Vision Transformers (ViTs) for detecting anthropogenic climate change signals, crucial for effective policy planning and risk assessment. Compared to previously suggested models like CNN, MLP, and ridge regression, ViTs consistently detect forced climate signals earlier across three reanalysis datasets (ERA5, JRA-3Q, and MERRA-2). Interpretation with Integrated Gradients reveals consistent spatial patterns, suggesting ViTs utilize physically-grounded signals. This work highlights ViTs' potential to advance climate change detection and attribution tasks.

Authors: Sungduk Yu (Intel Labs); Brian White (UNC Chapel Hill); Anahita Bhiwandiwalla (Intel Labs); Yaniv Gurwicz (Intel Labs); Musashi Hinck (Intel Corporation); Matthew Olson (Intel Labs); Raanan Rohekar (Intel Labs); Vasudev Lal (Intel Corp)

Climate Science & Modeling Earth Observation & Monitoring
NeurIPS 2024 No Location Left Behind: Introducing the Fairness Assessment for Implicit Representations of Earth Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: Encoding and predicting physical measurements such as temperature or carbon dioxide is instrumental to many high-stakes challenges – including climate change. Yet, all recent advances solely assess models’ performances at a global scale. But while models’ predictions are improving on average over the entire globe, performances on sub-groups such as islands or coastal areas are left uncharted. To ensure safe deployment of those models, we thus introduce FAIR-Earth, a fine-grained evaluation suite made of diverse and high-resolution dataset. Our findings are striking–current methods produce highly biased predictions towards specific geospatial locations. The specifics of the biases vary based on the data modality and hyper-parameters of the models. Hence, we hope that FAIR-Earth will enable future research to design solutions aware of those per-group biases.

Authors: Daniel Cai (Brown University); Randall Balestriero (Brown University)

Generative Modeling Climate Justice Climate Science & Modeling Earth Observation & Monitoring Extreme Weather Interpretable ML
NeurIPS 2024 Feasibility of Forecasting Highly Resolved Power Grid Frequency Utilizing Temporal Fusion Transformers (Papers Track)
Abstract and authors: (click to expand)

Abstract: As our society moves toward a decarbonized energy system, we need to improve our ability to model, predict, and understand power system behavior and dynamics. The balance between generation and demand on short time scales is reflected by the power grid frequency, making it central to the control of power grids. Hence, an accurate understanding and forecasting of power grid frequency could ease the planning of control actions and thus improve system stability and help save costs. Whether deep learning approaches can provide forecasts of the highly resolved and noisy time series, as they are present in the case of power grid frequency, remains an open question. In this paper, we find that the Temporal Fusion Transformer (TFT) is able to outperform baseline models, while a comparably simple multilayer perceptron is not. By reducing the time resolution of the frequency time series, we investigate and quantify the trade-off between the energy consumption and prediction performance % or forecasting accuracy? of the TFT. Furthermore, the inclusion of additional exogenous variables (e.g. calendar features, load, or generation) further improves the performance of the TFT. Utilizing the TFT's inherent interpretability, we identify the forecasted load ramp, the current hour, and the current month as the most relevant features.

Authors: Hadeer El Ashhab (KIT); Benjamin Schäfer (Karlsruhe Institute of Technology); Sebastian Pütz (Karlsruhe Institute of Technology)

Power & Energy Time-series Analysis
NeurIPS 2024 A Water Efficiency Dataset for African Data Centers (Papers Track)
Abstract and authors: (click to expand)

Abstract: AI computing and data centers consume a large amount of freshwater, both directly for cooling and indirectly for electricity generation. While most attention has been paid to developed countries such as the U.S., this paper presents the first-of-its-kind dataset that combines nation-level weather and electricity generation data to estimate water usage efficiency for data centers in 41 African countries across five different climate regions. We also use our dataset to evaluate and estimate the water consumption of inference on two large language models (i.e., Llama-3-70B and GPT-4) in 11 selected African countries. Our findings show that writing a 10-page report using Llama-3-70B could consume about 0.7 liters of water, while the water consumption by GPT-4 for the same task may go up to about 60 liters. For writing a medium-length email of 120-200 words, Llama-3-70B and GPT-4 could consume about 0.13 liters and 3 liters of water, respectively. Interestingly, given the same AI model, 8 out of the 11 selected African countries consume less water than the global average, mainly because of lower water intensities for electricity generation. However, water consumption can be substantially higher in some African countries with a steppe climate than the U.S. and global averages, prompting more attention when deploying AI computing in these countries.

Authors: NOAH SHUMBA (Carnegie Mellon University Africa); Opelo Tshekiso (Carnegie Mellon University Africa); Pengfei Li (UCR); Giulia Fanti (CMU); Shaolei Ren (UC Riverside)

Societal Adaptation & Resilience
NeurIPS 2024 Using Convolutional Neural Processes to Produce High-Resolution Weather Datasets Over New Zealand (Papers Track)
Abstract and authors: (click to expand)

Abstract: In recent years, there has been a surge in the development and success of artificial intelligence methods for global weather forecasting and climate modelling. For regional models, however, there is a lack of high-resolution datasets that can be used to train data-driven forecasting models. In this work, we showcase the use of convolutional neural processes (ConvNP) to generate hourly high-resolution (1km) weather datasets over New Zealand for temperature and precipitation. ConvNP models allow us to produce datasets that are enhanced by station observations and provide uncertainty estimates in their predictions. The generated datasets have applications in AI weather and climate models, model verification, and broader environmental research.

Authors: Emily O'Riordan (Bodeker Scientific)

Meta- and Transfer Learning Climate Science & Modeling Extreme Weather Uncertainty Quantification & Robustness
NeurIPS 2024 AI-Driven Predictive Modeling of PFAS Contamination in Aquatic Ecosystems: Exploring A Geospatial Approach (Papers Track)
Abstract and authors: (click to expand)

Abstract: Per- and polyfluoroalkyl substances (PFAS), a class of synthetic fluorinated compounds termed “forever chemicals”, have garnered significant attention due to their persistence, widespread environmental presence, bioaccumulative properties, and associated risks for human health. Their presence in aquatic ecosystems highlights the link between human activity and the hydrological cycle. They also disrupt aquatic life, interfere with gas exchange, and disturb the carbon cycle, contributing to greenhouse gas emissions and exacerbating climate change. Federal agencies, state governments and non-government research and public interest organizations have emphasized the need for documenting the sites and the extent of PFAS contamination. However, the time-consuming and expensive nature of data collection and analysis poses challenges. It hinders the rapid identification of locations at high risk of PFAS contamination, which may then require further sampling or remediation. To address this data limitation, our study leverages a novel geospatial dataset, machine learning models including frameworks such as Random Forest, IBM-NASA's Prithvi and UNet, and geospatial analysis to predict regions with high PFAS concentrations in surface water. Using fish data from the National Rivers and Streams Assessment (NRSA) dataset by the Environmental Protection Agency (EPA), our analysis suggests the potential value of machine learning based models for targeted deployment of sampling investigations and remediation efforts.

Authors: Jowaria Khan (University of Michigan); David Andrews (Environmental Working Group); Kaley Beins (Environmental Working Group); Sydney Evans (Environmental Working Group); Alexa Friedman (Environmental Working Group); Elizabeth Bondi-Kelly (MIT)

Computer Vision & Remote Sensing Earth Observation & Monitoring Climate Science & Modeling Ecosystems & Biodiversity Health Chemistry & Materials Data Mining Uncertainty Quantification & Robustness
NeurIPS 2024 End-to-End Conformal Calibration for Robust Grid-Scale Battery Storage Optimization (Papers Track)
Abstract and authors: (click to expand)

Abstract: The rapid proliferation of intermittent renewable electricity generation demands a corresponding growth in grid-scale energy storage systems to enable grid decarbonization. To encourage investment in energy storage infrastructure, storage operators rely on forecasts of electricity prices along with uncertainty estimates to maximize profit while managing risk. However, well-calibrated uncertainty estimates can be difficult to obtain in high-capacity prediction models such as deep neural networks. Moreover, in high-dimensional settings, there may be many valid uncertainty estimates with varied performance profiles—i.e., not all uncertainty is equally valuable for downstream decision-making. To address this challenge, this paper develops an end-to-end framework for conditional robust optimization, with robustness and calibration guarantees provided by conformal prediction. We represent arbitrary convex uncertainty sets with sublevel sets of partially input-convex neural networks, which are learned as part of our framework. We demonstrate the value of our approach for robust decision-making on a battery storage arbitrage application.

Authors: Christopher Yeh (California Institute of Technology); Nicolas Christianson (California Institute of Technology); Adam Wierman (California Institute of Technology); Yisong Yue (Caltech)

Uncertainty Quantification & Robustness Power & Energy
NeurIPS 2024 AtmosArena: Benchmarking Foundation Models for Atmospheric Sciences (Papers Track)
Abstract and authors: (click to expand)

Abstract: Deep learning has emerged as a powerful tool for atmospheric sciences, showing significant utility across various tasks in weather and climate modeling. In line with recent progress in language and vision foundation models, there are growing efforts to scale and finetune such models for multi-task spatiotemporal reasoning. Despite promising results, existing works often evaluate their model on a small set of non-uniform tasks, which makes it hard to quantify broad generalization across diverse tasks and domains. To address this challenge, we introduce AtmosArena, the first multi-task benchmark dedicated to foundation models in atmospheric sciences. AtmosArena comprises a suite of tasks that cover a broad spectrum of applications in atmospheric physics and atmospheric chemistry. To showcase the capabilities and key features of our benchmark, we conducted extensive experiments to evaluate two state-of-the-art deep learning models, ClimaX and Stormer on AtmosArena, and compare their performance with other deep learning and traditional baselines. By providing a standardized, open-source benchmark, we aim to facilitate further advancements in the field, much like open-source benchmarks have driven the development of foundation models for language and vision.

Authors: Tung Nguyen (University of California, Los Angeles); Prateik Sinha (UCLA); Advit Deepak (University of California, Los Angeles); Karen A. McKinnon (University of California, Los Angeles); Aditya Grover (UCLA)

Climate Science & Modeling Meta- and Transfer Learning
NeurIPS 2024 Harnessing AI for Wildfire Defense: An approach to Predict and Mitigate Global Fire Risk (Papers Track)
Abstract and authors: (click to expand)

Abstract: Wildfires pose a critical threat to wildlife, economies, properties, and human lives globally, making accurate risk assessment essential for effective management and mitigation. This study introduces a novel machine learning-based approach utilizing a Convolutional Neural Network (CNN) to evaluate wildfire risks across diverse ecosystems. Leveraging a comprehensive dataset of remote-sensed variables—including topography, vegetation health indicators, and climatic conditions—our model operates at a spatial resolution of 1000 meters per pixel, providing enhanced precision in predicting wildfire occurrences. The CNN outperforms state-of-the-art models, achieving a fire detection ratio of 0.82 and a no-fire detection ratio of 0.87. The results demonstrate that most dataset variables are crucial for accurate risk assessment, although some are non-essential. By integrating data from regions around the globe, this study underscores the feasibility and effectiveness of implementing globally scalable wildfire prediction tools.

Authors: Hassan Ashfaq (Ghulam Ishaq Khan Institute of Engineering Sciences and Technology)

Computer Vision & Remote Sensing Climate Science & Modeling Earth Observation & Monitoring Ecosystems & Biodiversity Extreme Weather Forests Active Learning Generative Modeling Interpretable ML Reinforcement Learning
NeurIPS 2024 Tree Species Classification using Machine Learning and 3D Tomographic SAR - a case study in Northern Europe (Papers Track)
Abstract and authors: (click to expand)

Abstract: Tree species classification plays an important role in nature conservation, forest inventories, forest management, and the protection of endangered species. Over the past four decades, remote sensing technologies have been extensively utilized for tree species classification, with Synthetic Aperture Radar (SAR) emerging as a key technique. In this study, we employed TomoSense, a 3D tomographic dataset, which utilizes a stack of single-look complex (SLC) images, a byproduct of SAR, captured at different incidence angles to generate a three-dimensional representation of the terrain. Our research focuses on evaluating multiple tabular machine-learning models using the height information derived from the tomographic image intensities to classify eight distinct tree species. The SLC data and tomographic imagery were analyzed across different polarimetric configurations and geosplit configurations. We investigated the impact of these variations on classification accuracy, comparing the performance of various tabular machine-learning models and optimizing them using Bayesian optimization. Additionally, we incorporated a proxy for actual tree height using point cloud data from Light Detection and Ranging (LiDAR) to provide height statistics associated with the model’s predictions. This comparison offers insights into the reliability of tomographic data in predicting tree species classification based on height.

Authors: Jumpei Takami (United Nations Office for Outer Space Affairs); Grace Colverd (University of Cambridge); Laura Schade (Department of Energy Security and Net Zero - UKGOV); Karol Bot (INESCTEC); Joseph Gallego (Drexel University)

Earth Observation & Monitoring Climate Science & Modeling
NeurIPS 2024 Towards Using Machine Learning to Generatively Simulate EV Charging in Urban Areas (Papers Track)
Abstract and authors: (click to expand)

Abstract: This study addresses the challenge of predicting electric vehicle (EV) charging profiles in urban locations with limited data. Utilizing a neural network architecture, we aim to uncover latent charging profiles influenced by spatio-temporal factors. Our model focuses on peak power demand and daily load shapes, providing insights into charging behavior. Our results indicate significant impacts from the type of Basic Administrative Units on predicted load curves, which contributes to the understanding and optimization of EV charging infrastructure in urban settings and allows Distribution System Operators (DSO) to more efficiently plan EV charging infrastructure expansion.

Authors: Marek Miltner (Stanford University; Czech Technical University); Jakub Zíka (CTU); Daniel Vašata (Czech Technical University in Prague, Faculty of Information Technology); Artem Bryksa (CTU); Magda Friedjungová (Czech Technical University in Prague, Faculty of Information Technology); Ondřej Štogl (CTU); Ram Rajagopal (Stanford University); Oldřich Starý (CTU)

Power & Energy Behavioral and Social Science Cities & Urban Planning Transportation Active Learning Data Mining Interpretable ML
NeurIPS 2024 High-Resolution Domestic Energy Modelling for National Energy and Retrofit Planning (Papers Track)
Abstract and authors: (click to expand)

Abstract: The UK's building stock, responsible for 13% of national greenhouse gas emissions in 2023, plays a crucial role in meeting the country's ambitious 2030 emissions reduction target. With the UK currently off-track and the building sector's emissions reductions slowing since 2014, there is an urgent need for improved energy modelling and policy development. We introduce a novel dataset for small-neighbourhood energy modelling in England and Wales to address this challenge. Covering 614k postcodes, ranging from 5-150 households, our dataset integrates domestic energy consumption data with variables spanning building characteristics, local environment, and socio-demographics. This dataset offers a new level of granularity in national energy analysis. It can provide insights for retrofit planning, material stock analysis and energy policy, transforming approaches to small-scale energy analysis and supporting the UK's climate goals.

Authors: Grace Colverd (University of Cambridge); Ronita Bardhan (University of Cambridge); Jonathan Cullen (University of Cambridge)

Buildings Cities & Urban Planning
NeurIPS 2024 Continuous latent representations for modeling precipitation with deep learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: The sparse and spatio-temporally discontinuous nature of precipitation data presents significant challenges for simulation and statistical processing for bias correction and downscaling. These include incorrect representation of intermittency and extreme values (critical for hydrology applications), Gibbs phenomenon upon regridding, and lack of fine scales details. To address these challenges, a common approach is to transform the precipitation variable nonlinearly into one that is more malleable. In this work, we explore how deep learning can be used to generate a smooth, spatio-temporally continuous variable as a proxy for simulation of precipitation data. We develop a normally distributed field called pseudo-precipitation (PP) as an alternative for simulating precipitation. The practical applicability of this variable is investigated by applying it for downscaling precipitation from 1\degree (\(\sim\) 100 km) to 0.25\degree (\(\sim\) 25 km).

Authors: Gokul Radhakrishnan (Verisk Analytics); Rahul Sundar (Verisk, India); Nishant Parashar (Verisk Analytics); Antoine Blanchard (Verisk); Daiwei Wang (Verisk Analytics); Boyko Dodov (Verisk Analytics)

Extreme Weather Generative Modeling
NeurIPS 2024 Improving Power Plant CO2 Emission Estimation with Deep Learning and Satellite/Simulated Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: CO2 emissions from power plants, as significant super emitters, contribute substantially to global warming. Accurate quantification of these emissions is crucial for effective climate mitigation strategies. While satellite-based plume inversion offers a promising approach, challenges arise from data limitations and the complexity of atmospheric conditions. This study addresses these challenges by (a) expanding the available dataset through the integration of NO2 data from Sentinel-5P, generating continuous XCO2 maps, and incorporating real satellite observations from OCO-2/3 for over 71 power plants in data-scarce regions; and (b) employing a customized U-Net model capable of handling diverse spatio-temporal resolutions for emission rate estimation. Our results demonstrate significant improvements in emission rate accuracy compared to previous methods [11]. By leveraging this enhanced approach, we can enable near real-time, precise quantification of major CO2 emission sources, supporting environmental protection initiatives and informing regulatory frameworks.

Authors: Dibyabha Deb (Manipal Institute of Technology); Kamal Das (IBM Research)

Computer Vision & Remote Sensing Carbon Capture & Sequestration Earth Observation & Monitoring Unsupervised & Semi-Supervised Learning
NeurIPS 2024 Advancing Heat Demand Forecasting with Attention Mechanisms: Opportunities and Challenges (Papers Track)
Abstract and authors: (click to expand)

Abstract: Global leaders and policymakers are unified in their unequivocal commitment to decarbonization efforts in support of Net-Zero agreements. District Heating Systems (DHS), while contributing to carbon emissions due to the continued reliance on fossil fuels for heat production, are embracing more sustainable practices albeit with some sense of vulnerability as it could constrain their ability to adapt to dynamic demand and production scenarios. As demographic demands grow and renewables become the central strategy in decarbonizing the heating sector, the need for accurate demand forecasting has intensified. Advances in digitization have paved the way for Machine Learning (ML) based solutions to become the industry standard for modeling complex time series patterns. In this paper, we focus on building a Deep Learning (DL) model that uses deconstructed components of independent and dependent variables that affect heat demand as features to perform multi-step ahead forecasting of head demand. The model represents the input features in a time-frequency space and uses an attention mechanism to generate accurate forecasts. The proposed method is evaluated on a real-world dataset and the forecasting performance is assessed against LSTM and CNN-based forecasting models. Across different supply zones, the attention-based models outperforms the baselines quantitatively and qualitatively, with an Mean Absolute Error (MAE) of 0.105 0.06kWh and a Mean Absolute Percentage Error (MAPE) of 5.4% 2.8%, in comparison the second best model with a MAE of 21 0.10 0.06kWh and a MAPE of 5.6% 3%.

Authors: Adithya Ramachandran (Pattern Recognition Lab, Friedrich Alexander University, Erlangen); Andreas Maier (Pattern Recognition Lab, FAU Erlangen-Nuremberg); Siming Bayer (Pattern Recognition Lab, Friedrich-Alexander University)

Cities & Urban Planning Societal Adaptation & Resilience
NeurIPS 2024 AQ-PINNs: Attention-Enhanced Quantum Physics-Informed Neural Networks for Carbon-Efficient Climate Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: The growing computational demands of artificial intelligence (AI) in addressing climate change raise significant concerns about inefficiencies and environmental impact, as highlighted by Jevons' paradox. We propose an attention-enhanced quantum physics-informed neural network (AQ-PINN) to tackle these challenges. This approach integrates quantum computing techniques into physics-informed neural networks (PINNs) for climate modeling, aiming to enhance predictive accuracy in fluid dynamics governed by the Navier-Stokes equations while reducing the computational burden and carbon footprint. By harnessing variational quantum multi-head self-attention mechanisms, our AQ-PINN achieves a 51.51% reduction in model parameters compared to classical multi-head self-attention methods while maintaining comparable convergence and loss. It also employs quantum tensor networks to enhance representational capacity, which can lead to more efficient gradient computations and reduced susceptibility to barren plateaus. Our AQ-PINN represents a crucial step towards more sustainable and effective climate modeling solutions.

Authors: Siddhant Dutta (SVKM's Dwarkadas J. Sanghvi College of Engineering); Nouhaila Innan (New York University Abu Dhabi); Sadok Ben Yahia (Southern Denmark University); Muhammad Shafique (New York University Abu Dhabi)

Climate Science & Modeling Carbon Capture & Sequestration Hybrid Physical Models
NeurIPS 2024 Enhancing Sustainability in Liquid-Cooled Data Centers with Reinforcement Learning Control (Papers Track)
Abstract and authors: (click to expand)

Abstract: The growing energy demands of machine learning workloads require sustainable data centers with lower carbon footprints and reduced energy consumption. Supercomputing and many high-performance computing (HPC) data centers, which use liquid cooling for greater efficiency than traditional air cooling systems, can significantly benefit from advanced optimization techniques to control liquid cooling. We present RL-LC, a novel Reinforcement Learning (RL) based approach designed to enhance the efficiency of liquid cooling in these environments. RL-LC integrates a customizable analytical liquid cooling model suitable for simulations or digital twins of data centers, focusing on minimizing energy consumption and carbon emissions. Our method achieves an average reduction of approximately 4% compared to industry-standard ASHRAE guidelines, contributing to more sustainable data center management and offering valuable insights for reducing the environmental impact of HPC operations.

Authors: Avisek Naug (Hewlett Packard Enterprise); Antonio Guillen-Perez (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Ricardo Luna Gutierrez (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Sajad Mousavi (Hewlett Packard Enterprise); Paolo Faraboschi (Hewlett Packard Enterprise); Cullen Bash (HPE); Soumyendu Sarkar (Hewlett Packard Enterprise)

Power & Energy Reinforcement Learning
NeurIPS 2024 Classification of Snow Depth Measurements for tracking plant phenological shifts in Alpine regions (Papers Track)
Abstract and authors: (click to expand)

Abstract: Ground-based snow depth measurements are often realized using ultrasonic or laser technologies, which by their nature measure the height of any underlying object, whether it is snow or vegetation in snow-free periods. We propose a machine learning approach to the automated classification of snow depth measurements into a snow cover class and a class corresponding to everything else, which takes into account both the temporal context and the dependencies between snow depth and other sensor measurements. Through a series of experiments we demonstrate that our approach simplifies the detection of seasonal snowmelt and corresponding onset of plant growth, which we used to assess climate-change related phenological shifts in otherwise rather poorly monitored high alpine regions.

Authors: Jan Svoboda (WSL Institute for Snow and Avalanche Research SLF); Michael Zehnder (WSL Institute for Snow and Avalanche Research SLF); Marc Ruesch (WSL Institute for Snow and Avalanche Research SLF); David Liechti (WSL Institute for Snow and Avalanche Research SLF); Corinne Jones (Swiss Data Science Center); Michele Volpi (Swiss Data Science Center, ETH Zurich); Christian Rixen (WSL Institute for Snow and Avalanche Research SLF); Jürg Schweizer (WSL Institute for Snow and Avalanche Research SLF)

Climate Science & Modeling Earth Observation & Monitoring Ecosystems & Biodiversity Data Mining Time-series Analysis
NeurIPS 2024 Climate PAL: Climate Analysis through Conversational AI (Papers Track)
Abstract and authors: (click to expand)

Abstract: To support climate change research and its communication to the public, we propose Climate Projection and Analysis with Language models (Climate PAL). Our system allows users to retrieve and analyze climate projection data through conversational English. Using a crowdsourced evaluation dataset, we demonstrate that Climate PAL's retrieved data are more relevant to user queries, with over 20% higher accuracy than baselines on several key metrics.

Authors: Sonia Cromp (University of Wisconsin-Madison); Behrad Rabiei (University of California San Diego); Maxwell Elling (University of Colorado Boulder); Alexander Herron (National Aeronautics and Space Administration); Michael Hendrickson (National Aeronautics and Space Administration)

Climate Science & Modeling Generative Modeling
NeurIPS 2024 Adaptive Policy Regularization for Offline-to-Online Reinforcement Learning in HVAC Control (Papers Track)
Abstract and authors: (click to expand)

Abstract: Reinforcement learning (RL)-based control methods have been extensively studied to improve building heating, ventilation, and air conditioning (HVAC) efficiency. Data-driven approaches demonstrate better transferability and scalability, making them useful in real-world applications. Most prior works focus on online learning requiring simulators or models of environment dynamics. However, transferring thermal simulators between environments is inefficient. We build on recent works that employ offline training on static datasets from unknown policies. Pure offline RL is constrained by the replay buffer's distribution, we propose using offline-to-online RL to enhance pre-trained offline models through online adaptation to distribution shifts. We show that direct online fine-tuning deteriorates performance on offline policies. To address this, we propose automatically tuning the actor's regularization during training to optimize the exploration-exploitation tradeoff. Specifically, we leverage simple moving averages of mean Q-values sampled throughout training. Simulation experiments demonstrate our method outperforms state-of-the-art approaches under various conditions, improving performance by 32.9% and enhancing pre-trained models' capabilities online.

Authors: Hsin-Yu Liu (University of California San Diego); Bharathan Balaji (Amazon); Rajesh Gupta (UC San Diego); Dezhi Hong (Amazon)

Ecosystems & Biodiversity Climate Science & Modeling
NeurIPS 2024 Towards turbine-location-aware multi-decadal wind power predictions with CMIP6 (Papers Track)
Abstract and authors: (click to expand)

Abstract: With the increasing amount of renewable energy in the grid, long-term wind power forecasting for multiple decades becomes more critical. In these long-term forecasts, climate data is essential as it allows us to account for climate change. Yet the resolution of climate models is often very coarse. In this paper, we show that by including turbine locations when downscaling with Gaussian Processes, we can generate valuable aggregate wind power predictions despite the low resolution of the CMIP6 climate models. This work is a first step towards multi-decadal turbine-location-aware wind power forecasting using global climate model output.

Authors: Nina Effenberger (University of Tübingen); Nicole Ludwig (University of Tübingen)

Power & Energy Climate Science & Modeling Hybrid Physical Models
NeurIPS 2024 Equity-Aware Spatial-Temporal Workload Shifting for Sustainable AI Data Centers (Papers Track)
Abstract and authors: (click to expand)

Abstract: The escalated demand for hyperscale data centers due to generative AI, has intensified the operational load, leading to increased energy consumption, water usage, and carbon emissions. We propose EquiShift, a novel equitable spatial-temporal workload balancing algorithm that shifts workloads spatially and temporarily across geographically different data centers to minimize the overall energy costs while ensuring fair distribution of water and carbon footprints. Concretely, EquiShift introduces a model predictive control (MPC) framework to solve the equitable load balancing problem, leveraging the predictive capabilities of MPC to optimize load distribution in real-time. Finally, we present comparative evaluations against state-of-the-art load-balancing algorithms to demonstrate the performance of EquiShift which underscores the potential of equitable load balancing as a key strategy for enhancing the sustainability of data centers while achieving fairness in the face of growing computational demands.

Authors: Mohammad Islam (University of California Riverside); Shaolei Ren (UC Riverside)

Power & Energy
NeurIPS 2024 Climate Impact Assessment Requires Weighting: Introducing the Weighted Climate Dataset (Papers Track)
Abstract and authors: (click to expand)

Abstract: High-resolution gridded climate data are readily available from multiple sources, yet climate research and decision-making increasingly require country and region-specific climate information weighted by socio-economic factors. Moreover, the current landscape of disparate data sources and inconsistent weighting methodologies exacerbates the reproducibility crisis and undermines scientific integrity. To address these issues, we have developed a globally comprehensive dataset at both country (GADM0) and region (GADM1) levels, encompassing various climate indicators (precipitation, temperature, SPEI, wind gust). Our methodology involves weighting gridded climate data by population density, night-time light intensity, cropland area, and concurrent population count – all proxies for socio-economic activity – before aggregation. We process data from multiple sources, offering daily, monthly, and annual climate variables spanning from 1900 to 2023. A unified framework streamlines our preprocessing steps, and rigorous validation against leading climate impact studies ensures data reliability. The resulting Weighted Climate Dataset is publicly accessible through an online dashboard at https://weightedclimatedata.streamlit.app/.

Authors: Marco Gortan (University of Basel); Lorenzo Testa (Carnegie Mellon University); Giorgio Fagiolo (Sant'Anna School of Advanced Studies); Francesco Lamperti (Sant'Anna School of Advanced Studies)

Extreme Weather Public Policy Societal Adaptation & Resilience Uncertainty Quantification & Robustness
NeurIPS 2024 Meta-Learned Bayesian Optimization for Energy Yield in Inertial Confinement Fusion (Papers Track)
Abstract and authors: (click to expand)

Abstract: With the growing demand for clean energy, fusion presents a promising path to sustainable power generation. Inertial confinement fusion (ICF) experiments trigger nuclear reactions by firing lasers at a fuel target, typically composed of deuterium and tritium. These experiments are costly and require complex optimization of the laser pulse shape across multiple shots to maximize energy yield. Even though Bayesian Optimization (BO) has been commonly used to optimize such expensive scientific experiments, vanilla BO methods do not leverage prior knowledge of the function from simulations or past experiments and fail to achieve high sample efficiency. In this work, we adapted and explored BO meta-learning techniques for ICF that either meta-learn the BO surrogate model, the acquisition function, or both from simulations. Our results demonstrate that the three meta-learning techniques we investigated, Meta-Learning Acquisition Functions for BO (MetaBO), Rank-Weighted Gaussian Process Ensemble (RGPE), and Neural Acquisition Processes (NAP), drastically reduce the number of experiments needed to achieve a satisfactory yield in ICF simulations.

Authors: Vineet Gundecha (Hewlett Packard Enterpise); Ricardo Luna Gutierrez (Hewlett Packard Enterprise); Sahand Ghorbanpour (Hewlett Packard Enterprise); Rahman Ejaz (University of Rochester); Varchas Gopalaswamy (University of Rochester); Riccardo Betti (University of Rochester); Avisek Naug (Hewlett Packard Enterprise); Paolo Faraboschi (Hewlett Packard Enterprise); Soumyendu Sarkar (Hewlett Packard Enterprise)

Power & Energy Meta- and Transfer Learning
NeurIPS 2024 WildfireGPT: Tailored Large Language Model for Wildfire Analysis (Papers Track)
Abstract and authors: (click to expand)

Abstract: Recent advancement of large language models (LLMs) represents a transformational capability at the frontier of artificial intelligence. However, LLMs are generalized models, trained on extensive text corpus, and often struggle to provide context-specific information, particularly in areas requiring specialized knowledge, such as wildfire details within the broader context of climate change. For decision-makers focused on wildfire resilience and adaptation, it is crucial to obtain responses that are not only precise but also domain-specific. To that end, we developed WildfireGPT, a prototype LLM agent designed to transform user queries into actionable insights on wildfire risks. We enrich WildfireGPT by providing additional context, such as climate projections and scientific literature, to ensure its information is current, relevant, and scientifically accurate. This enables WildfireGPT to be an effective tool for delivering detailed, user-specific insights on wildfire risks to support a diverse set of end users, including but not limited to researchers and engineers, for making positive impact and decision making.

Authors: Yangxinyu Xie (University of Pennsylvania); Bowen Jiang (University of Pennsylvania); Tanwi Mallick (Argonne National Laboratory); Joshua Bergerson (Argonne National Laboratory); John Hutchison (Argonne National Laboratory); Duane Verner (Argonne National Laboratory); Jordan Branham (Argonne National Laboratory); M. Ross Alexander (Argonne National Laboratory); Robert Ross (Argonne National Laboratory); Yan Feng (Argonne National Laboratory); Leslie-Anne Levy (Argonne National Laboratory); Weijie Su (University of Pennsylvania); Camillo Jose Taylor (University of Pennsylvania)

Disaster Management and Relief Societal Adaptation & Resilience Recommender Systems
NeurIPS 2024 Methane SatMapper: Methane Detection from Satellite Imagery Using Hyperspectral Transformer (Papers Track)
Abstract and authors: (click to expand)

Abstract: Methane (CH4) plays a critical role in accelerating global climate change, and recent advancements using Sentinel-2 satellite imagery have demonstrated potential in detecting and quantifying significant methane emissions. However, existing approaches often rely on temporal analysis of shortwave-infrared spectra, assuming consistent ground conditions and prior knowledge of methane-free periods, which can lead to errors and limit scalability. To overcome these challenges, we present Methane SatMapper, an innovative end-to-end spectral transformer model specifically designed to accurately identify and quantify methane plumes. Our model introduces two novel modules: one that identifies potential methane emission sites by analyzing solar radiation absorption in the spectral domain and another that localizes and quantifies methane plumes without the need for temporal data. By utilizing all 12 spectral channels of Sentinel-2 imagery, our architecture effectively estimates ground terrain and detects methane emissions, providing enhanced robustness to variable ground conditions and increased computational efficiency by eliminating the need for historical time-series data. Primary evaluations confirm that Methane SatMapper delivers precise and reliable methane detection, addressing key limitations in scalability and temporal dependence.

Authors: Satish Kumar (University of California, Santa Barbara); ASM Iftekhar (Microsoft); Bowen Zhang (University Of California, Santa Barbara); Richard Sserunjogi (Makerere University); Mehan Jayasuriya (Mozilla Technology Foundation)

Earth Observation & Monitoring Climate Science & Modeling Causal & Bayesian Methods Computer Vision & Remote Sensing
NeurIPS 2024 A Low-Complexity Data-Driven Algorithm for Residential PV-Storage Energy Management (Papers Track)
Abstract and authors: (click to expand)

Abstract: This paper uses the principles of online convex learning to propose a momentum-optimized smart (MOS) controller for energy management of residential PV-storage systems. Using the self-consumption-maximization application and practical data, the method's performance is compared to classical rolling-horizon quadratic programming. Findings support online learning methods for residential applications given their low complexity and small computation, communication, and data footprint. Consequences include improved economics for residential PV-storage systems and mitigation of distribution systems' operational challenges associated with high PV penetration.

Authors: Mostafa Farrokhabadi (University of Calgary)

Power & Energy Time-series Analysis
NeurIPS 2024 Climate Adaptation with Reinforcement Learning: Experiments with Flooding and Transportation in Copenhagen (Papers Track)
Abstract and authors: (click to expand)

Abstract: Due to climate change the frequency and intensity of extreme rainfall events, which contribute to urban flooding, are expected to increase in many places. These floods can damage transport infrastructure and disrupt mobility, highlighting the need for cities to adapt to escalating risks. Reinforcement learning (RL) serves as a powerful tool for uncovering optimal adaptation strategies, determining how and where to deploy adaptation measures effectively, even under significant uncertainty. In this study, we leverage RL to identify the most effective timing and locations for implementing measures, aiming to reduce both direct and indirect impacts of flooding. Our framework integrates climate change projections of future rainfall events and floods, models city-wide motorized trips, and quantifies direct and indirect impacts on infrastructure and mobility. Preliminary results suggest that our RL-based approach can significantly enhance decision-making by prioritizing interventions in specific urban areas and identifying the optimal periods for their implementation. Our framework is publicly available: \url{https://github.com/MLSM-at-DTU/floods_transport_rl}.

Authors: Miguel Costa (Technical University of Denmark); Morten W. Petersen (Technical University of Denmark); Arthur Vandervoort (Technical University of Denmark); Martin Drews (Technical University of Denmark); Karyn Morrissey (Technical University of Denmark); Francisco Pereira (DTU)

Transportation Reinforcement Learning
NeurIPS 2024 Learning the Indicators of Energy Burden for Knowledge Informed Policy (Papers Track)
Abstract and authors: (click to expand)

Abstract: The United States is one of the largest energy consumers per capita, which puts an expectation on households to have adequate energy expenditures to keep up with modern society. This adds additional stress on low-income households that may need to limit energy use due to financial constraints. This paper investigates energy burden, the ratio of household energy bills to household income, within the United States West. Self-Organizing Maps, an unsupervised neural network, is used to learn the indicators attributed to energy burden to inform public policy. This is one of the first studies to consider environmental justice indicators, which include outdoor air quality metrics and health disparities as energy burden indicators. The results show significant (p<0.05) differences among high energy burden areas and those with no energy burden for the environmental justice indicators. Thus, beyond the socioeconomic hardships of marginalized communities, counties with high energy burden suffer from environmental and health hazards, which will be amplified under a changing climate.

Authors: Jasmine Garland (University of Colorado Boulder); Rajagopalan Balaji (University of Colorado, Boulder); Kyri Baker (University of Colorado, Boulder); Ben Livneh (University of Colorado, Boulder)

Public Policy Behavioral and Social Science Climate Justice Health Unsupervised & Semi-Supervised Learning
NeurIPS 2024 Paraformer: Parameterization of Sub-grid Scale Processes Using Transformers (Papers Track)
Abstract and authors: (click to expand)

Abstract: One of the major sources of uncertainty in the current generation of Global Climate Models (GCMs) is the simulation of sub-grid scale physical processes. Over the years, with significantly improved computational performance, a series of Deep Learning (DL) parameterization schemes have been developed and incorporated into GCMs. However, these schemes use classic architectures whereas the new attention mechanism is not widely investigated. We proposed a “memory-aware” transformer-based model on ClimSim, the largest-ever dataset for climate parameterization. Our results show that the attention mechanism successfully captures the complex non-linear dependencies of sub-grid scale variables and reduces the prediction error.

Authors: Shuochen Wang (Northeastern University); Nishant Yadav (Microsoft); Auroop Ganguly (Northeastern University)

Climate Science & Modeling
NeurIPS 2024 Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurate weather and climate modeling are critical for both scientific advancement and safeguarding communities against environmental risks. Traditional approaches rely heavily on Numerical Weather Prediction (NWP) models, which simulate energy and matter flow across Earth's systems. However, heavy computational requirements and low efficiency restrict the suitability of NWP, leading to a pressing need for enhanced modeling techniques. Neural network-based models have emerged as promising alternatives, leveraging data-driven approaches to forecast atmospheric variables. In this work, we focus on limited-area modeling and train our model specifically for localized region-level downstream tasks. As a case study, we consider the MENA region due to its unique climatic challenges, where accurate localized weather forecasting is crucial for managing water resources, agriculture and mitigating the impacts of extreme weather events. This targeted approach allows us to tailor the model's capabilities to the unique conditions of the region of interest. Our study aims to validate the effectiveness of integrating parameter-efficient fine-tuning (PEFT) methodologies, specifically Low-Rank Adaptation (LoRA) and its variants, to enhance forecast accuracy, as well as training speed, computational resource utilization, and memory efficiency in weather and climate modeling for specific regions. Our codebase and pre-trained models can be accessed at \url{https://github.com/akhtarvision/weather-regional}.

Authors: Muhammad Akhtar Munir (MBZUAI); Fahad Shahbaz (MBZUAI); Salman Khan (MBZUAI)

Climate Science & Modeling Earth Observation & Monitoring
NeurIPS 2024 DivShift: Exploring Domain-Specific Distribution Shift in Large-Scale, Volunteer-Collected Biodiversity Datasets (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change is negatively impacting the world's biodiversity. To build automated systems to monitor these negative biodiversity impacts, large-scale, volunteer-collected datasets like iNaturalist are built from community-identified, natural imagery. However, such volunteer-based data are opportunistic and lack a structured sampling strategy, resulting in geographic, temporal, observation quality, and socioeconomic, biases that stymie uptake of these models for downstream biodiversity monitoring tasks. Here we introduce DivShift North American West Coast (DivShift-NAWC), a curated dataset of almost 8 million iNaturalist plant images across the western coast of North America, for exploring the effects of these biases on deep learning model performance. We compare model performance across four known biases and observe that they indeed confound model performance. We suggest practical strategies for curating datasets to train deep learning models for monitoring climate change's impacts on the world's biodiversity.

Authors: Elena Sierra (Stanford University); Lauren Gillespie (Stanford University); Salim Soltani (University of Freiburg); Moisés Expósito-Alonso (University of California, Berkeley); Teja Kattenborn (University of Freiburg)

Ecosystems & Biodiversity Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2024 RAIN: Reinforcement Algorithms for Improving Numerical Weather and Climate Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: This study explores integrating reinforcement learning (RL) with idealised climate models to address key parameterisation challenges in climate science. Current climate models rely on complex mathematical parameterisations to represent sub-grid scale processes, which can introduce substantial uncertainties. RL offers capabilities to enhance these parameterisation schemes, including direct interaction, handling sparse or delayed feedback, continuous online learning, and long-term optimisation. We evaluate the performance of eight RL algorithms on two idealised environments: one for temperature bias correction, another for radiative-convective equilibrium (RCE) imitating real-world computational constraints. Results show different RL approaches excel in different climate scenarios with exploration algorithms performing better in bias correction, while exploitation algorithms proving more effective for RCE. These findings support the potential of RL-based parameterisation schemes to be integrated into global climate models, improving accuracy and efficiency in capturing complex climate dynamics. Overall, this work represents an important first step towards leveraging RL to enhance climate model accuracy, critical for improving climate understanding and predictions. Code accessible at https://github.com/p3jitnath/climate-rl.

Authors: Pritthijit Nath (University Of Cambridge); Henry Moss (Secondmind); Emily Shuckburgh (University Of Cambridge); Mark Webb (Met Office)

Climate Science & Modeling Hybrid Physical Models Reinforcement Learning
NeurIPS 2024 Exploring Physics-Informed Neural Networks for Crop Yield Loss Forecasting (Papers Track)
Abstract and authors: (click to expand)

Abstract: In response to climate change, assessing crop productivity under extreme weather conditions is essential to enhance food security. Crop simulation models, which align with physical processes, offer explainability but often perform poorly. Conversely, machine learning (ML) models for crop modeling are powerful and scalable yet operate as black boxes and lack adherence to crop growth’s physical principles. To bridge this gap, we propose a novel method that combines the strengths of both approaches by estimating the water use and the crop sensitivity to water scarcity at the pixel level. This approach enables yield loss estimation grounded in physical principles by sequentially solving the equation for crop yield response to water scarcity, using an enhanced loss function. Leveraging Sentinel-2 satellite imagery, climate data, simulated water use data, and pixel-level yield data, our model demonstrates high accuracy, achieving an $R^2$ of up to 0.77—matching or surpassing state-of-the-art models like RNNs and Transformers. Additionally, it provides interpretable and physical consistent outputs, supporting industry, policymakers, and farmers in adapting to extreme weather conditions.

Authors: Miro Miranda (University of Kaiserslautern-Landau); Marcela Charfuelan (German Research Center for Artificial Intelligence); Andreas Dengel (DFKI GmbH)

Agriculture & Food Hybrid Physical Models
NeurIPS 2024 Modeling Pollution Spread with Obstructions using Physics-Informed Neural Networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: Pollution modeling plays a crucial role in combating climate change and protecting public health. Scientists rely on traditional numerical methods, such as finite difference and computational fluid dynamics, for accurate pollution simulations; however, they are resource-intensive and time-consuming often taking days to execute. Physics-Informed Neural Networks (PINNs) present a promising alternative, capable of significantly improving speed while maintaining high accuracy. In our research, we developed a PINN to model pollution spread under laminar flow conditions in a two-dimensional environment with obstructions and reflecting boundaries. Our model integrates the Navier-Stokes and advection-diffusion partial differential equations (PDEs) and enforces zero-flux Neumann boundaries at obstruction surfaces and simulation edges. We used a hybrid learning approach, generating a dataset of 6.1 million colocation points for supervised learning and embedding physical laws into the loss function for unsupervised learning. In our experiments, the PINN was over 2520 times faster than the traditional numerical solver, returning results in under 2190 milliseconds and achieving a mean squared error below 3E-5. Our findings demonstrate that PINNs not only offer a drastic reduction in computational time but also scale favorably with both the time domain and spatial resolution, making them a viable solution for real-time pollution monitoring and emergency response planning. Future work will focus on extending our model to dynamic obstructions and arbitrary grid geometries.

Authors: Yash Ranjith (Westmont)

Climate Science & Modeling Hybrid Physical Models Unsupervised & Semi-Supervised Learning
NeurIPS 2024 Prediction of PM2.5 concentration based on ARIMA, LSTM, and TCN models in Kigali, Rwanda (Papers Track)
Abstract and authors: (click to expand)

Abstract: PM2.5 pollution is a major health concern, especially in areas lacking robust real-time monitoring and predictive capabilities. This study presents a comparative analysis of three forecasting models—Autoregressive Integrated Moving Average (ARIMA), Long Short-Term Memory (LSTM), and Temporal Convolutional Networks (TCN)—to predict PM2.5 concentrations in four regions of Kigali, Rwanda. Utilizing a dataset spanning from late 2020 to 2024, these models were trained on historical air quality data obtained from sensors. Our findings reveal that the LSTM model consistently outperforms both TCN and ARIMA models, delivering lower Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) in predicting PM2.5 levels. These results underscore the effectiveness of LSTM models in providing more accurate air quality forecasts in complex temporal environments. This research lays the groundwork for enhancing air quality monitoring and public health strategies in Rwanda.

Authors: Yamlak Bogale (Carnegie Mellon University Africa); Choukouriyah Arinloye (Carnegie Mellon University Africa); Joselyne Muragijemariya (Carnegie Mellon University Africa)

Time-series Analysis Climate Science & Modeling
NeurIPS 2024 Wildflower Monitoring with Expert-annotated Images and Flowering Phenology (Papers Track)
Abstract and authors: (click to expand)

Abstract: Understanding biodiversity trends is essential for preservation policy planning, and advanced computer vision solutions now enable large-scale automated monitoring for many biodiversity use cases. Wildflower monitoring, in particular, presents unique challenges. Visual similarities in shape and color may exist between different species, while flowers within a species may have significant visual differences. Moreover, flowers follow a growth cycle and look distinctly different over the year, while different species flower at different times of the year. Having access to flowering phenology, more accurate predictions may be made. We propose a novel multi-modal wildflower monitoring task to better identify species, levering both expert-annotated wildflower images and flowering phenology estimates. Moreover, we benchmark several state-of-the-art models using two groups of common wildflower species that have high inter-class similarity, and show that this multi-modal approach significantly outperforms image-only baselines. With this work, we aim to encourage the development of standards for automated wildflower monitoring as a step towards bending the curve of biodiversity loss. The data and the code are publicly available https://georgianagmanolache.github.io/wildflowerpower/

Authors: Georgiana Manolache (Fontys University of Applied Science); Gerard Schouten (Fontys University of Applied Sciences)

Ecosystems & Biodiversity Public Policy Computer Vision & Remote Sensing Meta- and Transfer Learning
NeurIPS 2024 Light-weight geospatial model for global deforestation attribution (Papers Track)
Abstract and authors: (click to expand)

Abstract: Forests are in decline worldwide and it is critical to attribute forest cover loss to its causes. We gathered a curated global dataset of all forest loss drivers and developed a neural network model to recognize the main drivers of deforestation or forest degradation at 1-km scale. Using remote sensing satellite data together with ancillary biophysical and socioeconomic data the model estimates the dominant drivers of forest loss from 2001 to 2022. Using a relatively light-weight geospatial model allowed us to to train a single world-wide model. We generated a global map of drivers of forest loss that is being validated, and present the first insights such data can provide.

Authors: Anton Raichuk (Google); Michelle Sims (WRI); Radost Stanimirova (WRI); Maxim Neumann (Google)

Earth Observation & Monitoring Climate Science & Modeling Ecosystems & Biodiversity Forests Computer Vision & Remote Sensing
NeurIPS 2024 HVAC-DPT: A Decision Pretrained Transformer for HVAC Control (Papers Track)
Abstract and authors: (click to expand)

Abstract: Building operations consume approximately 40% of global energy, with Heating, Ventilation, and Air Conditioning (HVAC) systems responsible for up to 50% of this consumption [1, 2]. As HVAC energy demands are expected to rise, optimising system efficiency is crucial for reducing future energy use and mitigating climate change [3]. Existing control strategies lack generalisation and require extensive training and data, limiting their rapid deployment across diverse buildings. This paper introduces HVAC-DPT, a Decision-Pretrained Transformer using in-context Reinforcement Learning (RL) for multi-zone HVAC control. HVAC-DPT frames HVAC control as a sequential prediction task, training a causal transformer on inter- action histories generated by diverse RL agents. This approach enables HVAC-DPT to refine its policy in-context, without modifying network parameters, allowing for deployment across different buildings without the need for additional training or data collection. HVAC-DPT reduces energy consumption in unseen buildings by 45% compared to the baseline controller, offering a scalable and effective approach to mitigating the increasing environmental impact of HVAC systems.

Authors: Anaïs Berkes (University of Cambridge)

Buildings Generative Modeling Meta- and Transfer Learning Reinforcement Learning
NeurIPS 2024 Regional Ocean Forecasting with Hierarchical Graph Neural Networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurate ocean forecasting systems are vital for understanding marine dynamics, which play a crucial role in environmental management and climate adaptation strategies. Traditional numerical solvers, while effective, are computationally expensive and time-consuming. Recent advancements in machine learning have revolutionized weather forecasting, offering fast and energy-efficient alternatives. Building on these advancements, we introduce SeaCast, a neural network designed for high-resolution, medium-range ocean forecasting. SeaCast employs a graph-based framework to effectively handle the complex geometry of ocean grids and integrates external forcing data tailored to the regional ocean context. Our approach is validated through experiments at a high spatial resolution using the operational numerical model of the Mediterranean Sea provided by the Copernicus Marine Service, along with both numerical and data-driven atmospheric forcings.

Authors: Daniel Holmberg (University of Helsinki); Emanuela Clementi (CMCC Foundation); Teemu Roos (University of Helsinki)

Oceans & Marine Systems Climate Science & Modeling
NeurIPS 2024 Parakeet: Emission Factor Recommendation for Carbon Footprinting with Generative AI (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurately quantifying greenhouse gas (GHG) emissions from products and business activities is crucial for organizations to measure their environmental impact and undertake mitigation actions. Life cycle assessment (LCA) is the scientific discipline for measuring GHG emissions associated with each stage of a product or activity, from raw material extraction to disposal. Measuring the emissions outside of a product owner's control is challenging, and practitioners rely on emission factors (EFs) – estimates of GHG emissions per unit of activity – to model and estimate indirect impacts. These EFs come from prior LCA studies and are collated into databases. The current practice of manually finding the appropriate EF to use from databases is time-consuming, error-prone, and requires domain expertise, hindering scalability and accuracy in emissions quantification. We present a novel AI-assisted method that leverages large language models to automatically recommend EFs. Our method parses business activity descriptions and recommends the appropriate EF with a human-interpretable justification. We benchmark our solution across multiple domains and find it achieves state-of-the-art performance in EF recommendation, with an average Precision@1 of 88.4%. By streamlining and automating the EF selection process, our AI-assisted method enables scalable and accurate quantification of GHG emissions, supporting organizations' sustainability initiatives and driving progress toward net-zero emissions targets across industries.

Authors: Bharathan Balaji (Amazon); Nina Domingo (Amazon); Abu Zaher Faridee (Amazon); Venkata Sai Gargeya Vunnava (amazon); Anran Wang (Amazon); Fahimeh Ebrahimi Meymand (Amazon); Kellen Axten (Amazon); Aravind Srinivasan (Amazon); Qingshi Tu (University of British Columbia); Harsh Gupta (Amazon); Shikha Gupta (Amazon); Soma Ramalingam (Amazon); Jeremie Hakian (Amazon); Jared Kramer (Amazon)

Climate Finance & Economics Natural Language Processing Recommender Systems
NeurIPS 2024 Towards CLEAN: Cycle Learning Energy-Emission Assessment Network (Papers Track)
Abstract and authors: (click to expand)

Abstract: Formulating effective climate policies is challenging due to the limitations of current decision-making models, such as Computable General Equilibrium models, which often rely on static, linear assumptions that oversimplify complex, real-world scenarios. To address these limitations, we introduce the Cycle Learning Energy-Emission Assessment Network, a scalable deep-learning framework trained on the Carbon Monitor Energy-Emission time series dataset. CLEAN captures global patterns between energy structures and sectoral carbon emissions, enabling accurate energy-emission predictions at national, regional, city, and sectoral levels. By framing the problem as a bidirectional supervised regression task, our model achieves a Mean Absolute Percentage Error of approximately 5%, significantly outperforming traditional models like XGBoost, especially with our novel data augmentation method, Densify. This demonstrates CLEAN’s superior performance and generalization capabilities, making it a powerful tool for climate policy analysis. The CMEE dataset, CLEAN model, and Densify augmentation methods are open-sourced at \url{https://anonymous.4open.science/r/CLEAN-D66A}.

Authors: Yanming Guo (University of Sydney); Jin Ma (University of Sydney); Kevin Credit (Maynooth University); Qian Xiao (Maynooth University)

NeurIPS 2024 Multi-Source Temporal Attention Network for Precipitation Nowcasting (Papers Track) Best Pathway to Impact
Abstract and authors: (click to expand)

Abstract: Precipitation nowcasting is crucial across various industries and plays a significant role in mitigating and adapting to climate change. We introduce an efficient deep learning model for precipitation nowcasting, capable of predicting rainfall up to 8 hours in advance with greater accuracy than existing operational physics-based and extrapolation-based models. Our model leverages multi-source meteorological data and physics-based forecasts to deliver high-resolution predictions in both time and space. It captures complex spatio-temporal dynamics through temporal attention networks and is optimized using data quality maps and dynamic thresholds. Experiments demonstrate that our model outperforms state-of-the-art, and highlight its potential for fast reliable responses to evolving weather conditions.

Authors: Rafael Pablos Sarabia (Aarhus University & Cordulus); Joachim Nyborg (Cordulus); Morten Birk (Cordulus); Jeppe Liborius Sjørup (Cordulus); Anders Lillevang Vesterholt (Cordulus); Ira Assent (Aarhus University)

Computer Vision & Remote Sensing Earth Observation & Monitoring Extreme Weather
NeurIPS 2024 Critical misalignments between climate action and sustainable development goals revealed (Papers Track)
Abstract and authors: (click to expand)

Abstract: A mere 12 percent of the Sustainable Development Goals (SDGs) is currently on track to meet the 2030 deadline in a world under climate change. Since their launch in 2015, the 2030 Agenda for Sustainable Development and the Paris Agreement have suffered persistent mismatches, which limit the potential for mutual gains. We use Artificial Intelligence (AI) to assess the degree and type of alignment between the Nationally Determined Contributions (NDCs) and the SDGs. While high income countries tackle the energy-infrastructure-community nexus in term of opportunity, lower income countries make climate impacts more explicit and center their trade-offs around the water-energy-food nexus. These two approaches mark different development trajectories and have non-negligible implications on international financial flow architecture and climate governance.

Authors: Francesca Larosa (Royal Institute for Technology); Sergio Hoyas (Universitat Politècnica de València); Fermin Mallor Franco (Royal Institute of Technology); J. Alberto Conejero (Universitat Politècnica de València); Javier García-Martinez (University of Alicante); Francesco Fuso Nerini (Royal Institute of Technology); Ricardo Vinuesa (KTH Royal Institute of Technology)

Natural Language Processing Climate Finance & Economics Societal Adaptation & Resilience Data Mining
NeurIPS 2024 Clustering-Based Framework for Assessing Transportation Resilience to Flood Events (Papers Track)
Abstract and authors: (click to expand)

Abstract: Flooding presents a significant threat to critical infrastructures (CIs), particularly in Spain, which is frequently cited as the most flood-affected country in Europe. The transportation sector, a crucial CI, is particularly susceptible to such events. It is imperative to monitor the corresponding resilience to improve disaster management efforts. With the advancements in data science and machine learning, various approaches have been developed in this area; however, researchers encounter challenges such as data inconsistency and reliability. This paper presents a resilience assessment framework that utilises machine learning and open data to evaluate the impact of floods on Spain's transportation network. The analysis aims to facilitate well-informed decision-making by stakeholders and government entities, thereby enhancing disaster preparedness and response.

Authors: Matheus Pedra (TECNUN); Leire Labaka (TECNUN); Josune Hernantes (TECNUN)

Transportation Cities & Urban Planning Disaster Management and Relief Extreme Weather Time-series Analysis
NeurIPS 2024 Multi-scale decomposition of sea surface height snapshots using machine learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Knowledge of ocean circulation is important for understanding and predicting weather and climate, and managing the blue economy. This circulation can be estimated through Sea Surface Height (SSH) observations, but requires decomposing the SSH into contributions from balanced and unbalanced motions (BMs and UBMs). This decomposition is particularly pertinent for the novel SWOT satellite, which measures SSH at an unprecedented spatial resolution. Specifically, the requirement, and the goal of this work, is to decompose instantaneous SSH into BMs and UBMs. While a few studies using deep learning (DL) approaches have shown promise in framing this decomposition as an image-to-image translation task, these models struggle to work well across a wide range of spatial scales and require extensive training data, which is scarce in this domain. These challenges are not unique to our task, and pervade many problems requiring multi-scale fidelity. We show that these challenges can be addressed by using zero-phase component analysis (ZCA) whitening and data augmentation; making this a viable option for SSH decomposition across scales.

Authors: Yue Wang (Columbia University); Jingwen Lyu (Columbia University); Chris Pedersen (NYU); Spencer Jones (Texas A&M University); Dhruv Balwada (Columbia University)

Oceans & Marine Systems Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2024 Safe Reinforcement Learning for Remote Microgrid Optimization with Industrial Constraints (Papers Track)
Abstract and authors: (click to expand)

Abstract: In remote microgrids, the power must be autonomously dispatched between fuel generators, renewable energy sources, and batteries, to fulfill the demand. These decisions must aim to reduce the consumption of fossil fuel and battery degradation while accounting for the complex dynamics of generators, and uncertainty in the demand and renewable production forecasts. Such an optimization could significantly reduce fuel consumption, potentially saving millions of liters of diesel per year. Traditional optimization techniques struggle with scaling in problem complexity and handling uncertainty. On the other hand, reinforcement learning algorithms often lacks the industry constraints guarantees needed for real-world deployment. In this project, we provide a realistic shielded microgrid environment designed to ensure safe control given real-world industry standards. Then, we train a deep reinforcement learning agents to control fuel generators and batteries to minimize the fuel consumption and battery degradation. Our agents outperform heuristics baselines and exhibit a Pareto frontier pattern.

Authors: Hadi Nekoei (Mila); Alexandre Blondin Massé (Hydro-Quebec); Rachid Hassani (Hydro-quebec); Sarath Chandar (Mila / École Polytechnique de Montréal); Vincent Mai (Hydro-Québec)

Power & Energy Reinforcement Learning
NeurIPS 2024 CanadaFire2023: Burned Area Mapping Datasets and Benchmarks for Canadian Wildfires in 2023 (Papers Track)
Abstract and authors: (click to expand)

Abstract: In 2023, wildfires burned record-breaking areas in Canada, resulting in significant carbon loss, exacerbating climate change, and underscoring the need for relevant datasets and machine learning methods for effective and efficient analysis. To understand the fire development processes and assess the climate impact of this natural disaster, burned area mapping datasets are essential for generating high-quality burned scar maps, enabling a comprehensive analysis of the 2023 wildfires, particularly given the vast expanse of Canada. To this end, we propose the CanadaFire2023 dataset, which includes burned area mapping data collected from multiple satellite platforms, namely, Landsat-8, Landsat-9, and Sentinel-2, specifically focused on these wildfires in the recorded history of Canada. To our knowledge, this is the first dataset specifically focused on burned area detection related to the unprecedented 2023 Canadian wildfires, using individual satellite imagery. We also trained four deep learning models—FCN, U-Net, multiscale ResNet, and SegFormer—for burned area mapping and evaluated the mapping performance using binary segmentation metrics, demonstrating that these datasets can serve as benchmarks for the research community studying wildfires and their environmental consequences. The CanadaFire2023 dataset could facilitate downstream applications such as disaster management, carbon emission estimation, and climate change mitigation.

Authors: Zilong Zhong (McMaster University); Alemu Gonsamo (McMaster University)

Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2024 TAUDiff: Improving statistical downscaling for extreme-event simulation using generative diffusion models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Deterministic regression-based downscaling models for climate variables often suffer from spectral bias, which can be mitigated by generative models like diffusion models. To enable efficient and reliable simulation of extreme weather events, it is crucial to achieve rapid turnaround, dynamical consistency, and accurate spatio-temporal spectral recovery. We propose an efficient correction diffusion model TAUDiff that combines a deterministic spatio-temporal model for mean field downscaling with a smaller generative diffusion model for recovering the fine-scale stochastic features. This approach can not only ensure quicker simulation of extreme events but also reduce overall carbon footprint due to low inference times.

Authors: Rahul Sundar (Verisk, India); Nishant Parashar (Verisk); Antoine Blanchard (Verisk); Boyko Dodov (Verisk)

Generative Modeling Extreme Weather
NeurIPS 2024 Emulating the Global Change Analysis Model with Deep Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: The Global Change Analysis Model (GCAM) simulates complex interactions between the coupled Earth and human systems, providing valuable insights into the co-evolution of land, water, and energy sectors under different future scenarios. Understanding the sensitivities and drivers of this multisectoral system can lead to more robust understanding of the different pathways to particular outcomes. The interactions and complexity of the coupled human-Earth systems make GCAM simulations costly to run at scale - a requirement for large ensemble experiments which explore uncertainty in model parameters and outputs. A differentiable emulator with similar predictive power, but greater efficiency, could provide novel scenario discovery and analysis of GCAM and its outputs, requiring fewer runs of GCAM. As a first use case, we train a deep learning model on an existing large ensemble that explores a range of GCAM inputs related to different relative contributions of energy production sources, with a focus on wind and solar. We complement this existing ensemble with interpolated input values and a wider selection of outputs, predicting 22,528 GCAM outputs across time, sectors, and regions. We report a median R^2 score of 0.998 for the emulator's predictions and an R^2 score of 0.812 for its input-output sensitivity.

Authors: Andrew Holmes (Western Washington University); Matt Jensen (Pacific Northwest National Laboratory); Sarah Coffland (Western Washington University); Hidemi Mitani-Shen (Western Washington University); Logan Sizemore (Western Washington University); Seth Bassetti (Utah State University); Brenna Nieva (Western Washington University); Claudia Tebaldi (Joint Global Change Research Institute); Abigail Snyder (Joint Global Change Research Institute); Brian Hutchinson (Western Washington University)

Climate Science & Modeling Forests Power & Energy
NeurIPS 2024 Sliced-Wasserstein-based Anomaly Detection and Open Dataset for Localized Critical Peak Rebates (Papers Track)
Abstract and authors: (click to expand)

Abstract: In this work, we present a new unsupervised anomaly (outlier) detection (AD) method using the sliced-Wasserstein metric. This filtering technique is conceptually interesting for MLOps pipelines deploying machine learning models in critical sectors, e.g., energy, as it offers a conservative data selection. Additionally, we open the first dataset showcasing localized critical peak rebate demand response in a northern climate. We demonstrate the capabilities of our method on synthetic datasets as well as standard AD datasets and use it in the making of a first benchmark for our open-source localized critical peak rebate dataset.

Authors: Julien Pallage (Polytechnique Montréal, Mila, GERAD); Bertrand Scherrer (Hydro-Québec); Salma Naccache (Hydro-Québec); Christophe Bélanger (Hydro-Québec); Antoine Lesage-Landry (Polytechnique Montréal & GERAD)

Power & Energy Buildings Unsupervised & Semi-Supervised Learning
NeurIPS 2024 Scalable and interpretable deforestation detection in the Amazon rainforest (Papers Track)
Abstract and authors: (click to expand)

Abstract: Deforestation of the Amazon rainforest is a major contributor to climate change, as it is a crucial precipitation regulator, as well as a large natural carbon reserve. While there have been efforts to create real-time algorithms for deforestation detection, they are oftentimes not accurate or interpretable. We leverage multiple input signals, such as satellite imagery, time-series of deforestation indices and scalar measures, to create a single deep learning model that is both interpretable and accurate. We employ a novel dataset with millions of annotated images of the Brazilian Amazon to train our model, as well as class activation mappings to investigate the added value of interpretability in this context.

Authors: Rodrigo Schuller (IMPA); Francisco Ganacim (IMPA); Paulo Orenstein (IMPA)

Interpretable ML Earth Observation & Monitoring Ecosystems & Biodiversity Computer Vision & Remote Sensing
NeurIPS 2024 Operational Wind Speed Forecasts for Chile's Electric Grid Using A Hybrid Machine Learning Model (Papers Track)
Abstract and authors: (click to expand)

Abstract: As Chile's electric power sector advances toward a future powered by renewable energy, accurate forecasting of renewable generation is essential for managing grid operations. The integration of renewable energy sources is particularly challenging due to the operational difficulties of managing their power generation, which is highly variable compared to fossil fuel sources, delaying the availability of clean energy. To mitigate this, we quantify the impact of increasing intermittent generation from wind and solar on thermal power plants in Chile and introduce a hybrid wind speed forecasting methodology which combines two custom ML models for Chile. The first model is based on TiDE, an MLP-based ML model for short-term forecasts, and the second is based on a graph neural network, GraphCast, for medium-term forecasts up to 10 days. Our hybrid approach outperforms the most accurate operational deterministic systems by 4-21% for short-term forecasts and 5-23% for medium-term forecasts and can directly lower the impact of wind generation on thermal ramping, curtailment, and system-level emissions in Chile.

Authors: Dhruv Suri (Stanford University); Praneet Dutta (Google); Flora Xue (Google DeepMind); Ines Azevedo (Stanford University); Ravi Jain (X (formerly Google X))

Climate Science & Modeling Time-series Analysis
NeurIPS 2024 Predicting NOx emissions in Biochar Production Plants using Machine Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: The global Biochar Industry has witnessed a surge in biochar production, with a total of 350k mt/year production in 2023. With the pressing climate goals set and the potential of Biochar Carbon Removal (BCR) as a climate-relevant technology, scaling up the number of new plants to over 1000 facilities per year by 2030 becomes imperative. However, such a massive scale-up presents not only technical challenges but also control and regulation issues, ensuring maximal output of plants while conforming to regulatory requirements. In this paper, we present a novel method of optimizing the process of a biochar plant based on machine learning methods. We show how a standard Random Forest Regressor can be used to model the states of the pyrolysis machine, the physics of which remains highly complex. This model then serves as a surrogate of the machine -- reproducing several key outcomes of the machine -- in a numerical optimization. This, in turn, could enable us to reduce NOx emissions -- a key regulatory goal in that industry -- while achieving maximal output still. In a preliminary test our approach shows remarkable results, proves to be applicable on two different machines from different manufacturers, and can be implemented on standard Internet of Things (IoT) devices more generally.

Authors: Marius Köppel (Johannes Gutenberg University Mainz); Tobias Schweitzer (AIRA Holding GmbH); Jochen Weber (None); Erdem Yilmaz (PYREG GmbH); Niklas Witzig (Johannes Gutenberg University Mainz); Tim Klausmann (Johannes Gutenberg University Mainz); Mattia Cerrato (JGU Mainz); Juan Chimbo (ARTi); Bernardo del Campo (ARTi); Lissete Davila (ARTi); David Barreno (ARTi)

Carbon Capture & Sequestration
NeurIPS 2024 Spatio-Temporal Machine Learning Models for Emulation of Global Atmospheric Composition (Papers Track)
Abstract and authors: (click to expand)

Abstract: Interactive atmospheric composition simulations are among the most computationally expensive components in Earth System Models (ESMs) due to the need to transport a large number of gaseous and aerosol tracers at every model time step. This poses a significant limitation for higher-resolution transient climate simulations with current computational resources. In ESMs such as NASA GISS-E2.1 (hereafter referred to as ModelE), pre-computed monthly-averaged atmospheric composition concentrations are often used to reduce computational expenses. This approach is referred to as Non-Interactive Tracers (NINT). In this study, we extend the NINT version of the ModelE using machine learning to emulate the effects of interactive emissions on climate forcing. We use data from a fully interactive composition climate model with surface-driven emissions to develop an ML-based NINT climate model. This version accounts for instantaneous atmospheric conditions, enabling the tracers to respond dynamically to meteorology without the need for explicit calculation of tracer transport. This approach could be applied to any aerosol species and integrated into ESMs to simulate aerosol concentrations interactively. The proposed framework emulates the advection term at the surface pressure level, with a focus on predicting surface-level concentrations of Black Carbon (BC) from biomass burning, which is a contributor to elevated levels of PM2.5 concentrations. Two consecutive years of ModelE simulated data were used as training data. To capture both temporal and spatial dependencies, a Convolutional Long Short-Term Memory (ConvLSTM) model was used. Results show the ConvLSTM achieved an average R-squared of 0.85 (STD = 0.08) on the test set. In contrast, using monthly-averaged atmospheric composition concentrations resulted in an average R-squared of 0.42 (STD = 0.73) for the same period.

Authors: Mohammad Erfani (Columbia University); Kara Lamb (Columbia University); Susanne Bauer (NASA Goddard Institute for Space Studies); Kostas Tsigaridis (Columbia University); Marcus van Lier-Walqui (Columbia University); Gavin Schmidt (NASA Goddard Institute for Space Studies)

Climate Science & Modeling Time-series Analysis
NeurIPS 2024 Probabilistic representation learning of subseasonal to seasonal ocean dynamics (Papers Track)
Abstract and authors: (click to expand)

Abstract: Data-driven weather and climate forecasts on subseasonal-to-seasonal (S2S) scales face a two-fold challenge: the observational records are comparatively short and predictability strongly depends on the coupling of earth systems evolving on very different timescales. As a first step towards general subseasonal-to-seasonal predictions, we model the El Niño Southern Oscillation (ENSO), one of the principal sources of predictability on S2S scales. Characterised by anomalously warm sea surface temperatures in the tropical Pacific, an El Niño event arises from a combination of slow heat transfer within the Pacific and between ocean basins, as well as fast atmospheric dynamics, such as westerly wind bursts and convection. Here, we design a deep learning model that can flexibly represent information from modalities on different timescales, trained as a Masked Autoencoder and optimising the Empirical Continuous Ranked Probability Score. We find that the representation learning approach exhibits zero-shot performance that is competitive with task-specific models on S2S of ENSO in terms of correlation skill. To the best of our knowledge, we are the first to predict well-calibrated uncertainty estimates on a 24 month horizon.

Authors: Jannik Thuemmel (University of Tuebingen); Jakob Schlör (ECMWF); Florian Ebmeier (University of Tuebingen); Bedartha Goswami (University of Tübingen)

Unsupervised & Semi-Supervised Learning Climate Science & Modeling
NeurIPS 2024 Estimating atmospheric variables from Digital Typhoon Satellite Images via Conditional Denoising Diffusion Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: This study explores the application of diffusion models in the field of typhoons, predicting multiple ERA5 meteorological variables simultaneously from Digital Typhoon satellite images. The focus of this study is taken to be Taiwan, an area very vulnerable to typhoons. By comparing the performance of Conditional Denoising Diffusion Probability Model (CDDPM) with Convolutional Neural Networks (CNN) and Squeeze-and-Excitation Networks (SENet), results suggest that the CDDPM performs best in generating accurate and realistic meteorological data. Specifically, CDDPM achieved a PSNR of 32.807, which is approximately 7.9% higher than CNN and 5.5% higher than SENet. Furthermore, CDDPM recorded an RMSE of 0.032, showing a 11.1% improvement over CNN and 8.6% improvement over SENet. A key application of this research can be for imputation purposes in missing meteorological datasets and generate additional high-quality meteorological data using satellite images. It is hoped that the results of this analysis will enable more robust and detailed forecasting, reducing the impact of severe weather events on vulnerable regions

Authors: Zhangyue Ling (Imperial College London); Pritthijit Nath (University Of Cambridge); Cesar Quilodran-Casas (Imperial College London)

Computer Vision & Remote Sensing Extreme Weather Generative Modeling
NeurIPS 2024 RL for Mitigating Cascading Failures: Targeted Exploration via Sensitivity Factors (Papers Track)
Abstract and authors: (click to expand)

Abstract: Electricity grid's resiliency and climate change strongly impact one another due to an array of technical and policy-related decisions that impact both. This paper introduces a physics-informed machine learning-based framework to enhance grid's resiliency. Specifically, when encountering disruptive events, this paper designs remedial control actions to prevent blackouts. The proposed~\textbf{P}hysics-\textbf{G}uided \textbf{R}einforcement \textbf{L}earning (PG-RL) framework determines effective real-time remedial line-switching actions, considering their impact on power balance, system security, and grid reliability. To identify an effective blackout mitigation policy, PG-RL leverages power-flow sensitivity factors to guide the RL exploration during agent training. Comprehensive evaluations using the Grid2Op platform demonstrate that incorporating physical signals into RL significantly improves resource utilization within electric grids and achieves better blackout mitigation policies -- both of which are critical in addressing climate change.

Authors: Anmol Dwivedi (RPI); Ali Tajer (RPI); Santiago Paternain (Rensselaer Polytechnic Institute); Nurali Virani (GE Research)

Power & Energy Hybrid Physical Models Reinforcement Learning
NeurIPS 2024 ATLAS: A spend classification benchmark for estimating scope 3 carbon emissions (Papers Track)
Abstract and authors: (click to expand)

Abstract: The majority (70%) of companies reporting their value chain emissions rely on financial spend ledger and emissions factors per dollar. Accurate classification of expenditures to emissions factors is critical but complex, given the sheer number of line items and the diversity of how they are categorized and described. This is an area where Large Language Models (LLMs) can play a key role. However, there is currently no benchmark dataset to evaluate the performance of LLM-based solutions. Here, we introduce the Aggregate Transaction Ledgers for Accounting Sustainability dataset or, ATLAS, and the initial evaluation results of four models using ATLAS. ATLAS is the first spend classification benchmark and is comprised of 10,000 synthetic, labeled spend items reflecting the distribution of corporate expenditures. We evaluate four baseline models, with the best model achieving a top-1 accuracy of 57.3% and a top-3 accuracy of 72.2%. ATLAS enables systematic evaluation of LLMs for spend classification. Our results provide a starting point for advancing automated carbon accounting and sustainability reporting for spend- based emissions.

Authors: Andrew Dumit (Watershed Technology, Inc.); Krishna Rao (Watershed Technology, Inc.); Travis Kwee (Watershed Technology, Inc.); Varsha Gopalakrishnan (Watershed Technology Inc.); Katherine Tsai (Watershed Technology, Inc.); Sangwon Suh (Watershed Technology, Inc.)

Natural Language Processing
NeurIPS 2024 Hierarchical Classification for Automated Image Annotation of Coral Reef Benthic Structures (Papers Track)
Abstract and authors: (click to expand)

Abstract: Automated benthic image annotation is crucial to efficiently monitor and protect coral reefs against climate change. Current machine learning approaches fail to capture the hierarchical nature of benthic organisms covering reef substrata, i.e., coral taxonomic levels and health condition. To address this limitation, we propose to annotate benthic images using hierarchical classification. Experiments on a custom dataset from a Northeast Brazilian coral reef show that our approach outperforms flat classifiers, improving both F1 and hierarchical F1 scores by approximately 2\% across varying amounts of training data. In addition, this hierarchical method aligns more closely with ecological objectives.

Authors: Celia Blondin (IRD); Joris Guerin (IRD, Univ. Montpellier); Laure Berti-Equille (IRD); Guilherme Ortigara Longo (Universidade Federal do Rio Grande do Norte); Kelly Inagaki (Universidade Federal do Rio Grande do Norte)

Oceans & Marine Systems
NeurIPS 2024 Optimizing NMR Spectroscopy Pulse Sequencing with Reinforcement Learning for Soil Atomic Abundance (Papers Track)
Abstract and authors: (click to expand)

Abstract: Determining the amount of sequestered carbon in soils and monitoring soil health in farmlands is an important climate change problem. Motivated by the lack of scalable and inexpensive techniques for in-situ soil health monitoring, we focus on low-voltage nuclear magnetic resonance (NMR) spectroscopy as a promising new approach and develop a reinforcement learning technique to modulate NMR pulses for rapid atomic abundance assessment of soils. Our preliminary results derived using Monte Carlo sampling and parallelized OpenAI Gym training show the promise of our RL-based approach.

Authors: Rohan Shenoy (UC Berkeley); Hans Gaensbauer (MIT); Elsa Olivetti (MIT); Evan Coleman (MIT)

Agriculture & Food Carbon Capture & Sequestration Reinforcement Learning
NeurIPS 2024 Multi-branch Spatio-Temporal Graph Neural Network For Efficient Ice Layer Thickness Prediction (Papers Track) Overall Best Paper
Abstract and authors: (click to expand)

Abstract: Understanding spatio-temporal patterns in polar ice layers is essential for tracking changes in ice sheet balance and assessing ice dynamics. While convolutional neural networks are widely used in learning ice layer patterns from raw echogram images captured by airborne snow radar sensors, noise in the echogram images prevents researchers from getting high-quality results. Instead, we focus on geometric deep learning using graph neural networks, aiming to build a spatio-temporal graph neural network that learns from thickness information of the top ice layers and predicts for deeper layers. In this paper, we developed a novel multi-branch spatio-temporal graph neural network that used the GraphSAGE framework for spatio features learning and a temporal convolution operation to capture temporal changes, enabling different branches of the network to be more specialized and focusing on a single learning task. We found that our proposed multi-branch network can consistently outperform the current fused spatio-temporal graph neural network in both accuracy and efficiency.

Authors: Zesheng Liu (Lehigh University); Maryam Rahnemoonfar (Lehigh University)

Computer Vision & Remote Sensing Climate Science & Modeling Earth Observation & Monitoring
NeurIPS 2024 Resolution-Agnostic Transformer-based Climate Downscaling (Papers Track)
Abstract and authors: (click to expand)

Abstract: Understanding future weather changes at regional and local scales is crucial for planning and decision-making, particularly in the context of extreme weather events, as well as for broader applications in agriculture, insurance, and infrastructure development. However, the computational cost of downscaling Global Climate Models (GCMs) to the fine resolutions needed for such applications presents a significant barrier. Drawing on advancements in weather forecasting models, this study introduces a cost-efficient downscaling method using a pretrained Earth Vision Transformer (Earth ViT) model. Initially trained on ERA5 data to downscale from 50 km to 25 km resolution, the model is then tested on the higher resolution BARRA-SY dataset at 3 km resolutions. Remarkably, it performs well without additional training, demonstrating its ability to generalize across different resolutions. This approach holds promise for generating large ensembles of regional climate simulations by downscaling GCMs with varying input resolutions without incurring additional training costs. Ultimately, this method could provide more comprehensive estimates of potential future changes in key climate variables, aiding in effective planning for extreme weather events and climate change adaptation strategies.

Authors: Declan Curran (UNSW); Hira Saleem (UNSW); Sanaa Hobeichi (The University of New South Wales); Flora Salim (University of New South Wales)

Climate Science & Modeling Computer Vision & Remote Sensing
NeurIPS 2024 Scalable Satellite Imagery Analysis: A Cascade Framework for Sparse Target Detection (Papers Track)
Abstract and authors: (click to expand)

Abstract: Remote sensing is a crucial tool for monitoring events affecting climate change, such as tracking forest loss, identifying pollution sources, and monitoring the deployment of renewable energy infrastructure. However, applying state-of-the-art deep learning models to monitor the entire Earth is expensive. In this paper, we propose a cascade framework to reduce this cost: we apply a small MLP on precomputed embeddings of each image patch to serve as a preliminary filter, identifying key patches that warrant further examination by more resource-intensive deep models. Our approach reduces per-task inference runtime by 5x with a <1% impact on accuracy. By reducing inference cost, our method enables nonprofits and other organizations with limited resources to scale monitoring efforts to more environmental and conservation applications.

Authors: Arvind Manivannan (University of Washington); Tarun Narayanan Venkatachalam (Allen Institute for AI); Yanlin Huang (University of Washington); Favyen Bastani (Allen Institute for AI)

Computer Vision & Remote Sensing Earth Observation & Monitoring
NeurIPS 2024 Estimating Greenland Ice Sheet Dynamics Using Neural Operators (Papers Track)
Abstract and authors: (click to expand)

Abstract: This study investigates the application of neural operators, particularly the Fourier Neural Operator (FNO), to enhance the modeling accuracy of the Greenland ice sheet's internal layers. Traditional computational methods often struggle to capture the complex spatial patterns of ice sheets adequately. Our approach leverages a radar image dataset to evaluate the effectiveness of neural operators in comparison to traditional neural network models. The FNO outperformed other models in key performance metrics, such as pixel error and layer accuracy. The results highlight the potential of neural operators to significantly improve the computational efficiency and accuracy of environmental models, paving the way for their broader application in cryospheric science and climate change research.

Authors: Maryam Rahnemoonfar (Lehigh University); Heling Wang (Lehigh University)

Climate Science & Modeling Computer Vision & Remote Sensing
NeurIPS 2024 Enhanced Detection of Human-Driven Forest Alterations using Echo State Networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: Forest monitoring is crucial for understanding ecosystem dynamics, detecting changes, and implementing effective conservation strategies. In this work, we propose a novel approach for automated detection of human-induced changes in woodlands using Echo State Networks (ESNs) and satellite imagery. The utilization of ESNs offers a promising solution for analyzing time-series data and identifying deviations indicative of forest alterations, particularly those caused by human activities such as deforestation and logging. The proposed experimental setup leverages satellite imagery to capture temporal variations in the Normalized Difference Vegetation Index (NDVI), and involves the training and evaluation of ESN models using extensive datasets from Chile's central region, encompassing diverse woodland environments and human-induced disturbances. Our initial experiments demonstrate the effectiveness of ESNs in predicting NDVI values and detecting deviations indicative of human-related changes in woodlands, even in the presence of climate-induced changes like drought and browning. Our work contributes to forest monitoring by offering a scalable and efficient solution for automated change detection in woodland environments. Integrating ESNs with satellite imagery analysis provides valuable insights into human impacts on forest ecosystems, facilitating informed decision-making for sustainable land management and biodiversity conservation.

Authors: Tomás Couso (PUC); Paula Aguirre (PUC); Rodrigo Carrasco (PUC); Javier Lopatin (UAI)

Time-series Analysis
NeurIPS 2024 Optimizing Carbon Emissions and Cost Reductions for Household Energy Demand Using Machine Learning (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Residential buildings account for 17-20% of global greenhouse gas (GHG) emissions, posing a significant challenge in transitioning all buildings to net zero. While heat pumps and building retrofits can significantly reduce emissions, homeowners are often unaware of their energy usage and the potential for cost reduction. Supervised machine learning methods have the potential to provide actionable insights for individual energy end users, enabling them to reduce both GHG emissions and energy costs. We propose a novel framework that utilizes baseline, intervention, and optimization models to predict emissions and cost estimates for individual energy end users. This paper presents a novel application of an optimization model for energy bills through machine learning methods: (1) classification of time series data for electricity and gas usage baselines, (2) prediction of GHG emission reductions, and (3) prediction of energy cost reductions. This study suggests energy retrofit policy implications using machine learning as an enabling technology for empowering decision-makers and end-users to tackle climate change.

Authors: Shivani Chotalia (Clean AI Initiative); Kyungmin Lee (University of Delaware); Soazig Kaam (N/A); Victor Hutse (Radix); David Quispe (University of Toronto)

Buildings Time-series Analysis
NeurIPS 2024 Machine Learning Models for Predicting Solar Power Potential and Energy Efficiency in some underserved localities of the Congo Basin Region (Proposals Track)
Abstract and authors: (click to expand)

Abstract: This research proposal aims to develop machine learning models—specifically Linear Regression, Long Short-Term Memory, and Convolutional Neural Networks—to accurately predict solar power potential and energy efficiency in selected underserved localities within the Congo Basin. The Congo Basin, recognized for its ecological significance as the world's second-largest tropical rainforest, faces severe energy access challenges, especially in rural communities that rely on traditional biomass for heating and cooking. This dependency intensifies deforestation and contributes to climate change through increased greenhouse gas emissions. Despite substantial solar energy potential of the region, access to clean and renewable energy sources remains limited. The study will compare the models based on accuracy, reliability, training times, and memory usage, generating actionable insights for development agencies and local stakeholders. By enabling informed, data-driven decisions regarding sustainable energy solutions, this work intends to facilitate a transition from traditional biomass to renewable energy sources, ultimately contributing to both environmental conservation and improved quality of life for local populations.

Authors: Jean de Dieu NGUIMFACK NDONGMO (The University of Bamenda); Adelaide Nicole KENGNOU TELEM (University of Buea); Reeves MELI FOKENG (The University of Bamenda)

Power & Energy
NeurIPS 2024 A Hybrid Machine Learning Model For Ship Speed ThroughWater: Solve And Predict (Proposals Track)
Abstract and authors: (click to expand)

Abstract: This research proposes a hybrid model for predicting ship speed through water, addressing challenges in estimating GHG emissions from shipping while contributing to climate change mitigation. Predicting ship speed through water is a key element in calculating GHG emissions. However, few models address this prediction in a way that integrates both physical principles and machine learning. Our approach combines physical modeling with data-driven techniques to predict real ship speed through water in two key steps: "Solve" and "Predict". In the first step “Solve”, a differential equation is resolved to estimate speed through calm water. "Predict" step uses a machine learning model that incorporates maritime and meteorological conditions and historical data to improve speed predictions in real-world conditions. This hybrid approach leverages both physics-based knowledge and machine learning models to provide a more comprehensive solution for accurately predicting ship speed through water.

Authors: Zakarya ELMIMOUNI (ENSAE); Ayoub Atanane (UQAR); Loubna Benabbou (UQAR)

Oceans & Marine Systems Hybrid Physical Models
NeurIPS 2024 Enhanced Cruise Control for Reducing Automobile Environmental Impact (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Automobiles are one of the main contributors to climate change. We propose an enhanced Advanced Cruise Control (ACC) system that leverages Reinforcement Learning (RL) using location and terrain data along with onboard sensors and cameras to provide better driving characteristics and improve fuel economy.

Authors: Shubhankar Gahlot (University of Canterbury NZ)

Transportation Reinforcement Learning
NeurIPS 2024 Exploring Climate Awareness and Anxiety in Teens: An Expert-Driven AI Perspective (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Climate awareness and climate anxiety often go hand in hand.The growing awareness of climate change among young people is increasingly shadowed by climate anxiety, a condition marked by profound stress, fear, and a sense of helplessness stemming from overwhelming information on environmental crises and perceived inaction by authorities. Our proposal is an innovative approach using Large Language Model (LLM)-based chatbots to support children and adolescents in fostering climate awareness, managing climate anxiety, and promoting sustainable practices. By collaborating with schools and engaging a multidisciplinary experts and the young people themselves, we seek to co-create an impactful educational intervention. Starting with a comprehensive survey of school counsellors to map the current state of climate awareness and anxiety, and to understand their expectations for our AI solution, this project is poised for global scalability, addressing the pressing mental health challenges associated with climate change, particularly in vulnerable and resource-constrained regions.

Authors: Sruthi Viswanathan (University of Oxford and The Spaceship Academy); Omar Mohammed (Independant Researcher); Craig Vezina (The Spaceship Academy); Mia Doces (Commitee For Children)

Societal Adaptation & Resilience Health Interpretable ML
NeurIPS 2024 Generating Climate Dataset in a Data-scarce Region of Choke Mountain Watersheds in Ethiopia Using Machine Learning Techniques (Proposals Track)
Abstract and authors: (click to expand)

Abstract: In regions where climate data is scarce, adapting to climate change becomes a significant challenge due to the lack of reliable information. This project addresses this issue by using Artificial Intelligence (AI) techniques to generate comprehensive climate datasets in a data-scarce region of Choke Mountain Watersheds in Ethiopia. The primary objectives are to fill gaps in existing in-situ precipitation and temperature observations and to create data for areas that are currently unmonitored. By applying advanced machine learning algorithms, we will improve the accuracy and reliability of climate data, and fill gaps in current datasets to ensure completeness. Ensuring the availability of a continuous dataset is crucial for informed decision-making in climate change adaptation.

Authors: Sintayehu Abebe (Debre Markos University); Kassahun Tadesse (Debre Markos University); Mulu Kerebih (Debre Markos University); Bekalu Asres (Debre Markos University); Bewketu Mulu (Debre Markos University); Varsha Gopalakrishnan (Self)

Earth Observation & Monitoring Extreme Weather Time-series Analysis
NeurIPS 2024 Mamba MethaneMapper: State Space Model for Methane Detection from Hyperspectral Imagery (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Methane (CH4) is the chief contributor to global climate change. Recent advancements in AI-based image processing have paved the way for innovative approaches for the detection of methane using hyper-spectral imagery. Existing methods, while effective, often come with high computational demands and associated costs that can limit their practical applications. Addressing these limitations, we propose the Mamba MethaneMapper (MMM), a cost-effective and efficient AI-driven solution designed to enhance methane detection capabilities in hyper-spectral images. MMM will incorporate two key innovations that collectively improve performance while managing costs. First, we will utilize a gpu-aware state-space encoder, which optimizes the computational resources and efficiency of the system. Second, MMM will use an environment-sensitive module to prioritize image regions likely containing methane emissions, which are then analyzed by our efficient Mamba algorithm. This selective approach not only improves the accuracy of methane detection but also significantly reduces unnecessary computations and memory consumption.

Authors: Satish Kumar (University of California, Santa Barbara); ASM Iftekhar (Microsoft); Kaikai Liu (University of California, Santa Barbara); Bowen Zhang (University Of California, Santa Barbara); Mehan Jayasuriya (Mozilla)

Climate Science & Modeling Earth Observation & Monitoring Computer Vision & Remote Sensing Meta- and Transfer Learning
NeurIPS 2024 Unlocking the Potential of Green Virtual Bidding Strategies : A Pathway to a Low-Carbon Electricity Market (Proposals Track)
Abstract and authors: (click to expand)

Abstract: The increasing importance of renewable energy in mitigating climate change has led to a critical examination of electricity market mechanisms that can support this transition. Virtual bidding, a financial tool used within electricity markets, allows market participants to capitalize on discrepancies between the Day-Ahead (DA) and Real-Time (RT) prices of electricity. The introduction of virtual bidding within electricity markets has introduced significant changes in market dynamics, with implications for environmental outcomes. It supports the transition towards a greener energy mix by favoring the dispatch of renewable resources and contributing to more efficient market conditions. This proposal seeks to explore the impact of virtual bidding and the development of green virtual bidding strategies by leveraging advanced machine learning models.

Authors: Aya Laajil (Centrale Supelec); Loubna Benabou (UQAR); Frédérique M. Gagnon (Videns Analytics); Laurent Barcelo (Videns Analytics); Ghait Boukachab (Videns Analytics & Mila)

Climate Finance & Economics Power & Energy Time-series Analysis
NeurIPS 2024 Cross-Border Electricity Price Forecasting with Deep Learning (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Accurate electricity price forecasting (EPF) is critical for the efficient operation of energy markets, especially with the increasing integration of renewable energy sources. In this study, we explore the performance of advanced deep learning models, including Long Short-Term Memory (LSTM), vanilla Transformers, Adaptive Fourier Neural Operator (AFNO), and Mamba, in forecasting electricity prices across 16 bidding zones in the European Union. By utilizing a comprehensive dataset that includes cross-border electricity prices and generation data, we compare the effectiveness of these models under different learning strategies, including zero-shot, one-shot, and few-shot learning. We hope our results set a new benchmark for future EPF studies and offer valuable insights into the dynamics of electricity pricing in energy markets.

Authors: Hadeer El Ashhab (KIT); Benjamin Schäfer (Karlsruhe Institute of Technology)

Power & Energy Time-series Analysis
NeurIPS 2024 Making Climate AI Systems Past and Future Aware to Better Evaluate Climate Change Policies (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Addressing the issues faced by climate change necessitates appropriate methodologies for evaluating climate policies, particularly when discussing long-term and real-world scenarios. While large language models (LLMs) have altered artificial intelligence, they ultimately fall short of connecting historical data with future estimates. We propose an agentic LLM system that would address this gap by considering and analyzing the probable outcomes of the user-specified climate policy inside the practical settings. Further, we propose using knowledge graphs to model the existing data about the impact of climate policies along with allowing our system to access the data about future climate predictions. Done this way, the model can peek into the past (previous policies) and the future (climate scenarios forecast), paving the way for agencies to evaluate and design strategies and plans for climate change more effectively.

Authors: Riya . (IIT Roorkee); Sudhakar Singh (Nvidia)

Natural Language Processing Climate Finance & Economics Earth Observation & Monitoring Public Policy
NeurIPS 2024 A Novel Approach to Assessing the Efficacy of Policy-driven Urban Energy Demand Reduction via Proximal Infrared Remote Sensing (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Byproducts of energy use are among the leading drivers of emissions that can lead to climate change. Heating and cooling represent the largest source of energy use in urban infrastructure; thus, significant policy efforts have been implemented to reduce energy demand. Estimates of the efficacy of these policies are often difficult as they rely on self-reporting or detailed utility data, which is not available broadly and challenging to collect in areas where utility infrastructures are unavailable. Buildings' heating, ventilation, and air conditioning (HVAC) can be used to measure energy consumption from the demand side to evaluate end use and potential energy policy-induced reductions. Therefore, we are proposing a novel, non-intrusive method of energy use monitoring that uses proximal infrared remote sensing of building envelopes to find patterns of heating and cooling use. Our results show that the technique can discern on-and-off patterns of externally facing cooling units (ACs) in a dense urban scene at distances up to ~0.5 miles. We have tested multiple cases of residential buildings in New York City. We collected ~337,000 infrared images at 10-second intervals of the buildings’ facades taken from June 11 to July 18, 2018, between 500 m and 1 km. We have applied computer vision techniques to that sequence of images to identify exterior-venting HVAC units and generate their infrared time series, which correlates directly with their temperature. We determine each AC unit's on/off transitions using a one-dimensional edge detection algorithm and identify aggregated and disaggregated patterns of end-user behavior that are direct proxies for total heating and cooling use. Finally, we demonstrate that this new methodology is sufficiently sensitive to assess policy interventions to reduce daily peak demand effectively.

Authors: Kyungmin Lee (University of Delaware); Gregory Dobler (University of Delaware)

Buildings Computer Vision & Remote Sensing
NeurIPS 2024 Composing Open-domain Vision with RAG for Ocean Monitoring and Conservation (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Climate change's destruction of marine biodiversity is threatening communities and economies around the world which rely on healthy oceans for their livelihoods. The challenge of applying computer vision to niche, real-world domains such as ocean conservation lies in the dynamic and diverse environments where traditional top-down learning struggle with long-tailed distributions, generalization, and domain transfer. Scalable species identification for ocean monitoring is particularly difficult due to the need to adapt models to new environments and identify rare or unseen species. To overcome these limitations, we propose leveraging bottom-up, open-domain learning frameworks–specifically vision-language models (VLMs) combined with retrieval-augmented generation (RAG)–as a resilient, scalable solution for image and video analysis in marine applications. We validate this approach through a preliminary application in classifying fish from video onboard fishing vessels, demonstrating impressive emergent retrieval and prediction capabilities without domain-specific training or knowledge of the task itself.

Authors: Sepand Dyanatkar (OnDeck Fisheries AI); Angran Li (OnDeck Fisheries AI); Alexander Dungate (OnDeck Fisheries AI)

Computer Vision & Remote Sensing Earth Observation & Monitoring Ecosystems & Biodiversity Unsupervised & Semi-Supervised Learning
NeurIPS 2024 How are companies reducing emissions? An LLM-based approach to creating a carbon emissions reduction levers library at scale (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Creating a transparent, sector-specific database of actions that would result in carbon emissions reduction is essential for guiding companies toward effective, data-driven pathways to meet their net-zero commitments. Information on carbon emissions reduction levers is scattered around greenhouse gas emissions disclosures and sustainability reports in dense text forms, and no systematic, sector and region specific reduction lever libraries are available to companies. This research proposes a multi-agent system leveraging Large Language Models (LLMs) integrated with Retrieval-Augmented Generation (RAG) to systematically extract, classify, and validate carbon reduction actions from publicly available sustainability reports. By constructing a standardized database of reduction levers categorized by industry, geography, and greenhouse gas scopes, this work empowers companies to prioritize high-impact, cost-effective emissions reduction strategies. We plan to integrate environmentally-extended input-output models to ensure that these actions are closely tied to sector-specific emissive sources, increasing their relevance and scalability. This initiative is expected to support companies in mitigating greenhouse gas emissions by offering a practical resource that accelerates the transition to a low-carbon economy, and makes actionable insights readily available to corporations, industry and the research community.

Authors: Varsha Gopalakrishnan (Watershed Technology Inc.); Shaena Ulissi (Watershed); Andrew Dumit (Watershed Technology, Inc.); Krishna Rao (Watershed Technology Inc.); Katherine Tsai (Watershed Technology Inc.); Sangwon Suh (Watershed Technology Inc.)

Natural Language Processing
NeurIPS 2024 Enhanced PINNs for high-order power grid dynamics (Proposals Track)
Abstract and authors: (click to expand)

Abstract: We develop improved physics-informed neural networks (PINNs) for high-order and high-dimensional power system models described by nonlinear ordinary differential equations. We propose some novel enhancements to improve PINN training and accuracy and also implement several other recently proposed ideas from the literature. We successfully apply these to study the transient dynamics of synchronous generators. We also make progress towards applying PINNs to advanced inverter models. Such enhanced PINNs can allow us to accelerate high-fidelity simulations needed to ensure a stable and reliable renewables-rich future grid.

Authors: Vineet Jagadeesan Nair (MIT)

Power & Energy Hybrid Physical Models Unsupervised & Semi-Supervised Learning
NeurIPS 2024 Multimodal AI framework for predicting candidate high temperature superconductors (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Materials science is at the forefront of addressing some of the most pressing challenges of our era, particularly in enhancing energy efficiency and sustainability. One of the most promising avenues in this field is the study of superconductors—materials that, when cooled below a critical temperature (Tc), exhibit zero electrical resistance. This unique property not only eliminates energy loss due to resistance but also enables a wide range of advanced technologies, such as MRI machines, magnetically levitating trains, and other high-efficiency systems. Superconductors can significantly reduce the carbon footprint of power transmission and other industrial applications. Given the complexity and importance of predicting candidate and practical high-temperature superconductors, we propose to develop a multimodal AI framework to predict new high-Tc superconducting materials. By integrating various material properties, including structural and compositional data, we seek to study patterns and relationships that could guide the discovery of new high-temperature superconductors. Success in this endeavor could significantly reduce energy losses in electrical systems, contributing to the fight against climate change.

Authors: Nidhish Sagar (Massachusetts Institute of Technology); Eslam G. Al-Sakkari (Polytechnique Montréal); Ahmed Ragab (Polytechnique Montréal)

Chemistry & Materials Power & Energy Generative Modeling Meta- and Transfer Learning
NeurIPS 2024 Seeing Inside Buildings: Leveraging Generative AI and Multimodal Data to Automate Building Material Audits (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Building retrofits and deconstruction projects are expected to surpass new construction jobs globally, putting pressure on architects and developers to sustainably reuse buildings and their materials (World Economic Forum). However, building reuse and material upcycling is hindered by the complexity of building material audits, which require expensive tests, site visits, and detailed data that is often missing for old structures. We present a Generative AI approach to predict the structural and material make-up of existing buildings from multimodal geospatial, technical, and cadaster data. Leveraging a dataset of 100 buildings across the United States with corresponding building 3D scans, geolocation, and construction data, we demonstrate the capability of a stable diffusion model to reliably predict structural diagrams for subsequent estimation of material contents. This process also offers designers actionable potential material reuse data to streamline and accelerate circularity for existing building design.

Authors: Nikita Klimenko (MIT); James Stoddart (Autodesk, Inc.); Lorenzo Villaggi (Autodesk, Inc.); Dale Zhao (Autodesk, Inc.)

Buildings Carbon Capture & Sequestration Computer Vision & Remote Sensing Generative Modeling
NeurIPS 2024 Flood Prediction in Kenya - Leveraging Pre-Trained Models to Generate More Validation Data in Sparse Observation Settings (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Kenya has lacked a national flood risk management framework and also has sparse flood observation data, which makes developing deep learning flood prediction models on a national scale challenging. Flood prediction models are critical to operationalize Early Warning Systems (EWS). We propose two different models to feed into an EWS. The first model will leverage statistical machine learning approaches to predict flood or no flood events on a 0.25 x 0.25 degree scale (approximately 30 km x 30 km in Kenya) using ERA5 features as well as land cover and Digital Terrain Model (DTM) data. This first model will also be used to create a baseline prediction benchmark across the entire country of Kenya. The second model will leverage pre-trained remote sensing based models to generate segmented flood or no flood data on a fine spatial scale. This will increase the number of validation points by a factor of 1000, which opens the door to deep learning approaches to predict flood or no-flood events on a 30 meter x 30 meter spatial scale. We hope that this approach of leveraging pre-trained models to generate fine scale validation data can eventually be used widely in other extreme climate event forecasting scenarios given the scarcity of historical extreme climate events compared to normal weather events.

Authors: Alim Karimi (Self / Purdue University); David Quispe (University of Toronto); Hammed Akande (Concordia University); Nicole Mong'are (Athi Water Works Development Agency); Valerie Brosnan (Mitga Solutions); Asbina Baral (Ministry of Education, Science and Technology)

Societal Adaptation & Resilience Climate Science & Modeling Disaster Management and Relief Earth Observation & Monitoring Extreme Weather Computer Vision & Remote Sensing Time-series Analysis
NeurIPS 2024 Towards more efficient agricultural practices via transformer-based crop type classification (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Machine learning has great potential to increase crop production and resilience to climate change. Accurate maps of where crops are grown are a key input to several downstream policy and research applications. This proposal presents preliminary work showing that it is possible to accurately classify crops from time series derived from Sentinel 1 and 2 satellite imagery in Mexico using a pixel-based binary crop/non-crop time series transformer model. We also find preliminary evidence that meta-learning approaches supplemented with data from similar agro-ecological zones may improve model performance. Due to these promising results, we propose further development of this method with the goal of accurate multi-class crop classification in Jalisco, Mexico via meta-learning with a dataset comprising similar agro-ecological zones.

Authors: Isabella Smythe (Columbia University); Eduardo Ulises Moya (Gobierno de Jalisco); Michael Smith (Aspia Space); Yazid Mikail (Climate Change AI); Daisy Ondwari (Kabarak University)

Agriculture & Food Earth Observation & Monitoring Computer Vision & Remote Sensing Meta- and Transfer Learning Time-series Analysis
NeurIPS 2024 A Multimodal Causal Framework for Large-Scale Ecosystem Valuation: Application to Wetland Benefits for Flood Mitigation (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Climate change is poised to alter wetland ecosystems through changes in temperature and precipitation patterns, compounding the already pronounced influence of human-driven wetland development. In this context, policymakers and environmental managers would benefit from accurate wetland valuations to guide their decision-making, as their choices regarding this critical natural resource directly impact flood mitigation efforts, biodiversity conservation, and economic activity. This paper introduces a novel multimodal causal framework for producing location-specific ecosystem valuations at a national scale to be used in cost-benefit policy analysis. It leverages recent advances in estimating heterogeneous treatment effects to flexibly determine how the expected impact of ecosystem-level changes---such as wetland loss via development---varies conditional on high-dimensional and multimodal measures that characterize the complex interactions between human and natural systems such as aerial satellite imagery, weather sequence data, land cover classifications, and water surface networks. From this effort, we aim to create a national database of location-specific wetland valuations in an approach that can be readily extended in estimating the effect of other interventions on ecosystems. We also plan to generate open-source feature embeddings for each U.S. wetland, embeddings that can be used to address other climate-related causal questions as well.

Authors: Hannah Druckenmiller (Caltech); Georgia Gkioxari (Caltech); Connor Jerzak (University of Texas at Austin); SayedMorteza Malaekeh (University of Texas at Austin)

Ecosystems & Biodiversity Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2024 Stubble (Crop Residue) Burning Detection Through Satellite Images Using Geospatial Foundation Model: A Case Study in Punjab, India (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Stubble burning is a significant environmental challenge globally, with widespread implications for air quality, greenhouse gases emission, soil degradation and health issues. This practice is particularly prevalent in agricultural regions across the world, though its impacts are notably severe in the northern India. This proposed work focuses on improving the detection of stubble (crop residue) burning in Punjab (India), through using the geospatial foundation model. This study leverages a series of satellite images where stubble burning incidents have been documented. By refining the model to incorporate local environmental factors, this study aims to improve the accuracy of stubble burning detection, thereby contributing to a scalable solution for real-time monitoring and intervention in crop residue burning practices worldwide.

Authors: Rajiv Ranjan (Plaksha University); Ying-Jung Chen (Georgia Institute of Technology); Shashank Tamaskar (Plaksha University); Anupam Sobti (IIT Delhi)

Computer Vision & Remote Sensing Agriculture & Food
NeurIPS 2024 Leveraging Artificial Intelligence for Real-Time Carbon Emission Management and Climate Change Resilience Planning in Kenya (Proposals Track)
Abstract and authors: (click to expand)

Abstract: As climate change intensifies, reducing carbon emissions has emerged as a critical global objective. Countries like Kenya, which heavily depend on agriculture and pastoral systems, are disproportionately affected by changing climatic conditions. In 2020, Kenya’s Nationally Determined Contribution (NDC) set a target to reduce carbon emissions by 32% by 2030. However, traditional methods for tracking and managing carbon emissions are derailed by inconsistencies, such as data manipulation and lack of real-time monitoring. This project leverages blockchain technology to provide a transparent and reliable platform for tracking CO2 emissions. Additionally, remote sensing and machine learning models will be used to enhance data collection, monitoring, and predicting emissions. The study will be piloted in 16 sugar companies and 16 plastic companies, as they are among the highest carbon emitters in Kenya. The findings from this study will assist the government and industries in planning climate change resilience strategies and adaptation measures.

Authors: Alukwe Jones (Kabarak University); Theophilus Owiti (Kabarak University); Mercy Gachoka (Kabarak Univeristy); Peter Rogendo (Kabarak University)

Carbon Capture & Sequestration Time-series Analysis Unsupervised & Semi-Supervised Learning
NeurIPS 2024 DeepMyco - Dataset Generation for Dye Mycoremediation (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Textile dyes comprise 20% of global water pollution. Mycoremediation, a promising approach utilizing cheap, naturally growing fungi, has not seen scale production. While numerous studies indicate benefits, it is challenging to apply the specific learnings of each study to the combination of environmental factors present in a given physical site - a gap we believe machine learning can help fill if datasets become available. We propose an approach to drive machine learning research in mycoremediation by contributing a comprehensive dataset. We propose using advanced language models and vision transformers to extract and categorize experimental data from various research papers. This dataset will enable ML-driven innovation in matching fungi to specific dye types, optimizing remediation processes, and scaling up mycoremediation efforts effectively.

Authors: Danika Gupta (The Harker Upper School)

Heavy Industry and Manufacturing Natural Language Processing
NeurIPS 2024 Large language model co-pilot for transparent and trusted life cycle assessment comparisons (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Intercomparing life cycle assessments (LCA), a common type of sustainability and climate model, is difficult due to basic differences in fundamental assumptions, especially in the goal and scope definition stage. This complicates decision-making and the selection of climate-smart policies, as it becomes difficult to compare optimal products and processes between different studies. To aid policymakers and LCA practitioners alike, we plan to leverage large language models (LLM) to build a database containing documented assumptions for LCAs across the agricultural sector, with a case study on livestock management. The articles for this database are identified in a systematic literature search, then processed to extract relevant assumptions about the goal and scope definition of the LCA and inserted into a vector database. We then leverage this database to develop an AI co-pilot by augmenting LLMs with retrieval augmented generation to be used by stakeholders and LCA practitioners alike. This co-pilot will accrue two major benefits: 1) enhance the decision-making process through facilitating comparisons among LCAs to enable policymakers to adopt data-driven climate policies and 2) encourage the use of common assumptions by LCA practitioners. Ultimately, we hope to create a foundational model for LCA tasks that can plug-in with existing open source LCA software and tools.

Authors: Nathan Preuss (Cornell University); Fengqi You (Cornell University)

Natural Language Processing Agriculture & Food Public Policy Supply Chains Data Mining Generative Modeling
ICLR 2024 Mapping Land Naturalness from Sentinel-2 using Deep Contextual and Geographical Priors (Papers Track)
Abstract and authors: (click to expand)

Abstract: In recent decades, the causes and consequences of climate change have accelerated, affecting our planet on an unprecedented scale. This change is closely tied to the ways in which humans alter their surroundings. As our actions continue to impact natural areas, using satellite images to see and measure these effects has become crucial for understanding and fighting climate change. Aiming to map land naturalness on the continuum of modern human pressure, we develop a multi-modal supervised deep learning framework that addresses the unique challenges of satellite data and the task at hand. We incorporate contextual and geographical priors. These priors are represented by corresponding coordinate information and broader contextual information including and surrounding the immediate patch to be predicted. Our framework improves the model's predictive performance to map land naturalness from a given Sentinel-2 data, a multi-spectral optical satellite imagery. Recognizing that our protective measures are as effective as our grasp of the ecosystem, quantifying naturalness serves as a crucial step towards enhancing our environmental stewardship.

Authors: Burak Ekim (University of the Bundeswehr); Michael Schmitt (University of the Bundeswehr Munich)

Ecosystems & Biodiversity Forests
ICLR 2024 Structured spectral reconstruction for scalable soil organic carbon inference (Papers Track)
Abstract and authors: (click to expand)

Abstract: Measuring soil organic carbon (SOC) inexpensively and accurately is crucial for soil health monitoring and agricultural decarbonization. Hyperspectral imaging is commonly evaluated as an inexpensive alternative to dry combustion for SOC measurement, but existing end-to-end approaches trained to predict SOC content from spectral data frequently fail to generalize when applied outside of their ground-truth geographic sampling distributions. Using stratified data from the USDA Rapid Carbon Assessment (RaCA), we demonstrate a method to improve model generalization out-of-distribution by training SOC regression alongside models that reconstruct input spectra. Because hyperspectra can be collected from remote platforms such as drones and satellites, this approach raises the possibility of using large hyperspectral Earth observation datasets to transfer SOC inference models to remote geographies where geographically-dense ground-truth data collection may be expensive or impossible. By replacing the decoder with a simple physics-informed model, we also learn an interpretable spectral signature of SOC, confirming its dark hue and expected reflectance troughs. Finally, we show that catastrophic generalization failures can be better addressed with these architectures by fine-tuning on large quantities of hyperspectral data.

Authors: Evan A Coleman (MIT); Sujay Nair (Georgia Institute of Technology); Xinyi Zeng (Coho Climate Advisors); Elsa Olivetti (MIT Department of Materials Science & Engineering)

Earth Observation & Monitoring Computer Vision & Remote Sensing
ICLR 2024 ClimateQ&A : bridging the gap between climate scientists and the general public (Papers Track)
Abstract and authors: (click to expand)

Abstract: This research paper investigates public views on climate change and biodiversity loss by analyzing questions asked to the ClimateQ&A platform. ClimateQ&A is a conversational agent that uses LLMs to respond to queries based on over 14,000 pages of scientific literature from the IPCC and IPBES reports. Launched online in March 2023, the tool has gathered over 30,000 questions, mainly from a French audience. Its chatbot interface allows for the free formulation of questions related to nature*. While its main goal is to make nature science more accessible, it also allows for the collection and analysis of questions and their themes. Unlike traditional surveys involving closed questions, this novel method offers a fresh perspective on individual interrogations about nature. Running NLP clustering algorithms on a sample of 3,425 questions, we find that a significant 25.8% inquire about how climate change and biodiversity loss will affect them personally (e.g., where they live or vacation, their consumption habits) and the specific impacts of their actions on nature (e.g., transportation or food choices). This suggests that traditional methods of surveying may not identify all existing knowledge gaps, and that relying solely on IPCC and IPBES reports may not address all individual inquiries about climate and biodiversity, potentially affecting public understanding and action on these issues. *Note: we use “nature” as an umbrella term for “climate change” and “biodiversity loss”.

Authors: Natalia de la Calzada (Ekimetrics); Theo Alves Da Costa (Ekimetrics); Annabelle Blangero (Ekimetrics); Nicolas CHESNEAU (EKIMETRICS)

Natural Language Processing Behavioral and Social Science
ICLR 2024 Improving Streamflow Predictions with Vision Transformers (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurate streamflow prediction is crucial to understand climate impacts on water resources and develop effective adaption strategies. A global Long Short-Term Memory (LSTM) model, using data from multiple basins, can enhance streamflow prediction, yet acquiring detailed basin attributes remains a challenge. To overcome this, we introduce the Geo-ViT-LSTM model, a novel approach that enriches LSTM predictions by integrating basin attributes derived from remote sensing with a vision transformer. Applied to 531 basins across the United States (US), our method significantly outperforms existing models, showing an 11% increase in prediction accuracy. Geo-ViT-LSTM marks a significant advancement in land surface modeling, providing a more comprehensive and effective tool for managing water resources under climate change.

Authors: Kshitij Tayal (Oak Ridge National Lab); Arvind Renganathan (University of Minnesota); Dan Lu (Oak Ridge National Laboratory)

Oceans & Marine Systems Time-series Analysis
ICLR 2024 Grapevine Disease Prediction Using Climate Variables from Multi-Sensor Remote Sensing Imagery via a Transformer Model (Papers Track)
Abstract and authors: (click to expand)

Abstract: Early detection and management of grapevine diseases are important in pursuing sustainable viticulture. This paper introduces a novel framework leveraging the TabPFN model to forecast blockwise grapevine diseases using climate variables from multi-sensor remote sensing imagery. By integrating advanced machine learning techniques with detailed environmental data, our approach significantly enhances the accuracy and efficiency of disease prediction in vineyards. The TabPFN model's experimental evaluations showcase comparable performance to traditional gradient-boosted decision trees, such as XGBoost, CatBoost, and LightGBM. The model's capability to process complex data and provide per-pixel disease-affecting probabilities enables precise, targeted interventions, contributing to more sustainable disease management practices. Our findings underscore the transformative potential of combining Transformer models with remote sensing data in precision agriculture, offering a scalable solution for improving crop health and productivity while reducing environmental impact.

Authors: Weiying Zhao (Deep Planet); Natalia Efremova (Queen Mary University London)

Agriculture & Food Earth Observation & Monitoring Causal & Bayesian Methods
ICLR 2024 Towards Ecological Network Analysis with Gromov-Wasserstein Distances (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change is driving the widespread redistribution of species with cascading effects on predators and their prey. Formally comparing ecological interaction networks is a critical step towards understanding the impact of climate change on ecosystem functioning, yet current methods for ecological network analysis are unable to do so. We propose using the GromovWasserstein (GW) metric for quantifying dissimilarity between ecological networks. We demonstrates the corresponding optimal transport plans of this distance can be interpreted as species functional alignment between food webs. Our results show that GW transport plans align species from different mammal communities consistent with ecological understanding. Furthermore, we illustrate extensions of the GW distance to notions of averages and factorization over ecological networks. Ultimately, we propose the foundation for a novel interpretable topological data analysis framework to inform future ecological research and conservation management.

Authors: Kai M Hung (Rice University); Ann Finneran (Rice University); Alex Zalles (Rice University); Lydia Beaudrot (Rice University); Cesar Uribe (Rice University)

Ecosystems & Biodiversity
ICLR 2024 Scaling Transformers for Skillful and Reliable Medium-range Weather Forecasting (Papers Track) Overall Best Paper
Abstract and authors: (click to expand)

Abstract: Weather forecasting is a fundamental problem for anticipating and mitigating the impacts of climate change. Recently, data-driven approaches for weather forecasting based on deep learning have shown great promise, achieving accuracies that are competitive with operational systems. However, those methods often employ complex, customized architectures without sufficient ablation analysis, making it difficult to understand what truly contributes to their success. Here we introduce Stormer, a simple transformer model that achieves state-of-the-art performance on weather forecasting with minimal changes to the standard transformer backbone. We identify the key components of Stormer through careful empirical analyses, including weather-specific embedding, randomized dynamics forecast, and pressure-weighted loss. At the core of Stormer is a randomized forecasting objective that trains the model to forecast the weather dynamics over varying time intervals. During inference, this allows us to produce multiple forecasts for a target lead time and combine them to obtain better forecast accuracy. On WeatherBench 2, Stormer performs competitively at short to medium-range forecasts and outperforms current methods beyond 7 days, while requiring orders-of-magnitude less training data and compute. Additionally, we demonstrate Stormer’s favorable scaling properties, showing consistent improvements in forecast accuracy with increases in model size and training tokens.

Authors: Tung Nguyen (University of California, Los Angeles); Rohan Shah (Carnegie Mellon University); Hritik Bansal (UCLA); Troy Arcomano (Argonne National Laboratory); Sandeep Madireddy (Argonne National Laboratory); Romit Maulik (Argonne National Laboratory); Veerabhadra Kotamarthi (Argonne National Laboratory); Ian Foster (Computation Institute); Aditya Grover (UCLA)

Climate Science & Modeling Extreme Weather
ICLR 2024 Categorization of Meteorological Data by Contrastive Clustering (Papers Track)
Abstract and authors: (click to expand)

Abstract: Visualized ceilometer backscattering data, displaying meteorological phenomena like clouds, precipitation, and aerosols, is mostly analyzed manually by meteorology experts. In this work, we present an approach for the categorization of backscattering data using a contrastive clustering approach, incorporating image and spatiotemporal information into the model. We show that our approach leads to meteorologically meaningful clusters, opening the door to the automatic categorization of ceilometer data, and how our work could potentially create insights in the field of climate science.

Authors: Michael Dammann (HAW Hamburg); Ina Mattis (DWD); Michael Neitzke (HAW Hamburg); Ralf Möller (University of Lübeck)

Unsupervised & Semi-Supervised Learning Climate Science & Modeling Computer Vision & Remote Sensing
ICLR 2024 AI-driven emulation of ocean dynamics on sub-seasonal scales (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate forecasting systems rely on coupling atmospheric models to ocean and sea ice models. However, while there have recently been significant efforts to accelerate atmospheric models using AI, there have been very scarce efforts to accelerate the latter. As a result, climate forecasting systems still rely on expensive numerical simulations, which renders large-scale ensembling and probabilistic prediction cumbersome. To address this issue, we propose a large-scale AI model of ocean dynamics. Our method relies on a spherical neural operator to accurately capture the functional nature of ocean dynamics on the sphere. We empirically demonstrate that our model can accurately predict ocean dynamics for sub-seasonal horizons and outperforms the existing method. It offers a 60x speedup over the fastest numerical solver currently used for the task.

Authors: Suyash Bire (Massachusetts Institute of Technology); Jean Kossaifi (NVIDIA); Simone Silvestri (Massachusetts Institute of Technology); Nikola Kovachki (Nvidia Corp.); Kamyar Azizzadenesheli (Nvidia Corp.); Chris N Hill (MIT); Animashree Anandkumar (Caltech)

Climate Science & Modeling Generative Modeling
ICLR 2024 Calibrating Bayesian UNet++ for Sub-seasonal Forecasting (Papers Track)
Abstract and authors: (click to expand)

Abstract: Seasonal forecasting is a crucial task when it comes to detecting the extreme heat and colds that occur due to climate change. Confidence in the predictions should be reliable since a small increase in the temperatures in a year has a big impact on the world. Calibration of the neural networks provides a way to ensure our confidence in the predictions. However, calibrating regression models is an under-researched topic, especially in forecasters. We calibrate a UNet++ based architecture, which was shown to outperform physics-based models in temperature anomalies. We show that with a slight trade-off between prediction error and calibration error, it is possible to get more reliable and sharper forecasts. We believe that calibration should be an important part of safety-critical machine learning applications such as weather forecasters.

Authors: Büşra Asan (Istanbul Technical University); Abdullah Akgül (University of Southern Denmark); Alper Unal (Istanbul Technical University); Melih Kandemir (University of Southern Denmark); Gozde Unal (Istanbul Technical University)

Uncertainty Quantification & Robustness Climate Science & Modeling
ICLR 2024 Extreme Precipitation Nowcasting using Transformer-based generative models (Papers Track)
Abstract and authors: (click to expand)

Abstract: This paper presents an innovative approach to extreme precipitation nowcasting by employing Transformer-based generative models, specifically VideoGPT with Extreme Value Loss (EVL) regularization. Leveraging a comprehensive dataset from the Royal Netherlands Meteorological Institute (KNMI), our study focuses on predicting short-term precipitation with high accuracy. We introduce a novel method for computing EVL without assuming fixed extreme representations, addressing the limitations of current models in capturing extreme weather events. We present both qualitative and quantitative analyses, demonstrating the superior performance of the proposed VideoGPT-EVL in generating accurate precipitation forecasts, especially when dealing with extreme precipitation events.

Authors: Cristian Meo (TUDelft); Mircea T Lica (Delft University of Technology); Ankush Roy (TUDelft); Zeina Boucher (TUDelft); Junzhe Yin (TUDelft); Yanbo Wang (Delft University of Technology); Ruben Imhoff (Deltares); Remko Uijlenhoet (TUDelft); Justin Dauwels (TU Delft)

Generative Modeling Climate Science & Modeling Computer Vision & Remote Sensing Time-series Analysis
ICLR 2024 GeoFormer: A Vision and Sequence Transformer-based Approach for Greenhouse Gas Monitoring (Papers Track)
Abstract and authors: (click to expand)

Abstract: Air pollution represents a pivotal environmental challenge globally, playing a major role in climate change via greenhouse gas emissions and negatively affecting the health of billions. However, predicting the spatial and temporal patterns of pollutants remains challenging. The scarcity of ground-based monitoring facilities and the dependency of air pollution modeling on comprehensive datasets, often inaccessible for numerous areas, complicate this issue. In this work, we introduce GeoFormer, a compact model that combines a vision transformer module with a highly efficient time-series transformer module to predict surface-level nitrogen dioxide (NO2) concentrations from Sentinel-5P satellite imagery. We train the proposed model to predict surface-level NO2 measurements using a dataset we constructed with Sentinel-5P images of ground-level monitoring stations, and their corresponding NO2concentration readings. The proposed model attains high accuracy (MAE 5.65), demonstrating the efficacy of combining vision and time-series transformer architectures to harness satellite-derived data for enhanced GHG emission insights, proving instrumental in advancing climate change monitoring and emission regulation efforts globally.

Authors: Madhav Khirwar (Independent)

Computer Vision & Remote Sensing Climate Science & Modeling Time-series Analysis
ICLR 2024 Paddy Doctor: Open Dataset and Automated Pest Identification Using Pre-trained Deep Learning Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Agriculture is one of the most important industries contributing to several countries’ national income. Timely identification of these pests is crucial for both farmers and agriculture experts to implement effective mitigation methods. Traditionally, farmers employ manual techniques based on their experience and visual inspection to identify paddy pests, but this is highly inefficient, time-consuming, and error-prone. At times, even experienced farmers and agriculture experts might struggle to identify a specific species of pests due to their size, colors similar to leaves, and the extensive variety of identical types. Moreover, farmers often apply a large quantity of pesticide without accurately identifying the exact type of pests and their underlying causes. Therefore, it is increasingly important to automate the process of detection of paddy pests to reduce pesticide usage and subsequently minimize the loss in yield. In this paper, we present an open dataset and deep learning models for automated pest identification in real paddy fields. Our dataset contains 6,062 annotated paddy leaf images across 17 classes (16 pest classes and a normal class). We benchmarked our dataset using six pre-trained models (ResNet34, VGG16, DenseNet121, EfficientNet_v2_m, MobileNet_V3_Large, and SqueezeNet1_0). The experimental results showed that ResNet34 achieved the highest accuracy of 98.31%. We release our dataset and reproducible code in the open source for community use.

Authors: Petchiammal A (Manonmaniam Sundaranar University, Tirunelveli, Tamil Nadu); Pandarasamy Arjunan (Indian Institute of Science, Bangalore)

Agriculture & Food
ICLR 2024 Using expired weather forecasts to supply 10 000y of data for accurate planning of a renewable European energy system (Papers Track)
Abstract and authors: (click to expand)

Abstract: Expanding renewable energy generation and electrifying heating to address climate change will heighten the exposure of our power systems to the variability of weather. Planning and assessing these future systems typically lean on past weather data. We spotlight the pitfalls of this approach---chiefly its reliance on what we claim is a limited weather record---and propose a novel approach: to evaluate these systems on two orders of magnitude more weather scenarios. By repurposing past ensemble weather predictions, we not only drastically expand the known weather distribution---notably its extreme tails---for traditional power system modeling but also unveil its potential to enable data-intensive self-supervised, diffusion-based and optimization ML techniques. Building on our methodology, we introduce a **dataset** collected from ECMWF ENS forecasts, encompassing power-system relevant variables over Europe, and detail the intricate process behind its assembly.

Authors: Petr Dolezal (AI4ER CDT, University of Cambridge); Emily Shuckburgh (University of Cambridge)

Power & Energy Climate Science & Modeling Data Mining Uncertainty Quantification & Robustness
ICLR 2024 Black carbon plumes from gas flaring in North Africa identified from multi-spectral imagery with deep learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Black carbon (BC) is an important pollutant aerosol emitted by numerous human activities, including gas flaring. Improper combustion in flaring activities can release large amounts of BC, which is harmful to human health and has a strong climate warming effect. To our knowledge, no study has ever directly monitored BC emissions from satellite imagery. Previous works quantified BC emissions indirectly, by applying emission coefficients to flaring volumes estimated from satellite imagery. Here, we develop a deep learning framework and apply it to Sentinel-2 imagery over North Africa during 2022 to detect and quantify BC emissions from gas flaring. We find that BC emissions in this region amount to about 1 million tCO2eq, or 1 million passenger cars, more than a quarter of which are due to 10 sites alone. This work demonstrates the operational monitoring of BC emissions from flaring, a key step in implementing effective mitigation policies to reduce the climate impact of oil and gas operations.

Authors: Alexandre Tuel (Galeio); Thomas Kerdreux (INRIA/ ENS); Louis THIRY (ENS Paris)

Earth Observation & Monitoring Computer Vision & Remote Sensing
ICLR 2024 Time-Varying Constraint-Aware Reinforcement Learning for Energy Storage Control (Papers Track)
Abstract and authors: (click to expand)

Abstract: Energy storage devices, such as batteries, thermal energy storages, and hydrogen systems, can help mitigate climate change by ensuring a more stable and sustainable power supply. To maximize the effectiveness of such energy storage, determining the appropriate charging and discharging amounts for each time period is crucial. Reinforcement learning is preferred over traditional optimization for the control of energy storage due to its ability to adapt to dynamic and complex environments. However, the continuous nature of charging and discharging levels in energy storage poses limitations for discrete reinforcement learning, and time-varying feasible charge-discharge range based on state of charge (SoC) variability also limits the conventional continuous reinforcement learning. In this paper, we propose a continuous reinforcement learning approach that takes into account the time-varying feasible charge-discharge range. An additional objective function was introduced for learning the feasible action range for each time period, supplementing the objectives of training the actor for policy learning and the critic for value learning. This actively promotes the utilization of energy storage by preventing them from getting stuck in suboptimal states, such as continuous full charging or discharging. This is achieved through the enforcement of the charging and discharging levels into the feasible action range. The experimental results demonstrated that the proposed method further maximized the effectiveness of energy storage by actively enhancing its utilization.

Authors: Jaeik Jeong (Electronics and Telecommunications Research Institute (ETRI)); Tai-Yeon Ku (Electronics and Telecommunications Research Institute (ETRI)); Wan-Ki Park (Electronics and Telecommunications Research Institute (ETRI))

Power & Energy Reinforcement Learning Time-series Analysis
ICLR 2024 Towards a Data-Driven Understanding of Cloud Structure Formation (Papers Track)
Abstract and authors: (click to expand)

Abstract: The physics of cloud formation and evolution is still not fully understood and constitutes one of the highest uncertainties in climate modeling. We are working on an approach that aims at improving our understanding of how clouds of different structures form from a data-driven perspective: By predicting the visual appearance of cloud photographs from physical quantities obtained from reanalysis data and subsequently attributing the decisions to physical quantities using ``explainable AI'' methods, we try to identify relevant physical processes. At the current stage, this is just a proof of concept, being at least able to identify basic meteorologically plausible facts from data.

Authors: Ann-Christin Wörl (Johannes Gutenberg University); Michael Wand (University of Mainz); Peter Spichtinger (Johannes Gutenberg University)

Interpretable ML Climate Science & Modeling
ICLR 2024 Bee Activity Prediction and Pattern Recognition in Environmental Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: As a consequence of climate change, biodiversity is declining rapidly. Many species like insects, especially bees, suffer from changes in temperature and rainfall patterns. Applying machine learning for monitoring and predicting specie's health and life conditions can help understanding and improving biodiversity. In this work we use data collected from cameras and sensors mounted upon beehives together with different other data sources like weather data, information extracted from satellite images and geographical information. We aim at predicting bees' health (measured as their activity) and analyzing influencing environmental conditions. We show that we are able to accurately predict bees' activity and understand their life conditions by using machine learning algorithms and explainable AI. Understanding these conditions can help to make recommendations on good locations for beehives. This work illustrates the potential of applying machine learning on sensor, satellite and weather data for monitoring and predicting species' health and hence shows the ability for adaptation to climate change and a more accurate species monitoring.

Authors: Christine Preisach (University of Applied Sciences Karlsruhe); Marius Herrmann (Karlsruhe Institute of Technology)

Ecosystems & Biodiversity Interpretable ML
ICLR 2024 Advancing Earth System Model Calibration: A Diffusion-Based Method (Papers Track) Honorable Mention
Abstract and authors: (click to expand)

Abstract: Understanding of climate impact on ecosystems globally requires site-specific model calibration. Here we introduce a novel diffusion-based uncertainty quantification (DBUQ) method for efficient model calibration. DBUQ is a score-based diffusion model that leverages Monte Carlo simulation to estimate the score function and evaluates a simple neural network to quickly generate samples for approximating parameter posterior distributions. DBUQ is stable, efficient, and can effectively calibrate the model given diverse observations, thereby enabling rapid and site-specific model calibration on a global scale. This capability significantly advances Earth system modeling and our understanding of climate impacts on Earth systems. We demonstrate DBUQ's capability in E3SM land model calibration at the Missouri Ozark AmeriFlux forest site. Both synthetic and real-data applications indicate that DBUQ produces accurate parameter posterior distributions similar to those generated by Markov Chain Monte Carlo sampling but with 30X less computing time. This efficiency marks a significant stride in model calibration, paving the way for more effective and timely climate impact analyses.

Authors: Yanfang Liu (Oak Ridge National Laboratory); Dan Lu (Oak Ridge National Laboratory); Zezhong Zhang (Oak Ridge National Laboratory); Feng Bao (Florida State University); Guannan Zhang (Oak Ridge National Laboratory)

Generative Modeling Ecosystems & Biodiversity Uncertainty Quantification & Robustness
ICLR 2024 Towards Downscaling Global AOD with Machine Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Poor air quality represents a significant threat to human health, especially in urban areas. To improve forecasts of air pollutant mass concentrations, there is a need for high-resolution Aerosol Optical Depth (AOD) forecasts as proxy. However, current General Circulation Model (GCM) forecasts of AOD suffer from limited spatial resolution, making it difficult to accurately represent the substantial variability exhibited by AOD at the local scale. To address this, a deep residual convolutional neural network (ResNet) is evaluated for the GCM to local scale downscaling of low-resolution global AOD retrievals, outperforming a non-trainable interpolation baseline. We explore the bias correction potential of our ResNet using global reanalysis data, evaluating it against in-situ AOD observations. The improved resolution from our ResNet can assist in the study of local AOD variations.

Authors: Josh Millar (Imperial College London); Paula Harder (Mila); Lilli J Freischem (University of Oxford); Philipp Weiss (University of Oxford); Philip Stier (University of Oxford)

Climate Science & Modeling Computer Vision & Remote Sensing
ICLR 2024 Reconstructing the Breathless Ocean with Spatio-Temporal Graph Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: The ocean is currently undergoing severe deoxygenation. Accurately reconstructing the breathless ocean is crucial for assessing and protecting marine ecosystem in response to climate change. Existing expert-dominated numerical simulations fail to catch up with the dynamic variation caused by global warming and human activities. Besides, due to the high-cost data collection, the historical observations are severely sparse, leading to big challenge for precise reconstruction. In this work, we propose OxyGenerator, the first spatio-temporal graph learning model, to reconstruct the global ocean deoxygenation from 1920 to 2023. Specifically, to address the heterogeneity across large temporal and spatial scales, we propose zoning-varying graph message-passing to capture the complex oceanographic correlations between missing values and sparse observations. Additionally, to further calibrate the uncertainty, we incorporate inductive bias from dissolved oxygen (DO) variations and chemical effects. Compared with in-situ DO observations, OxyGenerator significantly outperforms CMIP6 numerical simulations, reducing MAPE by 38.77%, demonstrating a promising potential to understand the ocean deoxygenation in data-driven manner.

Authors: Bin Lu (Shanghai Jiao Tong University); Ze Zhao (Shanghai Jiao Tong University); Luyu Han (Shanghai Jiao Tong University); Xiaoying Gan (Shanghai Jiao Tong University); Yuntao Zhou (Shanghai Jiao Tong University); Lei Zhou (Shanghai Jiao Tong Univ); Luoyi Fu (Shanghai Jiao Tong University); Xinbing Wang (Shanghai Jiao Tong University); Chenghu Zhou (Institute of Geographic Sciences and Natural Resources Research, CAS); Jing Zhang (Shanghai Jiao Tong University)

Oceans & Marine Systems Data Mining Time-series Analysis
ICLR 2024 Analyzing the secondary wastewater-treatment process using Faster R-CNN and YOLOv5 object detection algorithms (Papers Track)
Abstract and authors: (click to expand)

Abstract: The activated sludge (AS) process is the most common type of secondary wastewater treatment, applied worldwide. Due to the complexity of microbial communities, imbalances between the different types of bacteria may occur and disturb the process, with pronounced economical and environmental consequences. Microscopic inspection of the morphology of flocs and microorganisms provides key information on AS properties and function. This is a time-consuming, highly skilled, and expensive process that is not readily available in all locations. Thus, most wastewater-treatment plants do not carry out this essential analysis, resulting in frequent operational faults. In this study, we develop a novel deep learning (DL) object detection algorithm to analyze and monitor the AS process based on a unique microscopic image database of flocs and microorganisms. Specifically, we applied YOLOv5 and Faster R-CNN algorithms as tools for segmentation and object detection to analyze the wastewater. The mean average precision (mAP) of the YOLOv5 was 0.67, outperforming the Faster R-CNN by 15%. Histogram equalization preprocessing of both bright-field and phase-contrast images significantly improved the results of the algorithm in all classes. In the case of YOLOv5, the mAP increased by 16.67%, to 0.77, where the AP of protozoa, filaments, and open floc classes outperformed the previous model by over 20%. These results demonstrate the potential of leveraging DL algorithms to enhance the analysis and monitoring of WWTPs in an affordable manner, consequently reducing environmental pollution caused by contaminated effluent. The fundamental challenge addressed herein has important global relevance, especially in an era in which the demand for high-quality wastewater reuse is expected to increase dramatically.

Authors: Offir Inbar (Tel-Aviv University); Moni Shahar (Tel Aviv University); Jacob Gidron (Tel-Aviv University); Ido Cohen (Tel-Aviv University); Dror Avisar (Tel-Aviv University)

Agriculture & Food Cities & Urban Planning Ecosystems & Biodiversity Health Chemistry & Materials Computer Vision & Remote Sensing
ICLR 2024 Diffusion-Based Joint Temperature and Precipitation Emulation of Earth System Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Earth system models (ESMs) are the principal tools used in climate science to generate future climate projections under various atmospheric emissions scenarios on a global or regional scale. Generative deep learning approaches are suitable for emulating these tools due to their computational efficiency and ability, once trained, to generate realizations in a fraction of the time required by ESMs. We extend previous work that used a generative probabilistic diffusion model to emulate ESMs by targeting the joint emulation of multiple variables, temperature and precipitation, by a single diffusion model. Joint generation of multiple variables is critical to generate realistic samples of phenomena resulting from the interplay of multiple variables. The diffusion model emulator takes in the monthly mean-map of temperature and precipitation and produces the daily values of each of these variables that exhibit statistical properties similar to those generated by ESMs. Our results show the outputs from our extended model closely resemble those from ESMs on various climate metrics including dry spells and hot streaks, and that the joint distribution of temperature and precipitation in our sample closely matches those of ESMs.

Authors: Katie Christensen (Western Washington University); Lyric Otto (Western Washington University); Seth Bassetti (Utah State University); Claudia Tebaldi (Joint Global Change Research Institute); Brian Hutchinson (Western Washington University)

Climate Science & Modeling Generative Modeling
ICLR 2024 Verifying Practices of Regenerative Agriculture: African Smallholder Farmer Dataset for Remote Sensing and Machine Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Despite Africa’s contribution to global greenhouse gas (GHG) emissions being only a few %, the continent experiences the harshest impacts, particularly within its food production systems. Regenerative agriculture is receiving a large amount of attention as a method to strengthen both food security and climate change resilience in Africa. For practicing regenerative agriculture, carbon credits are issued, but verifying the methodologies on a large scale is one of the challenging points in popularizing it. In this paper, we provide a comprehensive dataset on regenerative agriculture in sub-Saharan Africa. The dataset has field polygon information and is labeled with several types of regenerative agriculture methodologies. The dataset can be applied to local site analysis, classification, and detection of regenerative agriculture with remote sensing and machine learning. We also highlight several machine learning models and summarize the baseline results on our dataset. We believe that by providing this dataset, we can contribute to the establishment of verification methods for regenerative agriculture. The dataset can be downloaded from https://osf.io/xgp9m/.

Authors: Yohei Nakayama (Degas Ltd); Grace Antwi (Degas Ltd); Seiko Shirasaka (Keio University)

Agriculture & Food Computer Vision & Remote Sensing
ICLR 2024 Semi-Supervised Domain Adaptation for Wildfire Detection (Papers Track)
Abstract and authors: (click to expand)

Abstract: Recently, both the frequency and intensity of wildfires have increased worldwide, primarily due to climate change. In this paper, we propose a novel protocol for wildfire detection, leveraging semi-supervised Domain Adaptation for object detection, accompanied by a corresponding dataset designed for use by both academics and industries. Our dataset encompasses 30 times more diverse labeled scenes than the current largest benchmark wildfire dataset, HPWREN, and introduces a new labeling policy for wildfire detection. Inspired by coordconv., we propose a robust baseline, Location-Aware Object Detection for Semi-Supervised Domain Adaptation (LADA), utilizing a teacher-student based self-supervised Domain Adaptation framework capable of extracting translational variance features characteristic of wildfires. Our framework significantly outperforms the existing baseline by a notable margin of 3.8\%p in mean Average Precision on the HPWREN wildfire dataset.

Authors: Joo Young Jang (Alchera); Youngseo Cha (Alchera); Jisu Kim (Alchera); SooHyung Lee (Alchera); Geonu Lee (Alchera); Minkook Cho (Alchera); Young Hwang (Alchera); Nojun Kwak (Seoul National University)

Unsupervised & Semi-Supervised Learning Disaster Management and Relief Computer Vision & Remote Sensing
ICLR 2024 Identifying Climate Targets in National Laws and Policies using Machine Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Quantified policy targets are a fundamental element of climate policy, typically characterised by domain-specific and technical language. Current methods for curating comprehensive views of global climate policy targets entail significant manual effort. At present there are few scalable methods for extracting climate targets from national laws or policies, which limits policymakers’ and researchers’ ability to (1) assess private and public sector alignment with global goals and (2) inform policy decisions. In this paper we present an approach for extracting mentions of climate targets from national laws and policies. We create an expert-annotated dataset identifying three categories of target (’Net Zero’, ’Reduction’ and ’Other’ (e.g. renewable energy targets)) and train a classifier to reliably identify them in text. We investigate bias and equity impacts related to our model and identify specific years and country names as problematic features. We explore the dataset generated from applying our classifier to the Climate Policy Radar (CPR) dataset, showcasing the potential for automated data collection and research support in climate policy. Our work represents a significant upgrade in the accessibility of these key climate policy elements for policymakers and researchers.

Authors: Matyas Juhasz (Climate Policy Radar); Tina Marchand (Climate Policy Radar); Roshan Melwani (Climate Policy Radar); Kalyan Dutia (Climate Policy Radar); Sarah Goodenough (Climate Policy Radar); Harrison Pim (Climate Policy Radar); Henry Franks (Climate Policy Radar)

Public Policy
ICLR 2024 Empowering Sustainable Finance: Leveraging Large Language Models for Climate-Aware Investments (Papers Track)
Abstract and authors: (click to expand)

Abstract: With the escalating urgency of climate change, it is becoming more imperative for businesses and organizations to align their objectives with sustainability goals. Financial institutions also face a critical mandate to fulfill the Sustainable Development Goals (SDGs), particularly goal 13, which targets the fight against climate change and its consequences. Mitigating the impacts of climate change requires a focus on reducing supply chain emissions, which constitute over 90% of total emission inventories. In the financial industry, supply chain emissions linked to lending and investments emerge as the primary source of emissions, posing challenges in tracking financed emissions due to the intricate process of collecting data from numerous suppliers across the supply chain. To address these challenges, we propose an emission estimation framework utilizing a Large Language Model (LLM) to drastically accelerate the assessment of the emissions associated with lending and investment activities. This framework utilizes financial activities as a proxy for measuring financed emissions. Utilizing the LLM, we classify financial activities into seven asset classes following the Partnership for Carbon Accounting Financials (PCAF) standard. Additionally, we map investments to industry categories and employ spend-based emission factors (kg-CO2/$-spend) to calculate emissions associated with financial investments. In our study, we compare the performance of our proposed method with state-of-the-art text classification models like TF-IDF, word2Vec, and Zero-shot learning. The results demonstrate that the LLM-based approach not only surpasses traditional text mining techniques and performs on par with a subject matter expert (SME) but most importantly accelerates the assessment process.

Authors: Ayush Jain (IBM Research); Manikandan Padmanaban (IBM Research India); Jagabondhu Hazra (IBM Research India); Shantanu Godbole (IBM India); Hendrik Hamann (IBM Research)

Natural Language Processing Supply Chains
ICLR 2024 Near-real-time monitoring of global ocean carbon sink (Papers Track)
Abstract and authors: (click to expand)

Abstract: The ocean, absorbing about 25% of anthropogenic CO2 emissions, plays a crucial role in mitigating climate change. However, the delayed (by one year) traditional estimates of ocean-atmosphere CO2 flux hinder timely understanding and response to the global carbon cycle’s dynamics. Addressing this challenge, we introduce Carbon Monitor Ocean (CMO-NRT), a pioneering dataset providing near-real-time, monthly gridded estimates of global surface ocean fugacity of CO2 (fCO2) and ocean-atmosphere CO2 flux from January 2022 to July 2023. This dataset marks a significant advancement by updating the global carbon budget’s estimates through a fusion of data from 10 Global Ocean Biogeochemical Models (GOBMs) and 8 data products into a near-real-time analysis framework. By harnessing the power of Convolutional Neural Networks (CNNs) and semi-supervised learning techniques, we decode the complex nonlinear relationships between model or product estimates and observed environmental predictors. The predictive models, both for GOBM and data products, exhibit exceptional accuracy, with root mean square errors (RMSEs) maintaining below the 5% threshold. This advancement supports more effective climate change mitigation efforts by providing scientists and policymakers with timely and accurate data.

Authors: Xiaofan Gui (Microsoft Research); Jiang Bian (Microsoft Research)

Climate Science & Modeling Unsupervised & Semi-Supervised Learning
ICLR 2024 WindDragon: enhancing wind power forecasting with automated deep learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Achieving net zero carbon emissions by 2050 requires the integration of increasing amounts of wind power into power grids. This energy source poses a challenge to system operators due to its variability and uncertainty. Therefore, accurate forecasting of wind power is critical for grid operation and system balancing. This paper presents an innovative approach to short-term (1 to 6 hour horizon) wind power forecasting at a national level. The method leverages Automated Deep Learning combined with Numerical Weather Predictions wind speed maps to accurately forecast wind power.

Authors: Julie Keisler (INRIA, EDF R&D); Etienne Le Naour (Sorbonne University, EDF R&D)

Power & Energy
ICLR 2024 Neural Processes for Short-Term Forecasting of Weather Attributes (Papers Track)
Abstract and authors: (click to expand)

Abstract: Traditional weather prediction models rely on solving complex physical equations, with long computation time. Machine learning models can process large amount of data more quickly. We propose to use neural processes (NPs) for short-term weather attributes forecasting. This is a novel avenue of research, as previous work has focused on NPs for long-term forecasting. We compare a multi-task neural process (MTNP) to an ensemble of independent single-task NPs (STNP) and to an ensemble of Gaussian processes (GPs). We use time series data for multiple weather attributes from Chichester Harbour over a one-week period. We evaluate performance in terms of NLL and MSE with 2-hours and 6-hours time horizons. When limited context information is provided, the MTNP leverages inter-task knowledge and outperforms the STNP. The STNP outperforms both the MTNP and the GPs ensemble when a sufficient, but not exceeding, amount of context information is provided.

Authors: Benedetta L Mussati (University of Oxford); Helen McKay (Mind Foundry); Stephen Roberts (University of Oxford)

Time-series Analysis Meta- and Transfer Learning
ICLR 2024 EU Climate Change News Index: Forecasting EU ETS prices with online news (Papers Track)
Abstract and authors: (click to expand)

Abstract: Carbon emission allowance prices have been rapidly increasing in the EU since 2018 and accurate forecasting of EU Emissions Trading System (ETS) prices has become essential. This paper proposes a novel method to generate alternative predictors for daily ETS price returns using relevant online news information. We devise the EU Climate Change News Index by calculating the term frequency–inverse document frequency (TF–IDF) feature for climate change-related keywords. The index is capable of tracking the ongoing debate about climate change in the EU. Finally, we show that incorporating the index in a simple predictive model significantly improves forecasts of ETS price returns.

Authors: Aron Pap (BGSE); Aron D Hartvig (Corvinus University of Budapest, Cambridge Econometrics); Péter Pálos (Budapest University of Technology and Economics)

Climate Finance & Economics Power & Energy Public Policy Natural Language Processing Time-series Analysis
ICLR 2024 Explaining Zeolite Synthesis-Structure Relationships using Aggregated SHAP Analysis (Papers Track)
Abstract and authors: (click to expand)

Abstract: Zeolites, crystalline aluminosilicate materials with well-defined porous structures, have emerged as versatile materials with applications in carbon capture. Hydrothermal synthesis is a widely used method for zeolite production, offering control over crystallinity and and pore size. However, the intricate interplay of synthesis parameters necessitates a comprehensive understanding to optimize the synthesis process. We train a supervised classification machine learning model on ZeoSyn (a dataset of zeolite synthesis routes) to predict the zeolite framework product given a synthesis route. Subsequently, we leverage SHapley Additive Explanations (SHAP) to reveal key synthesis-structure relationships in zeolites. To that end, we introduce an aggregation SHAP approach to extend such analysis to explain the formation of composite building units (CBUs) of zeolites. Analysis at this unprecedented scale sheds light on key synthesis parameters driving zeolite crystallization.

Authors: Elton Pan (MIT)

Chemistry & Materials Carbon Capture & Sequestration Interpretable ML
ICLR 2024 Fast non-stationary geospatial modelling with multiresolution (wavelet) Gaussian processes (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate modelling tasks involve assimilating large amounts of geospatial data from different sources, such as simulators and measurements from weather stations and satellites. These sources of data are weighted according to their uncertainty, so good quality uncertainty estimates are essential. Gaussian processes (GPs) offer flexible models with uncertainty estimates, and have a long track record of use in geospatial modelling. However, much of the research effort, including recent work on scalability, is focused on statistically stationary models, which are not suitable for many climatic variables, such as precipitation. Here we propose a novel, scalable, nonstationary GP model based upon discrete wavelets, and evaluate them on toy and real world data.

Authors: Talay M Cheema (University of Cambridge); Carl Edward Rasmussen (Cambridge University)

Causal & Bayesian Methods Climate Science & Modeling Generative Modeling
ICLR 2024 PostRainBench: A comprehensive benchmark and a new model for precipitation forecasting (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurate precipitation forecasting is a vital challenge of societal importance. Though data-driven approaches have emerged as a widely used solution, solely relying on data-driven approaches has limitations in modeling the underlying physics, making accurate predictions difficult. We focus on the Numerical Weather Prediction (NWP) post-processing based precipitation forecasting task to couple Machine Learning techniques with traditional NWP. This task remains challenging due to the imbalanced precipitation data and complex relationships between multiple meteorological variables. To address these limitations, we introduce the PostRainBench, a comprehensive multi-variable NWP post-processing benchmark, and CAMT, a simple yet effective Channel Attention Enhanced Multi-task Learning framework with a specially designed weighted loss function. Extensive experimental results on the proposed benchmark show that our method outperforms state-of-the-art methods by 6.3%, 4.7%, and 26.8% in rain CSI and improvements of 15.6%, 17.4%, and 31.8% over NWP predictions in heavy rain CSI on respective datasets. Most notably, our model is the first deep learning-based method to outperform NWP approaches in heavy rain conditions. These results highlight the potential impact of our model in reducing the severe consequences of extreme rainfall events. Our datasets and code are available at https://github.com/yyyujintang/PostRainBench.

Authors: Yujin Tang (The Hong Kong University of Science and Technology (Guangzhou)); Jiaming Zhou (Hong Kong University of Science and Technology (Guangzhou)); Xiang Pan (Nanjing University); Zeying Gong (Hong Kong University of Science and Technology (Guangzhou)); Junwei Liang (The Hong Kong University of Science and Technology (Guangzhou))

Extreme Weather Computer Vision & Remote Sensing
ICLR 2024 Physics-informed Machine Learning-based Cloud Microphysics parameterization for Earth System Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: In this study, we develop a physics-informed machine learning (ML)-based cloud microphysics parameterization for the ICON model. By training the ML parameterization on high-resolution simulation data, we aim to improve Earth System Models (ESMs) in comparison to traditional parameterization schemes. We investigate the usage of a multilayer perceptron (MLP) with feature engineering and physics-constraints, and use explainability techniques to understand the relationship between input features and model output. Our novel approach yields promising results, with the physics-informed ML-based cloud microphysics parameterization achieving an R$^2$ score up to 0.777 for an individual feature. Additionally, we demonstrate a notable improvement in the overall performance in comparison to a baseline MLP, increasing its average R$^2$ score from 0.290 to 0.613 across all variables. This approach to improve the representation of cloud microphysics in ESMs promises to enhance climate projections, contributing to a better understanding of climate change.

Authors: Ellen Sarauer (German Aerospace Center (DLR)); Mierk Schwabe (Deutsches Zentrum für Luft- und Raumfahrt e.V. (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany); Philipp Weiss (University of Oxford); Axel Lauer (Deutsches Zentrum für Luft- und Raumfahrt e.V. (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany); Philip Stier (University of Oxford); Veronika Eyring (Deutsches Zentrum für Luft- und Raumfahrt e.V. (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany; University of Bremen, Institute of Environmental Physics (IUP), Bremen, Germany)

Climate Science & Modeling Interpretable ML
ICLR 2024 From spectra to biophysical insights: end-to-end learning with a biased radiative transfer model (Papers Track)
Abstract and authors: (click to expand)

Abstract: Advances in machine learning have boosted the use of Earth observation data for climate change research. Yet, the interpretability of machine-learned representations remains a challenge, particularly in understanding forests' biophysical reactions to climate change. Traditional methods in remote sensing that invert radiative transfer models (RTMs) to retrieve biophysical variables from spectral data often fail to account for biases inherent in the RTM, especially for complex forests. We propose to integrate RTMs into an auto-encoder architecture, creating an end-to-end learning approach. Our method not only corrects biases in RTMs but also outperforms traditional techniques for variable retrieval like neural network regression. Furthermore, our framework has potential generally for inverting biased physical models.

Authors: Yihang She (University of Cambridge); Clement Atzberger (Mantle Labs); Andrew Blake (University of Cambridge, Mantle Labs); Srinivasan Keshav (University of Cambridge)

Earth Observation & Monitoring Interpretable ML
ICLR 2024 Model Failure or Data Corruption? Exploring Inconsistencies in Building Energy Ratings with Self-Supervised Contrastive Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Building Energy Rating (BER) stands as a pivotal metric, enabling building owners, policymakers, and urban planners to understand the energy-saving potential through improving building energy efficiency. As such, enhancing buildings' BER levels is expected to directly contribute to the reduction of carbon emissions and promote climate improvement. Nonetheless, the BER assessment process is vulnerable to missing and inaccurate measurements. In this study, we introduce CLEAR, a data-driven approach designed to scrutinize the inconsistencies in BER assessments through self-supervised contrastive learning. We validated the effectiveness of CLEAR using a dataset representing Irish building stocks. Our experiments uncovered evidence of inconsistent BER assessments, highlighting measurement data corruption within this real-world dataset.

Authors: Qian Xiao (Trinity College Dublin); Dan Liu (Trinity College Dublin); Kevin Credit (Maynooth University)

Buildings Data Mining Interpretable ML Unsupervised & Semi-Supervised Learning
ICLR 2024 Interpretable Machine Learning for Extreme Events detection: An application to droughts in the Po River Basin (Papers Track)
Abstract and authors: (click to expand)

Abstract: The increasing frequency and intensity of drought events-periods of significant decrease in water availability-are among the most alarming impacts of climate change. Monitoring and detecting these events is essential to mitigate their impact on our society. However, traditional drought indices often fail to accurately detect such impacts as they mostly focus on single precursors. In this study, we leverage machine learning algorithms to define a novel data-driven, impact-based drought index reproducing as target the Vegetation Health Index, a satellite signal that directly assesses the vegetation status. We first apply novel dimensionality reduction methods that allow for interpretable spatial aggregation of features related to precipitation, temperature, snow, and lakes. Then, we select the most informative and non-redundant features through filter feature selection. Finally, linear supervised learning methods are considered, given the small number of samples and the aim of preserving interpretability. The experimental setting focuses on ten sub-basins of the Po River basin, but the aim is to design a machine learning-based workflow applicable on a large scale.

Authors: Paolo Bonetti (Politecnico di Milano); Matteo Giuliani (Politecnico di Milano); Veronica Cardigliano (Politecnico di Milano); Alberto Maria Metelli (Politecnico di Milano); Marcello Restelli (Politecnico di Milano); Andrea Castelletti (Politecnico di Milano)

Agriculture & Food Extreme Weather Interpretable ML
ICLR 2024 Literature Mining with Large Language Models to Assist the Development of Sustainable Building Materials (Papers Track)
Abstract and authors: (click to expand)

Abstract: Concrete industry, as one of the significant sources of carbon emissions, drives the urgency for its decarbonization that requires a shift to alternative materials. However, the absence of systematic knowledge summary remains a challenge for further development of sustainable building materials. This work offers a cost-efficient strategy for information extraction tasks in complex terminology settings using small (2.8B) large language models (LLMs) with well-designed instruction-completion schemes and fine-tuning strategies, introducing a dataset cataloging civil engineering applications of alternative materials. The Multiple Choice instruction scheme significantly improves model accuracies in entity inference from non-Noun-Phrase sources, with supervised fine-tuning benefiting from straightforward tokenized representations of choices. We also demonstrate the utility of the dataset by extracting valuable insights into promising applications of alternative materials from knowledge graph representations.

Authors: Yifei Duan (Massachusetts Institute of Technology); Yixi Tian (Massachusetts Institute of Technology); Soumya Ghosh (IBM Research); Richard Goodwin (IBM T.J. Watson Research Center); Vineeth Venugopal (Massachusetts Institute of Technology); Jeremy Gregory (Massachusetts Institute of Technology); Jie Chen (IBM Research); Elsa Olivetti (Massachusetts Institute of Technology)

Natural Language Processing Buildings Chemistry & Materials
ICLR 2024 A Deep Learning Technology Suite for Cost-Effective Sequestered CO2 Monitoring (Papers Track)
Abstract and authors: (click to expand)

Abstract: Carbon capture and storage (CCS) is a way of reducing carbon emissions to help tackle global warming. Injecting CO2 into rock formations and preventing it from escaping to the surface is a main step in a CCS project. Therefore, monitoring of geologically sequestered CO2 is important for CCS security assessment. Time-lapse seismic (4D seismic) is one of the most effective tools for CO2 monitoring. Unfortunately, the main challenge of 4D seismic is the high cost due to repeated monitoring seismic data acquisition surveys and the subsequent time-consuming data processing that involves imaging and inversion. To address this, we developed a technology suite powered by deep learning engines that significantly reduces the cost by (1) acquiring very sparse monitoring data; (2) firing multiple seismic sources simultaneously; (3) converting 2D images to 3D volume; (4) enforcing repeatability between baseline data and monitoring data; and (5) nonlinearly mapping seismic data to subsurface property model to bypass complex wave-equation-based seismic data processing procedures.

Authors: Wenyi Hu (SLB); Son Phan (SLB); Cen Li (SLB); Aria Abubakar (SLB)

Carbon Capture & Sequestration Earth Observation & Monitoring Climate Science & Modeling Hybrid Physical Models Unsupervised & Semi-Supervised Learning
ICLR 2024 On the potential of Optimal Transport in Geospatial Data Science (Papers Track)
Abstract and authors: (click to expand)

Abstract: Prediction problems in geographic information science and transportation are often motivated by the possibility to enhance operational efficiency and thereby reduce emissions. Examples range from predicting car sharing demand for relocation planning to forecasting traffic congestion for navigation purposes. However, conventional accuracy metrics ignore the spatial distribution of the errors, despite its relevance for operations. Here, we put forward a spatially aware evaluation metric and loss function based on Optimal Transport (OT). Our framework leverages partial OT and can minimize relocation costs in any spatial prediction problem. We showcase the advantages of OT-based evaluation over conventional metrics and further demonstrate the application of an OT loss function for improving forecasts of bike sharing demand and charging station occupancy. Thus, our framework not only aligns with operational considerations, but also signifies a step forward in refining predictions within geospatial applications. All code is available at https://github.com/mie-lab/geospatial_optimal_transport.

Authors: Nina V Wiedemann (ETH Zurich); Martin Raubal (ETH Zürich)

Time-series Analysis Climate Science & Modeling Transportation
ICLR 2024 ExioML: Eco-economic dataset for Machine Learning in Global Sectoral Sustainability (Papers Track)
Abstract and authors: (click to expand)

Abstract: The Environmental Extended Multi-Regional Input-Output analysis is the predominant Ecological Economic research framework for analysing the environmental impact of economic activities. This paper introduces the novel ExioML dataset as the first Machine Learning benchmark data in sustainability analysis. We open-sourced the ExioML data and development toolkit to lower barriers and accelerate the cooperation between Machine Learning and Ecological Economic research. A crucial greenhouse gas emission regression task evaluates the usability of the proposed dataset. We compared the performance of traditional shallow models against deep models by leveraging a diverse factor accounting table and incorporating multiple modalities of categorical and numerical features. Our findings reveal that deep and ensemble models achieve low mean square errors below 0.25 and serve as a future machine learning research baseline. Through Ex- ioML, we aim to foster precise ML predictions and modelling to support climate actions and sustainable investment decisions. The data and codes are available: https://github.com/Yvnminc/ExioML

Authors: Yanming Guo (University of Sydney)

Climate Finance & Economics
ICLR 2024 FARADAY: SYNTHETIC SMART METER GENERATOR FOR THE SMART GRID (Papers Track)
Abstract and authors: (click to expand)

Abstract: Access to smart meter data is essential to rapid and successful transitions to elec- trified grids, underpinned by flexibility delivered by low carbon technologies, such as electric vehicles (EV) and heat pumps, and powered by renewable energy. Yet little of this data is available for research and modelling purposes due consumer privacy protections. Whilst many are calling for raw datasets to be unlocked through regulatory changes, we believe this approach will take too long. Synthetic data addresses these challenges directly by overcoming privacy issues. In this pa- per, we present Faraday, a Variational Auto-encoder (VAE)-based model trained over 300 million smart meter data readings from an energy supplier in the UK, with information such as property type and low carbon technologies (LCTs) own- ership. The model produces household-level synthetic load profiles conditioned on these labels, and we compare its outputs against actual substation readings to show how the model can be used for real-world applications by grid modellers interested in modelling energy grids of the future.

Authors: Sheng Chai (Centre for Net Zero); Gus Chadney (Centre for Net Zero)

Generative Modeling
ICLR 2024 Generalizable Temperature Nowcasting with Physics-Constrained RNNs for Predictive Maintenance of Wind Turbine Components (Papers Track)
Abstract and authors: (click to expand)

Abstract: Machine learning plays an important role in the operation of current wind energy production systems. One central application is predictive maintenance to increase efficiency and lower electricity costs by reducing downtimes. Integrating physics-based knowledge in neural networks to enforce their physical plausibilty is a promising method to improve current approaches, but incomplete system information often impedes their application in real world scenarios. We describe a simple and efficient way for physics-constrained deep learning-based predictive maintenance for wind turbine gearbox bearings with partial system knowledge. The approach is based on temperature nowcasting constrained by physics, where unknown system coefficients are treated as learnable neural network parameters. Results show improved generalization performance to unseen environments compared to a baseline neural network, which is especially important in low data scenarios often encountered in real-world applications.

Authors: Johannes Exenberger (Graz University of Technology); Matteo Di Salvo (Sirius Energy Automation); Thomas Hirsch (Graz University of Technology); Franz Wotawa (Graz University of Technology, Institute for Software Technology); Gerald Schweiger (Graz University of Technology)

Hybrid Physical Models Time-series Analysis
ICLR 2024 Forecasting Tropical Cyclones with Cascaded Diffusion Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: As tropical cyclones become more intense due to climate change, the rise of Al-based modelling provides a more affordable and accessible approach compared to traditional methods based on mathematical models. This work leverages generative diffusion models to forecast cyclone trajectories and precipitation patterns by integrating satellite imaging, remote sensing, and atmospheric data. It employs a cascaded approach that incorporates three main tasks: forecasting, super-resolution, and precipitation modelling. The training dataset includes 51 cyclones from six major tropical cyclone basins from January 2019 - March 2023. Experiments demonstrate that the final forecasts from the cascaded models show accurate predictions up to a 36-hour rollout, with excellent Structural Similarity (SSIM) and Peak-Signal-To-Noise Ratio (PSNR) values exceeding 0.5 and 20 dB, respectively, for all three tasks. The 36-hour forecasts can be produced in as little as 30 mins on a single Nvidia A30/RTX 2080 Ti. This work also highlights the promising efficiency of Al methods such as diffusion models for high-performance needs in weather forecasting, such as tropical cyclone forecasting, while remaining computationally affordable, making them ideal for highly vulnerable regions with critical forecasting needs and financial limitations. Code accessible at \url{https://github.com/nathzi1505/forecast-diffmodels}.

Authors: Pritthijit Nath (Imperial College London); Pancham Shukla (Imperial College London); Shuai Wang (University of Delaware); Cesar Quilodran-Casas (Imperial College London)

Generative Modeling Extreme Weather
ICLR 2024 Forecasting regional PV power in Great Britain with a multi-modal late fusion network (Papers Track)
Abstract and authors: (click to expand)

Abstract: The ability to forecast solar photovoltaic (PV) power is important for grid balancing and reducing the CO2 intensity of electricity globally. The use of multi-modal data such as numerical weather predictions (NWPs) and satellite imagery can be harnessed to make more accurate PV forecasts. In this work, we propose a late fusion model which integrates two different NWP sources alongside satellite images to make 0-8 hour lead time forecasts for grid regions across Great Britain. We limit the model inputs to be reflective of those available in a live production system. We show how the different input data sources contribute to average error at each time horizon and compare against a simple baseline.

Authors: James Fulton (Open Climate Fix); Jacob Bieker (Open Climate Fix); Peter Dudfield (Open Climate Fix); Solomon Cotton (Open Climate Fix); Zakari Watts (Open Climate Fix); Jack Kelly (Open Climate Fix)

Power & Energy Computer Vision & Remote Sensing
ICLR 2024 Exploring Graph Neural Networks to Predict the Seagrasses Ecosystem State in the Italian Seas (Papers Track)
Abstract and authors: (click to expand)

Abstract: Marine coastal ecosystems (MCEs) play a critical role in climate change adaptation and human well-being. However, they face global threats from environmental pressures, both related to climate change (CC) and direct human impacts. Leveraging the increasing availability of geospatial data, this study explores Graph Neural Networks (GNNs) to assess cumulative impacts arising from human and CC related pressures on the Seagrass ecosystem in the Italian seas. Unlike traditional machine learning (ML) models with which they were compared in this study, GNNs incorporate the spatial component of data through graph structures. While experimental results demonstrate a modest performance improvement in GNNs, the study is constrained by limited data availability, preventing the exploration of the temporal component and physical laws representable through graph structures. Future efforts aim to collect higher-resolution spatial and temporal data, considering expressible environmental processes, to enhance model learning.

Authors: Angelica Bianconi (University School for Advanced Studies (IUSS) Pavia & Ca’ Foscari University of Venice); Sebastiano Vascon (Ca' Foscari University of Venice & European Centre for Living Technology); Elisa Furlan (Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC) & Ca' Foscari University of Venice); Andrea Critto (Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC) & Ca' Foscari University of Venice)

Ecosystems & Biodiversity Oceans & Marine Systems
ICLR 2024 Global High Resolution CO2 monitoring using Super Resolution (Papers Track)
Abstract and authors: (click to expand)

Abstract: Monitoring Greenhouse Gases (GHGs) concentrations and emissions is essential to mitigate climate change. Thanks to the large amount of satellite data available, it is now possible to understand GHGs' behaviours at a broad scale. However, due to remote sensing devices technological limitations, the task of global high resolution (HR) monitoring remains an open problem. To avoid waiting for new missions and better data to be generated, it is therefore relevant to experiment with processing methods able to improve existing datasets. Our paper proposes to apply Super Resolution (SR), a Deep Learning (DL) approach commonly used in Computer Vision (CV), on global L3 satellite data. We produce a daily high resolution global CO2 dataset that opens the door for globally consistent point source monitoring.

Authors: Andrianirina Rakotoharisoa (Imperial College London); Rossella Arcucci (Imperial College London)

Computer Vision & Remote Sensing Earth Observation & Monitoring
ICLR 2024 DiffObs: Generative Diffusion for Global Forecasting of Satellite Observations (Papers Track)
Abstract and authors: (click to expand)

Abstract: This work presents an autoregressive generative diffusion model (DiffObs) to predict the global evolution of daily precipitation, trained on a satellite observational product, and assessed with domain-specific diagnostics. The model is trained to probabilistically forecast day-ahead precipitation. Nonetheless, it is stable for multi-month rollouts, which reveal a qualitatively realistic superposition of convectively coupled wave modes in the tropics. Cross-spectral analysis confirms successful generation of low frequency variations associated with the Madden--Julian oscillation, which regulates most subseasonal to seasonal predictability in the observed atmosphere, and convectively coupled moist Kelvin waves with approximately correct dispersion relationships. Despite secondary issues and biases, the results affirm the potential for a next generation of global diffusion models trained on increasingly sparse, and increasingly direct and differentiated observations of the world, for practical applications in subseasonal and climate prediction.

Authors: Jason Stock (Colorado State University); Jaideep Pathak (NVIDIA Corporation); Yair Cohen (NVIDIA Corporation); Mike Pritchard (NVIDIA Corporation); Piyush Garg (NVIDIA); Dale Durran (NVIDIA Corporation); Morteza Mardani (NVIDIA Corporation); Noah D Brenowitz (NVIDIA)

Climate Science & Modeling Computer Vision & Remote Sensing Generative Modeling
ICLR 2024 A cautionary tale about deep learning-based climate emulators (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate models are computationally too expensive for many tasks, such as, rapidly exploring future impacts of climate policies. Thus, since the 1980s scientists have been developing lightweight approximations or emulators of climate models. Recently, deep learning has been proposed for this task and most commonly been evaluated on the benchmark ClimateBenchv1.0. We implemented a linear regression-based model from the 1990s with 30K parameters, called linear pattern scaling, that is now the 'best' model on ClimateBenchv1.0 -- outperforming the incumbent 100M-parameter foundation model, ClimaX, on the spatial error of 3 out of the 4 variables. Nevertheless, climate emulation might benefit from innovations in machine learning and we analyse two aspects that need to be addressed in future emulators: First, the data complexity depends strongly on the climate variable of interest and the chosen spatiotemporal resolution. Second, current benchmarks do not sufficiently address the large impact of interannual variability in the climate system. We have published our analysis as an interactive tutorial at github.com/ygaxolotl/tags-linear-pattern-scaling.

Authors: Björn Lütjens (Massachussets Institute of Technology); Raffaele Ferrari (Massachusetts Institute of Technology); Paolo Giani (Massachusetts Institute of Technology); Dava Newman (MIT); Andre Souza (Massachusetts Institute of Technology); Duncan Watson-Parris (University of California San Diego); Noelle Selin (Massachusetts Institute of Technology)

Climate Science & Modeling Hybrid Physical Models Time-series Analysis
ICLR 2024 CausalPrompt: Enhancing LLMs with Weakly Supervised Causal Reasoning for Robust Performance in Non-Language Tasks (Papers Track)
Abstract and authors: (click to expand)

Abstract: In confronting the pressing issue of climate change, we introduce "CausalPrompt", an innovative prompting strategy that adapts large language models (LLMs) for classification and regression tasks through the application of weakly supervised causal reasoning. We delve into the complexities of data shifts within energy systems, often resulting from the dynamic evolution of sensor networks, leading to discrepancies between training and test data distributions or feature inconsistencies. By embedding domain-specific reasoning in the finetuning process, CausalPrompt significantly bolsters the adaptability and resilience of energy systems to these shifts. We show that CausalPrompt significantly enhances predictions in scenarios characterized by feature shifts, including electricity demand, solar power generation, and cybersecurity within energy infrastructures. This approach underlines the crucial role of CausalPrompt in enhancing the reliability and precision of predictions in energy systems amid feature shifts, highlighting its significance and potential for real-world applications in energy management and cybersecurity, contributing effectively to climate change mitigation efforts.

Authors: Tung-Wei Lin (University of California, Berkeley); Vanshaj Khattar (Virginia Tech); Yuxuan Huang (University College London); Junho Hong (University of Michigan); Ruoxi Jia (Virginia Tech); Chen-Ching Liu (Virginia Tech); Alberto L Sangiovanni-Vincentelli (University of California, Berkeley); Ming Jin (Virginia Tech)

Natural Language Processing Buildings Power & Energy
ICLR 2024 Estimating the age of buildings from satellite and morphological features to create a pan-EU Digital Building Stock Model (Papers Track)
Abstract and authors: (click to expand)

Abstract: The acceleration in the effects of global warming and the recent turbulences in the energy market are further highlighting the need to act quicker and smarter in terms of decisions to transition to greener energy and reduce our overall energy consumption. With buildings accounting for about 40% of the energy consumption in Europe, it is crucial to have a comprehensive understanding of the building stock and their energy-related characteristics, including their age, in order to make informed decisions for energy savings. This study introduces a novel way to approach the age estimation of buildings at scale, using a machine learning method that integrates satellite-based imagery with morphological features of buildings. The findings demonstrate the benefits of combining these data sources and underscore the importance of incorporating local data to enable accurate prediction across different cities.

Authors: Jeremias Wenzel (Universiteit Twente); Ana M. Martinez (European Commission - Joint Research Centre); Pietro Florio (European Commission - Joint Research Centre); Katarzyna Goch (Institute of Geography and Spatial Organization Polish Academy of Sciences)

Buildings Cities & Urban Planning
ICLR 2024 Probabilistic electricity price forecasting through conformalized deep ensembles (Papers Track)
Abstract and authors: (click to expand)

Abstract: Probabilistic electricity price forecasting (PEPF) is subject of an increasing interest, following the demand for proper prediction uncertainty quantification, to support the operation in complex power markets with increasing share of renewable generation. Distributional neural networks ensembles (DE) have been recently shown to outperform state of the art PEPF benchmarks. Still, they require reliability improvements, as fail to pass the coverage tests at various steps on the prediction horizon. In this work, we tackle this issue by extending the DE framework with the introduction of a Conformal Prediction based technique. Experiments have been conducted on multiple market regions, achieving day-ahead probabilistic forecasts with better hourly coverage.

Authors: Alessandro Brusaferri (National Research Council of Italy); Andrea Ballarino (National Research Council of Italy); Luigi Grossi (University of Parma); Fabrizio Laurini (University of Parma)

Power & Energy
ICLR 2024 Interpretable Machine Learning for power systems: Establishing Confidence in SHapley Additive exPlanationS (Papers Track)
Abstract and authors: (click to expand)

Abstract: Interpretable Machine Learning (IML) is expected to remove significant barriers for the application of Machine Learning (ML) algorithms in power systems. This work first seeks to showcase the benefits of SHapley Additive exPlanations (SHAP) for understanding the outcomes of ML models, which are increasingly being used to optimise power systems with increasing share of Renewable Energy (RE), to support worldwide calls for decarbonisation and climate change. To do so, we demonstrate that the Power Transfer Distribution Factors (PTDF)—a power system physics-based linear sensitivity index—can be derived from the SHAP values. To do so, we take the derivatives of SHAP values from a ML model trained to learn line-flows from generator power-injections, using a DC power-flow case in a benchmark test network. In demonstrating that SHAP values can be related back to the physics that underpin the power system, we build confidence in the explanations SHAP can offer.

Authors: Tabia Ahmad (University of Strathclyde); Robert Hamilton (Shell); Panagiotis Papadopoulos (University of Manchester); Samuel Chevalier (University of Vermont); Ilgiz Murzakhanov (Technical University of Denmark); Rahul Nellikkath (Technical University of Denmark); Jochen Bernhard Stiasny (Technical University of Denmark); Spyros Chatzivasileiadis (Technical University of Denmark)

Power & Energy Interpretable ML
ICLR 2024 DETECTION OF METEOROLOGICAL VARIABLES IN A WIND FARM INFLUENCING THE EXTREME WIND SPEED BY HETEROGENEOUS GRANGER CAUSALITY (Papers Track)
Abstract and authors: (click to expand)

Abstract: For an efficiently managed wind farm and wind power generation under adverse weather, knowledge of meteorological parameters influencing wind speed is of crucial importance for optimized and improved forecasts. We investigate temporal effects of wind speed related processes such as wakes within the wind farm using the Heterogeneous Graphical Granger model. The ERA5 meteorological reanalysis was used to generate wind farm power production data in Eastern Austria. We evaluated six different scenarios for the hydrological half-year period, based on moderate wind speed and varying temporal intervals of low or high extreme wind speed This allows to carry out causal reasoning about possible causes of extreme wind speed in a wind farm. A set of causal parameters for each of the scenarios was discovered enabling future early warning and for taking management measures for wind farm power generation management under adverse weather conditions.

Authors: Katerina Schindlerova (UniVie); Irene Schicker (Geos); Kejsi Hoxhallari (UniVie); Claudia Plant (University of Vienna, Austria)

Causal & Bayesian Methods Extreme Weather
ICLR 2024 Machine Learning for the Detection of Arctic Melt Ponds from Infrared Imagery (Papers Track)
Abstract and authors: (click to expand)

Abstract: Melt ponds are pools of water on Arctic summer sea ice that play an important role in the Arctic climate system. Retrieving their coverage is essential to better understand and predict the rapidly changing Arctic, but current data are limited. The goal of this study is to enhance melt pond data by developing a method that segments thermal infrared (TIR) airborne imagery into melt pond, sea ice, and ocean classes. Due to temporally and spatially varying surface temperatures, we use a data-driven deep learning approach to solve this task. We adapt and fine-tune AutoSAM, a Segment Anything-based segmentation model. We make the code, data, and models available online.

Authors: Marlena Reil (University of Osnabrück and University of Bremen, Institute of Environmental Physics); Gunnar Spreen (University of Bremen, Institute of Environmental Physics); Marcus Huntemann (University of Bremen, Institute of Environmental Physics); Lena Buth (Alfred Wegener Institute); Dennis Wilson (University of Toulouse, ISAE-Supaero)

Earth Observation & Monitoring Computer Vision & Remote Sensing
ICLR 2024 Identifying Complex Dynamics of Power Grid Frequency (Papers Track)
Abstract and authors: (click to expand)

Abstract: The energy system is undergoing rapid changes to integrate a growing number of intermittent renewable generators and facilitate the broader transition toward sustainability. As millions of consumers and thousands of (volatile) generators are connected to the same synchronous grid, no straightforward bottom-up models describing the dynamics are available on a continental scale comprising all of these necessary details. Hence, to identify this unknown power grid dynamics, we propose to leverage the Sparse Identification of Nonlinear Dynamics (SINDy) method. Thereby, we unveil the governing equations underlying the dynamical system directly from data measurements. Investigating the power grids of Iceland, Ireland and the Balearic islands as sample systems, we observe structurally similar dynamics with remarkable differences in both quantitative and qualitative behavior. Overall, we demonstrate how complex, i.e. non-linear, noisy, and time-dependent, dynamics can be identified straightforwardly.

Authors: Xinyi Wen (Karlsruhe Institute of Technology); Ulrich Oberhofer (Karlsruhe Institute of Technology); Leonardo Rydin Gorjão (Norwegian University of Life Sciences); G.Cigdem YALCIN (Istanbul University); Veit Hagenmeyer (Karlsruhe Institute of Technology (KIT)); Benjamin Schäfer (Karlsruhe Institute of Technology)

Power & Energy Time-series Analysis
ICLR 2024 Imbalance-aware Presence-only Loss Function for Species Distribution Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: In the face of significant biodiversity decline, species distribution models (SDMs) are essential for understanding the impact of climate change on species habitats by connecting environmental conditions to species occurrences. Traditionally limited by a scarcity of species observations, these models have significantly improved in performance through the integration of larger datasets provided by citizen science initiatives. However, they still suffer from the strong class imbalance between species within these datasets, often resulting in the penalization of rare species--those most critical for conservation efforts. To tackle this issue, this study assesses the effectiveness of training deep learning models using a balanced presence-only loss function on various large citizen science-based datasets. We demonstrate that this imbalance-aware loss function outperforms traditional loss functions across various datasets and tasks, particularly in accurately modeling rare species with limited observations.

Authors: Robin Zbinden (EPFL); Nina van Tiel (EPFL); Marc Rußwurm (Wageningen University); Devis Tuia (EPFL)

Ecosystems & Biodiversity
ICLR 2024 Deep Gaussian Processes and inversion for decision support in model-based climate change mitigation and adaptation problems (Papers Track)
Abstract and authors: (click to expand)

Abstract: To inform their decisions, policy makers often rely on models developed by researchers that are computationally intensive and complex and that frequently run on High Performance Computers (HPC). These decision-support models are not used directly by deciders and the results of these models tend to be presented by experts as a limited number of potential scenarios that would result from a limited number of potential policy choices. Machine learning models such as Deep Gaussian Processes (DGPs) can be used to radically re-define how decision makers can use models by creating a ‘surrogate model’ or ‘emulator’ of the original model. Surrogate models can then be embedded into apps that decisions makers can use to directly explore a vast array of policy options corresponding to potential target outcomes (model inversion). To illustrate the mechanism, we give an example of application that is envisaged as part of the UK government’s Net Zero strategy. To achieve Net Zero CO2 emissions by 2050, the UK government is considering multiple options that include planting trees to capture carbon. However, the amount of CO2 captured by the trees depend on a large number of factors that include climate conditions, soil type, soil carbon, tree type, ... Depending on these factors the net balance of carbon removal after planting trees may not necessarily be positive. Hence, choosing the right place to plant the right tree is very important. A decision-helping model has been developed to tackle this problem. For a given policy input, the model outputs its impact in terms of CO2 sequestration, biodiversity and other ecosystem services. We show how DGPs can be used to create a surrogate model of this original afforestation model and how these can be embedded into an R shiny app that can then be directly used by decision makers.

Authors: bertrand nortier (University of Exeter); daniel williamson (University of Exeter); mattia mancini (University of Exeter); amy binner (University of Exeter); brett day (University of Exeter); ian bateman (University of Exeter)

Public Policy Carbon Capture & Sequestration Ecosystems & Biodiversity Forests Causal & Bayesian Methods Uncertainty Quantification & Robustness
ICLR 2024 Generalized Policy Learning for Smart Grids: FL TRPO Approach (Papers Track)
Abstract and authors: (click to expand)

Abstract: The smart grid domain requires bolstering the capabilities of existing energy management systems; Federated Learning (FL) aligns with this goal as it demonstrates a remarkable ability to train models on heterogeneous datasets while maintaining data privacy, making it suitable for smart grid applications, which often involve disparate data distributions and interdependencies among features that hinder the suitability of linear models. This paper introduces a framework that combines FL with a Trust Region Policy Optimization (FL TRPO) aiming to reduce energy-associated emissions and costs. Our approach reveals latent interconnections and employs personalized encoding methods to capture unique insights, understanding the relationships between features and optimal strategies, allowing our model to generalize to previously unseen data. Experimental results validate the robustness of our approach, affirming its proficiency in effectively learning policy models for smart grid challenges.

Authors: Yunxiang LI (MBZUAI); Nicolas M Cuadrado (MBZUAI); Samuel Horváth (MBZUAI); Martin Takac (Mohamed bin Zayed University of Artificial Intelligence)

Power & Energy Reinforcement Learning
ICLR 2024 A Deep Learning Framework to Efficiently Estimate Precipitation at the Convection Permitting Scale (Papers Track)
Abstract and authors: (click to expand)

Abstract: Precipitation-related extreme events are rapidly growing due to climate change, emphasizing the need for accurate hazard projections. To effectively model the convective phenomena driving severe precipitation, high-resolution estimates are crucial. Existing methods struggle with either insufficient expressiveness in capturing complex convective dynamics, due to the low resolution, or excessive computational demands. In response, we propose an innovative deep learning framework that efficiently harnesses available data to yield precise results. This model, based on graph neural networks, utilises two grids with different resolution and two sets of edges to represent spatial relationships. Employing as input ERA5 reanalysis atmospheric variables on an approximately 25 km grid, the framework produces hourly precipitation estimates on a finer 3 km grid. Findings are promising in accurately capturing yearly precipitation distribution and estimating cumulative precipitation during extreme events. Notably, the model demonstrates effectiveness in spatial regions not included in the training, motivating further exploration of its transferability potential.

Authors: Valentina Blasone (University of Trieste); Erika Coppola (Earth System Physics Section, ICTP, Trieste); Guido Sanguinetti (SISSA); Viplove Arora (Theoretical and Scientific Data Science, SISSA, Trieste); Serafina Di Gioia (Earth System Physics Section, ICTP, Trieste); Luca Bortolussi (University of Trieste)

Climate Science & Modeling Extreme Weather
ICLR 2024 Global Vegetation Modeling With Pre-Trained Weather Transformers (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurate vegetation models can produce further insights into the complex inter-action between vegetation activity and ecosystem processes. Previous research has established that long-term trends and short-term variability of temperature and precipitation affect vegetation activity. Motivated by the recent success of Transformer-based Deep Learning models for medium-range weather forecasting, we adapt the publicly available pre-trained FourCastNet to model vegetation activity while accounting for the short-term dynamics of climate variability. We investigate how the learned global representation of the atmosphere’s state can be transferred to model the normalized difference vegetation index (NDVI). Our model globally estimates vegetation activity at a resolution of 0.25◦ while relying only on meteorological data. We demonstrate that leveraging pre-trained weather models improves the NDVI estimates compared to learning an NDVI model from scratch. Additionally, we compare our results to other recent data-driven NDVI modeling approaches from machine learning and ecology literature. We further provide experimental evidence on how much data and training time is necessary to turn FourCastNet into an effective vegetation model. Code and models are available at https://github.com/LSX-UniWue/Global-Ecosystem-Modeling.

Authors: Pascal Janetzky (University Wuerzburg); Florian Gallusser (Universität Würzburg); Simon Hentschel (Julius-Maximilians-Universität of Würzburg); Andreas Hotho (University of Wuerzburg); Anna Krause (Universität Würzburg, Department of Computer Science, CHair X Data Science)

Ecosystems & Biodiversity Climate Science & Modeling Data Mining
ICLR 2024 Building Ocean Climate Emulators (Papers Track)
Abstract and authors: (click to expand)

Abstract: The current explosion in machine learning for climate has led to skilled, computationally cheap emulators for the atmosphere. However, the research for ocean emulators remains nascent despite the large potential for accelerating coupled climate simulations and improving ocean forecasts on all timescales. There are several fundamental questions to address that can facilitate the creation of ocean emulators. Here we focus on two questions: 1) the role of the atmosphere in improving the extended skill of the emulator and 2) the representation of variables with distinct timescales (e.g., velocity and temperature) in the design of any emulator. In tackling these questions, we show stable prediction of surface fields for over 8 years, training and testing on data from a high-resolution coupled climate model, using results from four regions of the globe. Our work lays out a set of physically motivated guidelines for building ocean climate emulators.

Authors: Adam Subel (New York University); Laure Zanna (New York University)

Climate Science & Modeling
ICLR 2024 Neural Tree Reconstruction for the Open Forest Observatory (Papers Track)
Abstract and authors: (click to expand)

Abstract: The Open Forest Observatory (OFO) is a collaboration across universities and other partners to make low-cost forest mapping accessible to ecologists, land managers, and the general public. The OFO is building both a database of geospatial forest data as well as open-source methods and tools for forest mapping by un- crewed aerial vehicle. Such data are useful for a variety of climate applications including prioritizing reforestation efforts, informing wildfire hazard reduction, and monitoring carbon sequestration. In the current iteration of the OFO’s forest map database, 3D tree maps are created using classical structure-from-motion techniques. This approach is prone to artifacts, lacks detail, and has particular difficulty on the forest floor where the input data (overhead imagery) has limited visibility. These reconstruction errors can potentially propagate to the down- stream scientific tasks (e.g. a wildfire simulation.) Advances in 3D reconstruction, including methods like Neural Radiance Fields (NeRF), produce higher quality results that are more robust to sparse views and support data-driven priors. We explore ways to incorporate NeRFs into the OFO dataset, outline future work to support even more state-of-the-art 3D vision models, and describe the importance of high-quality 3D reconstructions for forestry applications.

Authors: Marissa Ramirez de Chanlatte (UC Berkeley); Arjun Rewari (Darrell Group, Berkeley AI Research Lab); Trevor Darrell (UC Berkeley); Derek Young (University of California Davis)

Forests Carbon Capture & Sequestration
ICLR 2024 Graph Neural Network Based Screening of Metal-Organic Frameworks for CO2 Capture (Papers Track)
Abstract and authors: (click to expand)

Abstract: Our ability to capture and remove carbon dioxide (CO2) at gigaton scale within a decade or two depends on our ability to quickly identify new materials that are high performing, selective over other gases with low energy demand and then further develop them for large scale deployment. As a proven technology for gas separation in other industrial applications, metal-organic frameworks (MOF) come in virtually unlimited number of crystal combinations in their highly porous lattice and may offer the solution for CO2 capture from atmosphere or industrial point sources. Although MOFs can have highly complex crystal structure, which cannot be easily exploited in tabular data format in conventional ML methods or more recent Deep Learning methods, Graph Neural Networks can easily be trained on their representative crystallographic information file (CIF) content. In this work, we train GNNs to create an end-to-end workflow to screen large number of MOF crystal structures directly from the data within the crystallographic information files for their CO2 working capacity or CO2/N2 selectivity under low-pressure conditions. Our preliminary results show that a simple 2-layered Graph Convolution Networks (GCN) can easily achieve R2 score in the range of 0.87 to 0.89, easily.

Authors: Zikri Bayraktar (Schlumberger Doll Research); Mengying Li (Schlumberger Doll Research); Shahnawaz Molla (Schlumberger Doll Research)

Carbon Capture & Sequestration
ICLR 2024 Predicting Species Occurrence Patterns from Partial Observations (Papers Track)
Abstract and authors: (click to expand)

Abstract: To address the interlinked biodiversity and climate crises, we need an understanding of where species occur and how these patterns are changing. However, observational data on most species remains very limited, and the amount of data available varies greatly between taxonomic groups. We introduce the problem of predicting species occurrence patterns given (a) satellite imagery, and (b) known information on the occurrence of other species. To evaluate algorithms on this task, we introduce SatButterfly, a dataset of satellite images, environmental data and observational data for butterflies, which is designed to pair with the existing SatBird dataset of bird observational data. To address this task, we propose a general model, R-Tran, for predicting species occurrence patterns that enables the use of partial observational data wherever found. We find that R-Tran outperforms other methods in predicting species encounter rates with partial information both within a taxon (birds) and across taxa (birds and butterflies). Our approach opens new perspectives to leveraging insights from species with abundant data to other species with scarce data, by modelling the ecosystems in which they co-occur.

Authors: Hager Radi Abdelwahed (Mila: Quebec AI Institute); Mélisande Teng (Mila, Université de Montréal); David Rolnick (MIT)

Ecosystems & Biodiversity Computer Vision & Remote Sensing
ICLR 2024 Towards Scalable Deep Species Distribution Modelling using Global Remote Sensing (Papers Track)
Abstract and authors: (click to expand)

Abstract: Destruction of natural habitats and anthropogenic climate change are threatening biodiversity globally. Addressing this loss necessitates enhanced monitoring techniques to assess the impact of environmental shifts and to guide policy-making efforts. Species distribution models are crucial tools that predict species locations by interpolating observed field data with environmental information. We develop an improved, scalable method for species distribution modelling by proposing a dataset pipeline that incorporates global remote sensing imagery, land use classification data, environmental variables, and observation data, and utilising this with convolutional neural network (CNN) models to predict species presence at higher spatial and temporal resolutions than well-established species distribution modelling methods. We apply our approach to modelling Protea species distributions in the Cape Floristic Region of South Africa, demonstrating its performance in a region of high biodiversity. We train two CNN models and compare their performance to Maxent, a popular conventional species distribution modelling method. We find that the CNN models trained with remote sensing data outperform Maxent, underscoring the potential of our method as an effective and scalable solution for modelling species distribution.

Authors: Emily Morris (University of Cambridge); Anil Madhavapeddy (University of Cambridge); Sadiq Jaffer (University of Cambridge); David Coomes (University of Cambridge)

Ecosystems & Biodiversity Computer Vision & Remote Sensing
ICLR 2024 Valuation and Profit Allocation for Electric Vehicle Battery Data in a Data Market (Proposals Track)
Abstract and authors: (click to expand)

Abstract: This paper delves into the realm of electric vehicle (EV) battery data trading markets, focusing on data valuation and revenue allocation. In the face of fast-developing electric mobility, the safety of EV batteries becomes more and more important, driving the need for robust anomaly detection models. For newly found EV companies lacking extensive data, data markets offer a solution, facilitated by trading platforms. We shape this landscape, outline a transaction process involving data buyers, data sellers, and platforms. Our exploration extends to data valuation methodologies, encompassing the classic Shapley value and the least core algorithm. Considering the complicated mechanisms in EV battery, we unveil a deep learning framework for anomaly detection, treating EV batteries as dynamic systems. To explain data value from an economic perspective, we utilize a utility function considering the direct economic costs saved for the EV company to refine the evaluation process. Based on data value, we further propose revenue allocation schemes to allocate part of EV company's revenue to data sellers, offering diverse perspectives on fair and equitable profit distribution. A case study is conducted based on real world EV battery dataset to illustrate how the different revenue allocation schemes allocate payoffs to data sellers.

Authors: Junkang Chen (Peking University); Guannan He (Peking University)

Data Mining Transportation Time-series Analysis
ICLR 2024 SkyImageNet: Towards a large-scale sky image dataset for solar power forecasting (Proposals Track)
Abstract and authors: (click to expand)

Abstract: The variability of solar photovoltaic (PV) output, particularly that caused by rapidly changing cloud dynamics, challenges the reliability of renewable energy systems. Solar forecasting based on cloud observations collected by ground-level sky cameras shows promising performance in anticipating short-term solar power fluctuations. However, current deep learning methods often rely on a single dataset with limited sample diversity for training, and thus generalize poorly to new locations and different sky conditions. Moreover, the lack of a standardized dataset hinders the consistent comparison of existing solar forecasting methods. To close these gaps, we propose to build a large-scale standardized sky image dataset --- SkyImageNet --- by assembling, harmonizing, and processing suitable open-source datasets collected in various geographical locations. An accompanying python package will be developed to streamline the process of utilizing SkyImageNet in a machine learning framework. We hope that the outcomes of this project will foster the development of more robust forecasting systems, advance the comparability of short-term solar forecasting model performances, and further facilitate the transition to the next generation of sustainable energy systems.

Authors: Yuhao Nie (Massachusetts Institute of Technology); Quentin Paletta (European Space Research Institute); Sherrie Wang (MIT)

Computer Vision & Remote Sensing Earth Observation & Monitoring Meta- and Transfer Learning
ICLR 2024 An Adaptive Hydropower Management Approach for Downstream Ecosystem Preservation (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Hydropower plants play a pivotal role in advancing clean and sustainable energy production, contributing significantly to the global transition towards renewable energy sources. However, hydropower plants are currently perceived both positively as sources of renewable energy and negatively as disruptors of ecosystems. In this work, we highlight the overlooked potential of using hydropower plant as protectors of ecosystems by using adaptive ecological discharges. To advocate for this perspective, we propose using a neural network to predict the minimum ecological discharge value at each desired time. Additionally, we present a novel framework that seamlessly integrates it into hydropower management software, taking advantage of the well-established approach of using traditional constrained optimisation algorithms. This novel approach not only protects the ecosystems from climate change but also contributes to potentially increase the electricity production.

Authors: Cecília Coelho (University of Minho); Ming Jin (Virginia Tech); M. Fernanda P. Costa (Dep. Mathematics, University of Minho); Luís L. Ferrás (University of Porto)

Ecosystems & Biodiversity Climate Science & Modeling Power & Energy Time-series Analysis
ICLR 2024 A Benchmark Dataset for Meteorological Downscaling (Proposals Track)
Abstract and authors: (click to expand)

Abstract: High spatial resolution in atmospheric representations is crucial across Earth science domains, but global reanalysis datasets like ERA5 often lack the detail to capture local phenomena due to their coarse resolution. Recent efforts have leveraged deep neural networks from computer vision to enhance the spatial resolution of meteorological data, showing promise for statistical downscaling. However, methodological diversity and insufficient comparisons with traditional downscaling techniques challenge these advancements. Our study introduces a benchmark dataset for statistical downscaling, utilizing ERA5 and the finer-resolution COSMO-REA6, to facilitate direct comparisons of downscaling methods for 2m temperature, global (solar) irradiance and 100m wind fields. Accompanying U-Net, GAN, and transformer models with a suite of evaluation metrics aim to standardize assessments and promote transparency and confidence in applying deep learning to meteorological downscaling.

Authors: Michael Langguth (Juelich Supercomputing Centre - Forschungszentrum Juelich); Paula Harder (Mila); Irene Schicker (Geos); Ankit Patnala (Juelich Supercomputing Centre - Forschungszentrum Juelich); Sebastian Lehner (GeoSphere Austria); Konrad Mayer (GeoSphere Austria); Markus Dabernig (GeoSphere Austria)

Climate Science & Modeling Earth Observation & Monitoring Power & Energy Generative Modeling
ICLR 2024 Enabling Clean Energy Resilience with Machine Learning-Empowered Underground Hydrogen Storage (Proposals Track)
Abstract and authors: (click to expand)

Abstract: To address the urgent challenge of climate change, there is a critical need to transition away from fossil fuels towards sustainable energy systems, with renewable energy sources playing a pivotal role. However, the inherent variability of renewable energy, without effective storage solutions, often leads to imbalances between energy supply and demand. Underground Hydrogen Storage (UHS) emerges as a promising long-term storage solution to bridge this gap, yet its widespread implementation is impeded by the high computational costs associated with high fidelity UHS simulations. This paper introduces UHS from a data-driven perspective and outlines a roadmap for integrating machine learning into UHS, thereby facilitating the large-scale deployment of UHS.

Authors: Alvaro R Carbonero Gonzales (Los Alamos National Lab); Shaowen Mao (Los Alamos National Laboratory); Mohamed Mehana (Los Alamos National Lab)

Carbon Capture & Sequestration Computer Vision & Remote Sensing
ICLR 2024 Adjustment of ocean carbon sink predictions with an emission-driven Earth system model using a deep neural network (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Near-term predictions of the Global Carbon Budget (GCB) with Earth system models (ESMs) driven by specified CO2 emissions were used to inform the GCB annual update for the first time in 2023. These predictions are biased because they are initialized indirectly from the ESMs response to physical observational constraints, and because the ESMs themselves are imperfect representations of the climate system. We propose a deep learning-based post-processing method to adjust GCB predictions using an autoencoder, which outperforms standard bias and trend correction methods.

Authors: Reinel Sospedra-Alfonso (Environment and Climate Change Canada); Parsa Gooya (Environment and Climate Change Canada); Johannes Exenberger (Graz University of Technology)

Climate Science & Modeling
ICLR 2024 Calibrating Earth System Models with Bayesian Optimal Experimental Design (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Earth system models (ESMs) are complex climate simulations that are critical for projecting future climate change and its impacts. However, running ESMs is extremely computationally expensive, limiting the number of simulations that can be performed. This results in significant uncertainty in key climate metrics estimated from ESM ensembles. We propose a Bayesian optimal experimental design (BOED) approach to efficiently calibrate ESM simulations to observational data by actively selecting the most informative input parameters. BOED optimises the expected information gain (EIG) to select the ESM input parameter to reduce the final uncertainty estimates in the climate metrics of interest. Initial results on a synthetic benchmark demonstrate our approach can more efficiently reduce uncertainty compared to common sampling schemes like Latin hypercube sampling.

Authors: Tim Reichelt (University of Oxford); Shahine Bouabid (University of Oxford); Luke Ong (University of Oxford); Duncan Watson-Parris (University of California San Diego); Tom Rainforth (University of Oxford)

Active Learning Climate Science & Modeling Causal & Bayesian Methods
ICLR 2024 Severe Wind Event Prediction with Multivariate Physics-Informed Deep Learning (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Wind turbines play a crucial role in combating climate change by harnessing the force of the wind to generate clean and renewable energy. One key factor in ensuring the long-term effectiveness of wind turbines is the reduction of operating costs due to maintenance. Severe weather events, such as extreme changes in wind, can damage turbines, resulting in costly maintenance and economic losses in power production. We propose a preliminary physics-informed deep learning model to improve predictions of severe wind events and a multivariate time series extension for this work.

Authors: Willa Potosnak (Carnegie Mellon University); Cristian I Challu (Carnegie Mellon University); Kin G. Olivares (Carnegie Mellon University); James K Miller (Carnegie Mellon University); Artur Dubrawski (Carnegie Mellon University)

Hybrid Physical Models Extreme Weather Power & Energy Time-series Analysis
ICLR 2024 Empowering Safe Reinforcement Learning in Power System Control with CommonPower (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: Reinforcement learning (RL) has become a valuable tool for addressing complex decision-making problems in power system control. However, the unique intricacies of this domain necessitate the development of specialized RL algorithms. While benchmarking problems have proven effective in advancing algorithm development in various domains, existing suites do not enable a systematic study of two key challenges in power system control: ensuring adherence to physical constraints and evaluating the impact of forecast accuracy on controller performance. This tutorial introduces the sophisticated capabilities of the CommonPower toolbox, designed to address these overlooked challenges. We guide participants in composing benchmark problems within CommonPower, leveraging predefined components, and demonstrate the creation of new components. We showcase the training of a safe RL agent to solve a benchmark problem, comparing its performance against a built-in MPC baseline. Notably, CommonPower's symbolic modeling approach enables the automatic derivation of safety shields for vanilla RL algorithms. We explain the theory behind this feature in a concise introduction to the field of safe RL. Furthermore, we present CommonPower's interface for seamlessly integrating diverse forecasting strategies into the system. The workshop emphasizes the significance of safeguarding vanilla RL algorithms and encourages researchers to systematically investigate the influence of forecast uncertainties in their experiments.

Authors: Hannah Markgraf (Technical University of Munich); Michael Eichelbeck (Technical University of Munich); Matthias Althoff (Technical University of Munich)

Reinforcement Learning Buildings
ICLR 2024 Planning for Floods & Droughts: Intro to AI-Driven Hydrological Modeling (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: This tutorial presents an AI-driven hydrological modeling approach to advance predictions of extreme hydrological events, including floods and droughts, which are of significant socioeconomic concerns. Traditionally, physics-based hydrological models have been the mainstay for simulating rainfall-runoff dynamics and forecasting streamflow. These models, while effective, are constrained by limitations in our systematic understanding and an inability to incorporate heterogeneous data. Recently, the surge in availability of multi-scale, multi-modal hydrological data has spurred the adoption of data-driven machine learning (ML) techniques. These methods have shown promising predictive performance. However, they often struggle with generalization and reliability, especially under climate change. This tutorial introduces physics-informed ML, by leveraging data and domain knowledge, to improve prediction accuracy and trustworthiness. We will delve into uncertainty quantification methods for probabilistic predictions that are vital for climate-resilient planning in managing floods and droughts. Participants will be guided through a comprehensive workflow, encompassing data analysis, model construction, and model evaluation. This tutorial is designed to elevate researchers’ understanding of hydrological systems and provide practitioners with robust, climate-resilient water management tools. These tools are instrumental in facilitating informed decision-making, crucial in the context of climate adaptation strategies. Participants will learn: ● Heterogeneous climate and hydrology data analysis ● State-of-the-art neural network models for rainfall-runoff modeling. ● ML model construction, training, validating, and testing ● Multiple ways to build a physics-informed ML model ● Uncertainty quantification in ML model predictions. All code and data will be publicly available for researchers/practitioners to build their own models.

Authors: Kshitij Tayal (Oak Ridge National Lab); Arvind Renganathan (University of Minnesota); Dan Lu (Oak Ridge National Laboratory)

Climate Science & Modeling Oceans & Marine Systems Data Mining Time-series Analysis
ICLR 2024 Understanding drivers of climate extremes using regime-specific causal graphs (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: The climate system is intricate, involving numerous interactions among various components at multiple spatio-temporal scales. This complexity poses a significant challenge in understanding and predicting weather extremes within the Earth's climate system. However, a better understanding of the dynamics of such events is crucial due to their profound impact on ecosystems, economies, and worldwide communities. This tutorial will offer a comprehensive guide on using Regime-PCMCI (Saggioro et al., 2020), a constraint-based causal discovery technique, to uncover the causal relationships governing anomalous climate phenomena. Regime-PCMCI is designed to uncover causal relationships in time-series where transitions between regimes exist, and different causal relationships may govern each regime. In this tutorial, we will first discuss how to frame the problem of understanding climate and weather extremes using regime-specific causal discovery. We will shortly introduce constraint-based causal discovery and present the Regime-PCMCI algorithm. To enable participants to gain hands-on experience with the algorithm, we will apply Regime-PCMCI, implemented in the open-source Python package Tigramite (https://github.com/jakobrunge/tigramite), to a real-world climate science problem. Our example will focus on validating hypothesized regime-specific causal graphs that describe the causal relationship between atmospheric circulation, temperature, rainfall, evaporation, and soil moisture under various moisture regimes. Our tutorial will cover essential steps such as data preprocessing, parameter selection, and interpretation of results, ensuring that all participants with a basic understanding of climate science or data analysis can grasp the presented concepts. With this tutorial, we wish to equip participants with the skills to apply Regime-PCMCI in their research to further uncover complex mechanisms in climate science, as this knowledge is crucial for more informed policy-making.

Authors: Oana-Iuliana Popescu (Institute of Data Science, German Aerospace Center (DLR)); Wiebke Günther (German Aerospace Center); Raed Hamed (Institute for Environmental Studies, VU Amsterdam); Dominik Schumacher (4Institute for Atmospheric and Climate Science, ETH Zürich); Martin Rabel (DLR); Dim Coumou (IVM/VU); Jakob Runge (Institute of Data Science, German Aerospace Center (DLR))

Causal & Bayesian Methods Climate Science & Modeling Time-series Analysis
ICLR 2024 One Prompt Fits All: Visual Prompt-Tuning for Remote Sensing Segmentation (Tutorials Track) Audience Choice
Abstract and authors: (click to expand)

Abstract: Image segmentation is crucial in climate change research for analyzing satellite imagery. This technique is vital for ecosystems mapping, natural disasters assessment, and urban and agricultural planning. The advent of vision-based foundational models like the Segment Anything Model (SAM) opens new avenues in climate research and remote sensing (RS). SAM can perform segmentation tasks on any object from manually-crafted prompts. However, the efficacy of SAM largely depends on the quality of these prompts. This issue is particularly pronounced with RS data, which are inherently complex. To use SAM for accurate segmentation at scale for RS, one would need to create complex prompts for each image, which typically involves selecting dozens of points. To address this, we introduce Prompt-Tuned SAM (PT-SAM), a method that minimizes the need for manual input through a trainable, lightweight prompt embedding. This embedding captures key semantic information for specific objects of interest that would be applicable to unseen images. Our approach merges the zero-shot generalization capabilities of the pre-trained SAM model with supervised learning. Importantly, the training process for the prompt embedding not only has minimal hardware requirements, allowing it to be conducted on a CPU, but it also requires only a small dataset. With PT-SAM, image segmentation on RS data can be performed at scale without human intervention, achieving accuracies comparable to those of human-designed prompts with SAM. For example, PT-SAM can be used for analyzing forest cover across vast areas, a key factor in understanding the impact of human activities on forests. Its capability to segment a multitude of images makes it ideal for monitoring widespread land-cover changes, providing deeper insights into urbanization. This tutorial will explore how to train and utilize PT-SAM for large-scale segmentation tasks, specifically focusing on training embeddings that capture forests, and buildings.

Authors: Marshall Wang (Vector Institute); John Willes (Vector Institute); Deval Pandya (Vector Institute)

Computer Vision & Remote Sensing Earth Observation & Monitoring Forests
ICLR 2024 1. Building Sustainable Futures: Tutorial on Carbon Footprint Analysis and Mitigation Strategies Using Counter Factual Queries (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: As the sense of urgency regarding climate change continues to mount with growing regulatory pressure across the globe, it has become increasingly crucial for enterprises and governments to align their goals with sustainability values. They face a crucial imperative to act on climate change mitigation by disclosing their GHG emissions and committing to reduction and optimization of emissions from their industrial activities including operations, infrastructure, logistics, and supply chains. The world's largest enterprises have set long-term net-zero targets but lacks an integrated view of how their key business operations and processes contribute to their sustainability journey, which makes it difficult for them to embark on a well-planned journey to achieve their sustainability goals. With the recent advancement, AI intervention becomes imperative to measure, track, and improve ESG performance to achieve sustainability goals. This tutorial aims to provide a comprehensive guide on leveraging advanced AI techniques for analysing and mitigating carbon footprints in various sectors. The tutorial covers the utilization of a generalized framework that integrates sector-specific and cross-sector enterprise data, including assets and operations, to derive actionable insights. The framework also uses additional data such as weather parameters and contextual information to facilitate a holistic approach to carbon footprint analysis and its mitigation strategies. The tutorial will delve into the working of a framework which comprises of an LLM driven carbon accounting engine, predictive models for carbon emissions, anomaly detection models, and counterfactual models. It identifies the emission hotspots, thereafter provides actionable recommendations to mitigate the carbon emission. The proposed tutorial aims to empower participants with the knowledge and skills to make informed decisions towards building a more sustainable future

Authors: Kumar Saurav (IBM); Manikandan Padmanaban (IBM Research India); Ayush Jain (IBM Research); Jagabondhu Hazra (IBM Research India)

Climate Science & Modeling Recommender Systems
NeurIPS 2023 Machine learning for gap-filling in greenhouse gas emissions databases (Papers Track)
Abstract and authors: (click to expand)

Abstract: Greenhouse Gas (GHG) emissions datasets are often incomplete due to inconsistent reporting and poor transparency. Filling the gaps in these datasets allows for more accurate targeting of strategies to accelerate the reduction of GHG emissions. This study evaluates the potential of machine learning methods to automate the completion of GHG datasets. We use 3 datasets of increasing complexity with 18 different gap-filling methods and provide a guide to which methods are useful in which circumstances. If few dataset features are available, or the gap consists only of a missing time step in a record, then simple interpolation is often the most accurate method and complex models should be avoided. However, if more features are available and the gap involves non-reporting emitters, then machine learning methods can be more accurate than simple extrapolation. Furthermore, the secondary output of feature importance from complex models allows for data collection prioritisation to accelerate the improvement of datasets. Graph based methods are particularly scalable due to the ease of updating predictions given new data and incorporating multimodal data sources. This study can serve as a guide to the community upon which to base ever more integrated frameworks for automated detailed GHG emissions estimations, and implementation guidance is available at https://hackmd.io/@luke-scot/ML-for-GHG-database-completion.

Authors: Luke Cullen (University of Cambridge); Andrea Marinoni (UiT the Arctic University of Norway); Jonathan M Cullen (University of Cambridge)

Public Policy
NeurIPS 2023 EarthPT: a foundation model for Earth Observation (Papers Track)
Abstract and authors: (click to expand)

Abstract: We introduce EarthPT -- an Earth Observation (EO) pretrained transformer. EarthPT is a 700 million parameter decoding transformer foundation model trained in an autoregressive self-supervised manner and developed specifically with EO use-cases in mind. We demonstrate that EarthPT is an effective forecaster that can accurately predict future pixel-level surface reflectances across the 400-2300 nm range well into the future. For example, forecasts of the evolution of the Normalised Difference Vegetation Index (NDVI) have a typical error of approximately 0.05 (over a natural range of -1 -> 1) at the pixel level over a five month test set horizon, out-performing simple phase-folded models based on historical averaging. We also demonstrate that embeddings learnt by EarthPT hold semantically meaningful information and could be exploited for downstream tasks such as highly granular, dynamic land use classification. Excitingly, we note that the abundance of EO data provides us with -- in theory -- quadrillions of training tokens. Therefore, if we assume that EarthPT follows neural scaling laws akin to those derived for Large Language Models (LLMs), there is currently no data-imposed limit to scaling EarthPT and other similar ‘Large Observation Models.’

Authors: Michael J Smith (Aspia Space); Luke Fleming (Aspia Space); James Geach (Aspia Space)

Earth Observation & Monitoring
NeurIPS 2023 Towards Understanding Climate Change Perceptions: A Social Media Dataset (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate perceptions shared on social media are an invaluable barometer of public attention. By directing research towards this topic, we can eventually improve the effectiveness of climate change communication, increase public engagement, and enhance climate change education. We propose two real-world image datasets to promote impactful research both in the Computer Vision community and beyond. Firstly, ClimateTV, a dataset containing over 700,000 climate change-related images posted on Twitter and labelled on basis of the image hashtags. Secondly, ClimateCT, a Twitter dataset containing images with five-dimensional annotations in super-categories (i) Animals, (ii) Climate action, (iii) Consequences, (iv) Setting, and (v) Type. These challenging classification datasets contain classes which are designed according to their relevance in the context of climate change. The challenging nature of the datasets is given by varying class diversities (e.g. polar bear vs. land mammal) and foci (e.g. arctic vs. snowy residential area). The analyses of our datasets using CLIP embeddings and query optimization (CoCoOp) further showcase the challenging nature of ClimateTV and ClimateCT.

Authors: Katharina Prasse (University of Siegen); Steffen Jung (MPII); Isaac B Bravo (Technische Universität München); Stefanie Walter (Technical University of Munich); Margret Keuper (University of Siegen, Max Planck Institute for Informatics)

Behavioral and Social Science
NeurIPS 2023 Adaptive-Labeling for Enhancing Remote Sensing Cloud Understanding (Papers Track)
Abstract and authors: (click to expand)

Abstract: Cloud analysis is a critical component of weather and climate science, impacting various sectors like disaster management. However, achieving fine-grained cloud analysis, such as cloud segmentation, in remote sensing remains challenging due to the inherent difficulties in obtaining accurate labels, leading to significant labeling errors in training data. Existing methods often assume the availability of reliable segmentation annotations, limiting their overall performance. To address this inherent limitation, we introduce an innovative model-agnostic Cloud Adaptive-Labeling (CAL) approach, which operates iteratively to enhance the quality of training data annotations and consequently improve the performance of the learned model. Our methodology commences by training a cloud segmentation model using the original annotations. Subsequently, it introduces a trainable pixel intensity threshold for adaptively labeling the cloud training images on-the-fly. The newly generated labels are then employed to fine-tune the model. Extensive experiments conducted on multiple standard cloud segmentation benchmarks demonstrate the effectiveness of our approach in significantly boosting the performance of existing segmentation models. Our CAL method establishes new state-of-the-art results when compared to a wide array of existing alternatives.

Authors: Jay Gala (NMIMS); Sauradip Nag (University of Surrey); Huichou Huang (City University of Hong Kong); Ruirui Liu (Brunel University London); Xiatian Zhu (University of Surrey)

Climate Science & Modeling
NeurIPS 2023 Flamingo: Environmental Impact Factor Matching for Life Cycle Assessment with Zero-Shot ML (Papers Track)
Abstract and authors: (click to expand)

Abstract: Consumer products contribute to >75% of global greenhouse gas (GHG) emissions, primarily through indirect contributions from the supply chain. Measurement of GHG emissions associated with products is crucial to quantify the impact of GHG emission abatement actions. Life cycle assessment (LCA), the scientific discipline for measuring GHG emissions, estimates the environmental impact of a product. Scaling LCA to millions of products is challenging as it requires extensive manual analysis by domain experts. To avoid repetitive analysis, environmental impact factors (EIF) of common materials and products are published for use by experts. However, finding appropriate EIFs for even a single product can require hundreds of hours of manual work, especially for complex products. We present Flamingo, an algorithm that leverages neural language models to automatically identify an appropriate EIF given a text description. A key challenge in automation is that EIF databases are incomplete. Flamingo uses industry sector classification as an intermediate layer to identify when there are no good matches in the database. On a dataset of 664 products, Flamingo achieves an EIF matching precision of 75%.

Authors: Bharathan Balaji (Amazon); Venkata Sai Gargeya Vunnava (amazon); Nina Domingo (Amazon); Shikhar Gupta (Amazon); Harsh Gupta (Amazon); Geoffrey Guest (Amazon); Aravind Srinivasan (Amazon); Kellen Axten (Amazon); Jared Kramer (Amazon)

Natural Language Processing
NeurIPS 2023 AtmoRep: A stochastic model of atmosphere dynamics using large scale representation learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: AtmoRep is a novel, task-independent stochastic computer model of atmospheric dynamics inspired by the concept of foundation models in natural language processing, like the GPT line or PalmX, applied in the context of Earth system science. The main innovative aspect consists in the fact that the model can skillfully solve scientific tasks it was not specifically trained on, clearly exhibiting in-context learning capabilities. AtmoRep's skill has been tested on nowcasting, temporal interpolation, model correction, and counterfactuals, demonstrating that large-scale neural networks can provide skillful, task-independent models able to complement the existing numerical approaches in multiple applications. In addition, the authors also demonstrated the possibility to further increase the model accuracy by fine tuning it directly on observational data for tasks such as precipitation corrections or downscaling.

Authors: ilaria luise (CERN)

Climate Science & Modeling
NeurIPS 2023 Artificial Intelligence for Methane Mitigation : Through an Automated Determination of Oil and Gas Methane Emissions Profiles (Papers Track)
Abstract and authors: (click to expand)

Abstract: The oil and gas sector is the second largest anthropogenic emitter of methane, which is responsible for approximately 25% of global warming since pre-industrial times. In order to mitigate methane atmospheric emissions from oil and gas industry, the potential emitting infrastructure must be monitored. Initiatives such as the Methane Alert and Response System (MARS), launched by the United Nations Environment Program, aim to locate significant emissions events, alert relevant stakeholders, as well as monitor and track progress in mitigation efforts. To achieve this goal, an automated solution is needed for consistent monitoring across multiple oil and gas basins around the world. Most methane emissions analysis studies propose post-emission analysis. The works and future guidelines presented in this paper aim to provide an automated collection of informed methane emissions by oil and gas site and infrastructure which are necessary to dress emission profile in near real time. This proposed framework also permits to create action margins to reduce methane emissions by passing from post methane emissions analysis to forecasting methods.

Authors: Jade Eva Guisiano (Sorbonne / ISEP / Polytechnique / UNEP); Thomas LAUVAUX (Université de Reims); Eric Moulines (Ecole Polytechnique); Jérémie Sublime (ISEP)

Earth Observation & Monitoring
NeurIPS 2023 Augmenting Ground-Level PM2.5 Prediction via Kriging-Based Pseudo-Label Generation (Papers Track)
Abstract and authors: (click to expand)

Abstract: Fusing abundant satellite data with sparse ground measurements constitutes a major challenge in climate modeling. To address this, we propose a strategy to augment the training dataset by introducing unlabeled satellite images paired with pseudo-labels generated through a spatial interpolation technique known as ordinary kriging, thereby making full use of the available satellite data resources. We show that the proposed data augmentation strategy helps enhance the performance of the state-of-the-art convolutional neural network-random forest (CNN-RF) model by a reasonable amount, resulting in a noteworthy improvement in spatial correlation and a reduction in prediction error.

Authors: lei Duan (Duke University); Ziyang Jiang (Duke University); David Carlson (Duke University)

Computer Vision & Remote Sensing
NeurIPS 2023 Physics-Informed Domain-Aware Atmospheric Radiative Transfer Emulator for All Sky Conditions (Papers Track)
Abstract and authors: (click to expand)

Abstract: Radiative transfer modeling is a complex and computationally expensive process that is used to predict how radiation interacts with the atmosphere and Earth's surface. The Rapid Radiation Transfer Model (RRTM) is one such process model that is used in many Earth system models. In recent years, there has been a growing interest in using machine learning (ML) to speed up radiative transfer modeling. ML algorithms can be trained on large datasets of existing RRTM simulations to learn how to predict the results of new simulations without having to run the full RRTM model so one can use the algorithm for new simulations with very light computational demand. This study developed a new physics-based ML emulator for RRTM that is built on a convolutional neural network (CNN) where we trained the CNN on a dataset of 28 years of RRTM simulations. We built a custom loss function, which incorporates information on how radiation interacts with clouds at day- and night-time. The emulator was able to learn how to predict the vertical heating rates in the atmosphere with a high degree of accuracy (RMSE of less than 2% and Pearson's correlation above 0.9). The new ML emulator is over 56 times faster than the original RRTM model on traditional multi-CPU machines. This speedup could allow scientists to call the RRTM much more frequently in atmosphere models, which may improve the accuracy of climate models and reduce the uncertainty in the future climate projections.

Authors: Piyush Garg (Argonne National Laboratory); Emil Constantinescu (Argonne National Laboratory); Bethany Lusch (Argonne National Lab); Troy Arcomano (Argonne National Laboratory); Jiali Wang (Argonne National Laboratory); Rao Kotamarthi (Argonne National Laboratory)

Climate Science & Modeling Hybrid Physical Models
NeurIPS 2023 Can We Reliably Improve the Robustness to Image Acquisition of Remote Sensing of PV Systems? (Papers Track)
Abstract and authors: (click to expand)

Abstract: Photovoltaic (PV) energy is crucial for the decarbonization of energy systems. Due to the lack of centralized data, remote sensing of rooftop PV installations is the best option to monitor the evolution of the rooftop PV installed fleet at a regional scale. However, current techniques lack reliability and are notably sensitive to shifts in the acquisition conditions. To overcome this, we leverage the wavelet scale attribution method (WCAM), which decomposes a model's prediction in the space-scale domain. The WCAM enables us to assess on which scales the representation of a PV model rests and provides insights to derive methods that improve the robustness to acquisition conditions, thus increasing trust in deep learning systems to encourage their use for the safe integration of clean energy in electric systems.

Authors: Gabriel Kasmi (Mines Paris - PSL); Laurent Dubus (RTE France); Yves-Marie Saint-Drenan (Mines Paris - PSL); Philippe Blanc (Mines Paris - PSL)

Computer Vision & Remote Sensing
NeurIPS 2023 Uncertainty Quantification of the Madden–Julian Oscillation with Gaussian Processes (Papers Track)
Abstract and authors: (click to expand)

Abstract: The Madden–Julian Oscillation (MJO) is an influential climate phenomenon that plays a vital role in modulating global weather patterns. In spite of the improvement in MJO predictions made by machine learning algorithms, such as neural networks, most of them cannot provide the uncertainty levels in the MJO forecasts directly. To address this problem, we develop a nonparametric strategy based on Gaussian process (GP) models. We calibrate GPs using empirical correlations. Furthermore, we propose a posteriori covariance correction that extends the probabilistic coverage by more than three weeks.

Authors: Haoyuan Chen (Texas A&M University); Emil Constantinescu (Argonne National Laboratory); Vishwas Rao (Argonne National Laboratory); Cristiana Stan (George Mason University)

Climate Science & Modeling Uncertainty Quantification & Robustness
NeurIPS 2023 Ocean Wave Energy: Optimizing Reinforcement Learning Agents for Effective Deployment (Papers Track)
Abstract and authors: (click to expand)

Abstract: Fossil fuel energy production is a leading cause of climate change. While wind and solar energy have made advancements, ocean waves, a more consistent clean energy source, remain underutilized. Wave Energy Converters (WEC) transform wave power into electric energy. To be economically viable, modern WECs need sophisticated real-time controllers that boost energy output and minimize mechanical stress, thus lowering the overall cost of energy (LCOE). This paper presents how a Reinforcement Learning (RL) controller can outperform the default spring damper controller for complex spread waves in the sea, enhancing wave energy's viability. Using the Proximal Policy Optimization (PPO) algorithm with Transformer variants as function approximators, the RL controllers optimize multi-generator Wave Energy Converters (WEC), leveraging wave sensor data for multiple cost-efficiency goals. After successful tests in the EuropeWave\footnote{EuropeWave: https://www.europewave.eu/} project's emulator tank, the platform is planned to deploy. We discuss the challenges of deployment at the BiMEP site and how we had to tune the RL controller to address that. The RL controller outperforms the default Spring Damper controller in the BiMEP\footnote{BiMEP: https://www.bimep.com/en/} conditions by 22.8% on energy capture. Enhancing wave energy's economic viability will expedite the transition to clean energy, reducing carbon emissions and fostering a healthier climate.

Authors: Vineet Gundecha (Hewlett Packard Enterpise); Sahand Ghorbanpour (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Avisek Naug (Hewlett Packard Enterprise); Alexandre Pichard (Carnegie Clean Energy); mathieu Cocho (Carnegie Clean Energy); Soumyendu Sarkar (Hewlett Packard Enterprise)

Power & Energy Reinforcement Learning
NeurIPS 2023 Sustainable Data Center Modeling: A Multi-Agent Reinforcement Learning Benchmark (Papers Track)
Abstract and authors: (click to expand)

Abstract: The rapid growth of machine learning (ML) has led to an increased demand for computational power, resulting in larger data centers (DCs) and higher energy consumption. To address this issue and reduce carbon emissions, intelligent control of DC components such as cooling, load shifting, and energy storage is essential. However, the complexity of managing these controls in tandem with external factors like weather and green energy availability presents a significant challenge. While some individual components like HVAC control have seen research in Reinforcement Learning (RL), there's a gap in holistic optimization covering all elements simultaneously. To tackle this, we've developed DCRL, a multi-agent RL environment that empowers the ML community to research, develop, and refine RL controllers for carbon footprint reduction in DCs. DCRL is a flexible, modular, scalable, and configurable platform that can handle large High Performance Computing (HPC) clusters. In its default setup, DCRL also provides a benchmark for evaluating multi-agent RL algorithms, facilitating collaboration and progress in green computing research.

Authors: Soumyendu Sarkar (Hewlett Packard Enterprise); Avisek Naug (Hewlett Packard Enterprise); Antonio Guillen (Hewlett Packard Enterprise); Ricardo Luna Gutierrez (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Sahand Ghorbanpour (Hewlett Packard Enterprise); Sajad Mousavi (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs)

Buildings Reinforcement Learning
NeurIPS 2023 ACE: A fast, skillful learned global atmospheric model for climate prediction (Papers Track)
Abstract and authors: (click to expand)

Abstract: Existing ML-based atmospheric models are not suitable for climate prediction, which requires long-term stability and physical consistency. We present ACE (AI2 Climate Emulator), a 200M-parameter, autoregressive machine learning emulator of an existing comprehensive 100-km resolution global atmospheric model. The formulation of ACE allows evaluation of physical laws such as the conservation of mass and moisture. The emulator is stable for 100 years, nearly conserves column moisture without explicit constraints and faithfully reproduces the reference model's climate, outperforming a challenging baseline on over 90% of tracked variables. ACE requires nearly 100x less wall clock time and is 100x more energy efficient than the reference model using typically available resources. Without fine-tuning, ACE can stably generalize to a previously unseen historical sea surface temperature dataset.

Authors: Oliver Watt-Meyer (Allen Institute for AI); Gideon Dresdner (Allen Institute for AI Climate Science); Jeremy McGibbon (Allen Institute for AI); Spencer K Clark (Allen Institute for Artificial Intelligence); James Duncan (University of California, Berkeley); Brian Henn (Allen Institute for AI); Matthew Peters (AI2); Noah D Brenowitz (NVIDIA); Karthik Kashinath (NVIDIA); Mike Pritchard (NVIDIA); Boris Bonev (NVIDIA); Christopher Bretherton (Allen Institute for AI)

Climate Science & Modeling
NeurIPS 2023 A Configurable Pythonic Data Center Model for Sustainable Cooling and ML Integration (Papers Track)
Abstract and authors: (click to expand)

Abstract: There have been growing discussions on estimating and subsequently reducing the operational carbon footprint of enterprise data centers. The design and intelligent control for data centers have an important impact on data center carbon footprint. In this paper, we showcase PyDCM, a Python library that enables extremely fast prototyping of data center design and applies reinforcement learning-enabled control with the purpose of evaluating key sustainability metrics, including carbon footprint, energy consumption, and observing temperature hotspots. We demonstrate these capabilities of PyDCM and compare them to existing works in EnergyPlus for modeling data centers. PyDCM can also be used as a standalone Gymnasium environment for demonstrating sustainability-focused data center control.

Authors: Avisek Naug (Hewlett Packard Enterprise); Antonio Guillen (Hewlett Packard Enterprise); Ricardo Luna Gutierrez (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Sahand Ghorbanpour (Hewlett Packard Enterprise); Sajad Mousavi (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Soumyendu Sarkar (Hewlett Packard Enterprise)

Buildings Reinforcement Learning
NeurIPS 2023 Mapping Housing Stock Characteristics from Drone Images for Climate Resilience in the Caribbean (Papers Track)
Abstract and authors: (click to expand)

Abstract: Comprehensive information on housing stock is crucial for climate adaptation initiatives aiming to reduce the adverse impacts of climate-extreme hazards in high-risk regions like the Caribbean. In this study, we propose a workflow for rapidly generating critical baseline housing stock data using very high-resolution drone images and deep learning techniques. Specifically, our work leverages the Segment Anything Model and convolutional neural networks for the automated generation of building footprint and roof classification maps. By enhancing local capacity in government agencies, this work seeks to improve the climate resilience of the housing sector in small island developing states in the Caribbean.

Authors: Isabelle Tingzon (The World Bank); Nuala Margaret Cowan (The World Bank); Pierre Chrzanowski (The World Bank)

Disaster Management and Relief
NeurIPS 2023 Graph-based Neural Weather Prediction for Limited Area Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: The rise of accurate machine learning methods for weather forecasting is creating radical new possibilities for modeling the atmosphere. In the time of climate change, having access to high-resolution forecasts from models like these is also becoming increasingly vital. While most existing Neural Weather Prediction (NeurWP) methods focus on global forecasting, an important question is how these techniques can be applied to limited area modeling. In this work we adapt the graph-based NeurWP approach to the limited area setting and propose a multi-scale hierarchical model extension. Our approach is validated by experiments with a local model for the Nordic region.

Authors: Joel Oskarsson (Linköping University); Tomas Landelius (SMHI); Fredrik Lindsten (Linköping University)

Climate Science & Modeling Time-series Analysis
NeurIPS 2023 Self-supervised Pre-training for Precipitation Post-processor (Papers Track)
Abstract and authors: (click to expand)

Abstract: Securing sufficient forecast lead time for local precipitation is essential for preventing hazardous weather events. Nonetheless, global warming-induced climate change is adding to the challenge of accurately predicting severe precipitation events, such as heavy rainfall. In this work, we propose a deep learning-based precipitation post-processor approach to numerical weather prediction (NWP) models. The precipitation post-processor consists of (i) self-supervised pre-training, where parameters of encoder are pre-trained on the reconstruction of masked variables of the atmospheric physics domain, and (ii) transfer learning on precipitation segmentation tasks (target domain) from the pre-trained encoder. We also introduce a heuristic labeling approach for effectively training class-imbalanced datasets. Our experiment results in precipitation correction for regional NWP show that the proposed method outperforms other approaches.

Authors: Sojung An (Korea Institute of Atmosphere Prediction Systems); Junha Lee (Korea Institute of Industrial Technology); Jiyeon Jang (Korea Institute of Atmosphere Prediction Systems); Inchae Na (Korea Institute of Atmosphere Prediction Systems); Wooyeon Park (Korea Institute of Atmosphere Prediction Systems); Sujeong You (KITECH)

Extreme Weather
NeurIPS 2023 Enhancing Data Center Sustainability with a 3D CNN-Based CFD Surrogate Model (Papers Track)
Abstract and authors: (click to expand)

Abstract: Thermal Computational Fluid Dynamics (CFD) models analyze airflow and heat distribution in data centers, but their complex computations hinder efficient energy-saving optimizations for sustainability. We introduce a new method to acquire data and model 3D Convolutional Neural Network (CNN) based surrogates for CFDs, which predict a data center's temperature distribution based on server workload, HVAC airflow rate, and temperature set points. The surrogate model's predictions are highly accurate, with a mean absolute error of 0.31°C compared to CFD-based ground truth temperatures. The surrogate model is three orders of magnitude faster than CFDs in generating the temperature maps for similar-sized data centers, enabling real-time applications. It helps to quickly identify and reduce temperature hot spots($7.7%) by redistributing workloads and saving cooling energy($2.5%). It also aids in optimizing server placement during installation, preventing issues, and increasing equipment lifespan. These optimizations boost sustainability by reducing energy use, improving server performance, and lowering environmental impact.

Authors: Soumyendu Sarkar (Hewlett Packard Enterprise); Antonio Guillen (Hewlett Packard Enterprise); Zachariah Carmichael (University of Notre Dame); Vineet Gundecha (Hewlett Packard Enterpise); Avisek Naug (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Ricardo Luna Gutierrez (Hewlett Packard Enterprise)

Buildings Computer Vision & Remote Sensing
NeurIPS 2023 Price-Aware Deep Learning for Electricity Markets (Papers Track)
Abstract and authors: (click to expand)

Abstract: While deep learning gradually penetrates operational planning of power systems, its inherent prediction errors may significantly affect electricity prices. This paper examines how prediction errors propagate into electricity prices, revealing notable pricing errors and their spatial disparity in congested power systems. To improve fairness, we propose to embed electricity market-clearing optimization as a deep learning layer. Differentiating through this layer allows for balancing between prediction and pricing errors, as oppose to minimizing prediction errors alone. This layer implicitly optimizes fairness and controls the spatial distribution of price errors across the system. We showcase the price-aware deep learning in the nexus of wind power forecasting and short-term electricity market clearing.

Authors: Vladimir Dvorkin (Massachusetts Institute of Technology); Ferdinando Fioretto (University of Virginia)

Power & Energy
NeurIPS 2023 A machine learning framework for correcting under-resolved simulations of turbulent systems using nudged datasets (Papers Track)
Abstract and authors: (click to expand)

Abstract: Due to the rapidly changing climate, the frequency and severity of extreme weather, such as storms and heatwaves is expected to increase drastically over the coming decades. Accurately quantifying the risk of such events with high spatial resolution is a critical step in the implementation of strategies to prepare for and mitigate the damages. As fully resolved simulations remain computationally out of reach, policy makers must rely on coarse resolution climate models which either parameterize or completely ignore sub-grid scale dynamics. In this work we propose a machine learning framework to debias under-resolved simulations of complex and chaotic dynamical systems such as atmospheric dynamics. The proposed strategy uses ``nudged'' simulations of the coarse model to generate training data designed to minimize the effects of chaotic divergence. We illustrate through a prototype QG model that the proposed approach allows us to machine learn a map from the chaotic attractor of under-resolved dynamics to that of the fully resolved system. In this way we are able to recover extreme event statistics using a very small training dataset.

Authors: Benedikt Barthel (MIT); Themis Sapsis (MIT)

Climate Science & Modeling
NeurIPS 2023 Can Deep Learning help to forecast deforestation in the Amazonian Rainforest? (Papers Track)
Abstract and authors: (click to expand)

Abstract: Deforestation is a major driver of climate change. To mitigate deforestation, carbon offset projects aim to protect forest areas at risk. However, existing literature shows that most projects have substantially overestimated the risk of deforestation, thereby issuing carbon credits without equivalent emissions reductions. In this study, we examine if the spread of deforestation can be predicted ex-ante using Deep Learning (DL) models. Our input data includes past deforestation development, slope information, land use, and other terrain- and soil-specific covariates. Testing predictions 1-year ahead, we find that our models only achieve low levels of predictability. For pixel-wise classification at a 30 m resolution, our models achieve an F1 score of 0.263. Only when substantially simplifying the task to predicting if any level of deforestation occurs within a 1.5 km squared tile, the model results improve to a moderate performance (F1: 0.608). We conclude that, based on our input data, deforestation cannot be predicted accurately enough to justify the ex-ante issuance of carbon credits for forest conservation projects. As main challenges, there is the extreme class imbalance between pixels that are deforested (minority) and not deforested (majority) as well as the omittance of social, political, and economic drivers of deforestation.

Authors: Tim Engelmann (ETH Zurich); Malte Toetzke (ETH Zurich)

Forests Time-series Analysis
NeurIPS 2023 Spatially-resolved emulation of climate extremes via machine learning stochastic models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Emulators, or reduced-complexity models, serve as an ideal complement to earth system models (ESM) by providing the climate information under various scenarios at much lower computational costs. We develop an emulator of climate extremes that produce the temporal evolution of probability distributions of local variables on a spatially resolved grid. The representative modes of climate change are identified using principal component analysis (PCA), and the PCA time series are approximated using stochastic models. When applied to ERA5 data, the model accurately reproduces the quantiles of local daily maximum temperature and effectively captures the non-Gaussian statistics. We also discuss potential generalization of our emulator to different climate change scenarios.

Authors: Mengze Wang (Massachusetts Institute of Technology); Andre Souza (Massachusetts Institute of Technology); Raffaele Ferrari (Massachusetts Institute of Technology); Themis Sapsis (MIT)

Climate Science & Modeling
NeurIPS 2023 Reinforcement Learning control for Airborne Wind Energy production (Papers Track)
Abstract and authors: (click to expand)

Abstract: Airborne Wind Energy (AWE) is an emerging technology that promises to be able to harvest energy from strong high-altitude winds, while addressing some of the key critical issues of current wind turbines. AWE is based on flying devices (usually gliders or kites) that, tethered to a ground station, fly driven by the wind and convert the mechanical energy of wind into electrical energy by means of a generator. Such systems are usually controlled by adjusting the trajectory of the kite using optimal control techniques, such as model-predictive control. These methods are based upon a mathematical model of the system to control, and they produce results that are strongly dependent on the specific model at use and difficult to generalize. Our aim is to replace these classical techniques with an approach based on Reinforcement Learning (RL), which can be used even in absence of a known model. Experimental results prove that RL is a viable method to control AWE systems in complex simulated environments, including turbulent flows.

Authors: Lorenzo Basile (University of Trieste); Maria Grazia Berni (University of Trieste); Antonio Celani (ICTP)

Power & Energy Reinforcement Learning
NeurIPS 2023 Extreme Event Prediction with Multi-agent Reinforcement Learning-based Parametrization of Atmospheric and Oceanic Turbulence (Papers Track)
Abstract and authors: (click to expand)

Abstract: Global climate models (GCMs) are the main tools for understanding and predicting climate change. However, due to limited numerical resolutions, these models suffer from major structural uncertainties; e.g., they cannot resolve critical processes such as small-scale eddies in atmospheric and oceanic turbulence. Thus, such small-scale processes have to be represented as a function of the resolved scales via closures (parametrization). The accuracy of these closures is particularly important for capturing climate extremes. Traditionally, such closures are based on heuristics and simplifying assumptions about the unresolved physics. Recently, supervised-learned closures, trained offline on high-fidelity data, have been shown to outperform the classical physics-based closures. However, this approach requires a significant amount of high-fidelity training data and can also lead to instabilities. Reinforcement learning is emerging as a potent alternative for developing such closures as it requires only low-order statistics and leads to stable closures. In Scientific Multi-Agent Reinforcement Learning (SMARL) computational elements serve a dual role of discretization points and learning agents. Here, we leverage SMARL and fundamentals of turbulence physics to learn closures for canonical prototypes of atmospheric and oceanic turbulence. The policy is trained using only the enstrophy spectrum, which is nearly invariant and can be estimated from a few high-fidelity samples. We show that these closures lead to stable low-resolution simulations that, at a fraction of the cost, can reproduce the high-fidelity simulations' statistics, including the tails of the probability density functions (PDFs). These results demonstrate the high potential of SMARL for closure modeling for GCMs, especially in the regime of scarce data and indirect observations.

Authors: Rambod Mojgani (Rice University); Daniel Waelchli (ETHZ); Yifei Guan (Rice University); Petros Koumoutsakos (Harvard); Pedram Hassanzadeh (Rice University)

Climate Science & Modeling Hybrid Physical Models
NeurIPS 2023 Machine learning derived sub-seasonal to seasonal extremes (Papers Track)
Abstract and authors: (click to expand)

Abstract: Improving the accuracy of sub-seasonal to seasonal (S2S) extremes can significantly impact society. Providing S2S forecasts in risk or extreme indices can aid disaster response, especially for drought and flood events. Additionally, it can provide updates on disease outbreaks and aid in predicting the occurrence, duration, and decline of heat waves. This work uses a transformer model to predict the daily temperature distributions in the S2S scale. We analyze how the model performs in extreme temperatures by comparing its output distributions with those obtained from ECMWF forecasts across different metrics. Our model produces better responses for temperatures in average and extreme regions. Also, we show how our model better captures the heatwave that hit Europe in the summer of 2019.

Authors: Daniel Salles Civitarese (IBM Research, Brazil); Bianca Zadrozny (IBM Research)

Time-series Analysis Extreme Weather
NeurIPS 2023 Understanding Opinions Towards Climate Change on Social Media (Papers Track)
Abstract and authors: (click to expand)

Abstract: Social media platforms such as Twitter (now known as X) have revolutionized how the public engage with important societal and political topics. Recently, climate change discussions on social media became a catalyst for political polarization and the spreading of misinformation. In this work, we aim to understand how real world events influence the opinions of individuals towards climate change related topics on social media. To this end, we extracted and analyzed a dataset of 13.6 millions tweets sent by 3.6 million users from 2006 to 2019. Then, we construct a temporal graph from the user-user mentions network and utilize the Louvain community detection algorithm to analyze the changes in community structure around Conference of the Parties on Climate Change (COP) events. Next, we also apply tools from the Natural Language Processing literature to perform sentiment analysis and topic modeling on the tweets. Our work acts as a first step towards understanding the evolution of pro-climate change communities around COP events. Answering these questions helps us understand how to raise people's awareness towards climate change thus hopefully calling on more individuals to join the collaborative effort in slowing down climate change.

Authors: Yashaswi Pupneja (University of Montreal); Yuesong Zou (McGill University); Sacha Levy (Yale University); Shenyang Huang (Mila/McGill University)

Behavioral and Social Science Data Mining
NeurIPS 2023 Real-time Carbon Footprint Minimization in Sustainable Data Centers wth Reinforcement Learning (Papers Track) Best ML Innovation
Abstract and authors: (click to expand)

Abstract: As machine learning workloads significantly increase energy consumption, sustainable data centers with low carbon emissions are becoming a top priority for governments and corporations worldwide. There is a pressing need to optimize energy usage in these centers, especially considering factors like cooling, balancing flexible load based on renewable energy availability, and battery storage utilization. The challenge arises due to the interdependencies of these strategies with fluctuating external factors such as weather and grid carbon intensity. Although there's currently no real-time solution that addresses all these aspects, our proposed Data Center Carbon Footprint Reduction (DC-CFR) framework, based on multi-agent Reinforcement Learning (MARL), targets carbon footprint reduction, energy optimization, and cost. Our findings reveal that DC-CFR's MARL agents efficiently navigate these complexities, optimizing the key metrics in real-time. DC-CFR reduced carbon emissions, energy consumption, and energy costs by over 13% with EnergyPlus simulation compared to the industry standard ASHRAE controller controlling HVAC for a year in various regions.

Authors: Soumyendu Sarkar (Hewlett Packard Enterprise); Avisek Naug (Hewlett Packard Enterprise); Ricardo Luna Gutierrez (Hewlett Packard Enterprise); Antonio Guillen (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Cullen Bash (HPE)

Buildings Reinforcement Learning
NeurIPS 2023 Machine learning applications for weather and climate predictions need greater focus on extremes (Papers Track)
Abstract and authors: (click to expand)

Abstract: Multiple studies have now demonstrated that machine learning (ML) can give improved skill for predicting or simulating fairly typical weather events, for tasks such as short-term and seasonal weather forecasting, downscaling simulations to higher resolution and emulating and speeding up expensive model parameterisations. Many of these used ML methods with very high numbers of parameters, such as neural networks, which are the focus of the discussion here. Not much attention has been given to the performance of these methods for extreme event severities of relevance for many critical weather and climate prediction applications, with return periods of more than a few years. This leaves a lot of uncertainty about the usefulness of these methods, particularly for general purpose prediction systems that must perform reliably in extreme situations. ML models may be expected to struggle to predict extremes due to there usually being few samples of such events. However, there are some studies that do indicate that ML models can have reasonable skill for extreme weather, and that it is not hopeless to use them in situations requiring extrapolation. This paper reviews these studies, updating an earlier review, and argues that this is an area that needs researching more. Ways to get a better understanding of how well ML models perform at predicting extreme weather events are discussed.

Authors: Peter Watson (Bristol)

Climate Science & Modeling Extreme Weather
NeurIPS 2023 Climate-sensitive Urban Planning through Optimization of Tree Placements (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change is increasing the intensity and frequency of many extreme weather events, including heatwaves, which results in increased thermal discomfort and mortality rates. While global mitigation action is undoubtedly necessary, so is climate adaptation, e.g., through climate-sensitive urban planning. Among the most promising strategies is harnessing the benefits of urban trees in shading and cooling pedestrian-level environments. Our work investigates the challenge of optimal placement of such trees. Physical simulations can estimate the radiative and thermal impact of trees on human thermal comfort but induce high computational costs. This rules out optimization of tree placements over large areas and considering effects over longer time scales. Hence, we employ neural networks to simulate the point-wise mean radiant temperatures--a driving factor of outdoor human thermal comfort--across various time scales, spanning from daily variations to extended time scales of heatwave events and even decades. To optimize tree placements, we harness the innate local effect of trees within the iterated local search framework with tailored adaptations. We show the efficacy of our approach across a wide spectrum of study areas and time scales. We believe that our approach is a step towards empowering decision-makers, urban designers and planners to proactively and effectively assess the potential of urban trees to mitigate heat stress.

Authors: Simon Schrodi (University of Freiburg); Ferdinand Briegel (University of Freiburg); Max J. Argus (University Of Freiburg); Andreas Christen (University of Freiburg); Thomas Brox (University of Freiburg)

Cities & Urban Planning
NeurIPS 2023 Reinforcement Learning for Wildfire Mitigation in Simulated Disaster Environments (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change has resulted in a year over year increase in adverse weather and weather conditions which contribute to increasingly severe fire seasons. Without effective mitigation, these fires pose a threat to life, property, ecology, cultural heritage, and critical infrastructure. To better prepare for and react to the increasing threat of wildfires, more accurate fire modelers and mitigation responses are necessary. In this paper, we introduce SimFire, a versatile wildland fire projection simulator designed to generate realistic wildfire scenarios, and SimHarness, a modular agent-based machine learning wrapper capable of automatically generating land management strategies within SimFire to reduce the overall damage to the area. Together, this publicly available system allows researchers and practitioners the ability to emulate and assess the effectiveness of firefighter interventions and formulate strategic plans that prioritize value preservation and resource allocation optimization. The repositories are available for download at https://github.com/mitrefireline.

Authors: Alexander Tapley (The MITRE Corporation); savanna o smith (MITRE); Tim Welsh (The MITRE Corporation); Aidan Fennelly (The MITRE Corporation); Dhanuj M Gandikota (The MITRE Corporation); Marissa Dotter (MITRE Corporation); Michael Doyle (The MITRE Corporation); Michael Threet (MITRE)

Reinforcement Learning Disaster Management and Relief
NeurIPS 2023 Combining deep generative models with extreme value theory for synthetic hazard simulation: a multivariate and spatially coherent approach (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate hazards can cause major disasters when they occur simultaneously as compound hazards. To understand the distribution of climate risk and inform adaptation policies, scientists need to simulate a large number of physically realistic and spatially coherent events. Current methods are limited by computational constraints and the probabilistic spatial distribution of compound events is not given sufficient attention. The bottleneck in current approaches lies in modelling the dependence structure between variables, as inference on parametric models suffers from the curse of dimensionality. Generative adversarial networks (GANs) are well-suited to such a problem due to their ability to implicitly learn the distribution of data in high-dimensional settings. We employ a GAN to model the dependence structure for daily maximum wind speed, significant wave height, and total precipitation over the Bay of Bengal, combining this with traditional extreme value theory for controlled extrapolation of the tails. Once trained, the model can be used to efficiently generate thousands of realistic compound hazard events, which can inform climate risk assessments for climate adaptation and disaster preparedness. The method developed is flexible and transferable to other multivariate and spatial climate datasets.

Authors: Alison M Peard (University of Oxford); Jim Hall (University of Oxford)

Societal Adaptation & Resilience Generative Modeling
NeurIPS 2023 Uncertainty Quantified Machine Learning for Street Level Flooding Predictions in Norfolk, Virginia (Papers Track)
Abstract and authors: (click to expand)

Abstract: Everyday citizens, emergency responders, and critical infrastructure can be dramatically affected by the flooding of streets and roads. Climate change exacerbates these floods through sea level rise and more frequent major storm events. Low-level flooding, such as nuisance flooding, continues to increase in frequency, especially in cities like Norfolk, Virginia, which can expect nearly 200 flooding events by 2050 [1]. Recently, machine learning (ML) models have been leveraged to produce real-time predictions based on local weather and geographic conditions. However, ML models are known to produce unusual results when presented with data that varies from their training set. For decision-makers to determine the trustworthiness of the model's predictions, ML models need to quantify their prediction uncertainty. This study applies Deep Quantile Regression to a previously published, Long Short-Term Memory-based model for hourly water depth predictions [2], and analyzes its out-of-distribution performance.

Authors: Steven Goldenberg (Thomas Jefferson National Accelerator Facility); Diana McSpadden (Thomas Jefferson National Accelerator Facility); Binata Roy (University of Virginia); Malachi Schram (Thomas Jefferson National Accelerator Facility); Jonathan Goodall (University of Virginia); Heather Richter (Old Dominion University)

Climate Science & Modeling Uncertainty Quantification & Robustness
NeurIPS 2023 Weakly-semi-supervised object detection in remotely sensed imagery (Papers Track)
Abstract and authors: (click to expand)

Abstract: Deep learning for detecting objects in remotely sensed imagery can enable new technologies for important applications including mitigating climate change. However, these models often require large datasets labeled with bounding box annotations which are expensive to curate, prohibiting the development of models for new tasks and geographies. To address this challenge, we develop weakly-semi-supervised object detection (WSSOD) models on remotely sensed imagery which can leverage a small amount of bounding boxes together with a large amount of point labels that are easy to acquire at scale in geospatial data. We train WSSOD models which use large amounts of point-labeled images with varying fractions of bounding box labeled images in FAIR1M and a wind turbine detection dataset, and demonstrate that they substantially outperform fully supervised models trained with the same amount of bounding box labeled images on both datasets. Furthermore, we find that the WSSOD models trained with 2-10x fewer bounding box labeled images can perform similarly to or outperform fully supervised models trained on the full set of bounding-box labeled images. We believe that the approach can be extended to other remote sensing tasks to reduce reliance on bounding box labels and increase development of models for impactful applications.

Authors: Ji Hun Wang (Stanford University); Jeremy Irvin (Stanford University); Beri Kohen Behar (Stanford University); Ha Tran (Stanford University); Raghav Samavedam (Stanford University); Quentin Hsu (Stanford University); Andrew Ng (Stanford University)

Earth Observation & Monitoring Unsupervised & Semi-Supervised Learning
NeurIPS 2023 Hybridizing Physics and Neural ODEs for Predicting Plasma Inductance Dynamics in Tokamak Fusion Reactors (Papers Track)
Abstract and authors: (click to expand)

Abstract: While fusion reactors known as tokamaks hold promise as a firm energy source, advances in plasma control, and handling of events where control of plasmas is lost, are needed for them to be economical. A significant bottleneck towards applying more advanced control algorithms is the need for better plasma simulation, where both physics-based and data-driven approaches currently fall short. The former is bottle-necked by both computational cost and the difficulty of modelling plasmas, and the latter is bottle-necked by the relative paucity of data. To address this issue, this work applies the neural ordinary differential equations (ODE) framework to the problem of predicting a subset of plasma dynamics, namely the coupled plasma current and internal inductance dynamics. As the neural ODE framework allows for the natural inclusion of physics-based inductive biases, we train both physics-based and neural network models on data from the Alcator C-Mod fusion reactor and find that a model that combines physics-based equations with a neural ODE performs better than both existing physics-motivated ODEs and a pure neural ODE model.

Authors: Allen Wang (MIT); Cristina Rea (MIT); Darren Garnier (MIT)

Hybrid Physical Models Reinforcement Learning
NeurIPS 2023 Explainable Offline-online Training of Neural Networks for Multi-scale Climate Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: In global climate models, small-scale physical processes are represented using subgrid-scale (SGS) models known as parameterizations, and these parameterizations contribute substantially to uncertainties in climate projections. Recently, machine learning techniques, particularly deep neural networks (NNs), have emerged as novel tools for developing SGS parameterizations. Different strategies exist for training these NN-based SGS models. Here, we use a 1D model of the quasi-biennial oscillation (QBO) and atmospheric gravity wave (GW) parameterizations as testbeds to explore various learning strategies and challenges due to scarcity of high-fidelity training data. We show that a 12-layer convolutional NN that predicts GW forcings for given wind profiles, when trained offline in a big-data regime (100-years), produces realistic QBOs once coupled to the 1D model. In contrast, offline training of this NN in a small-data regime (18-months) yields unrealistic QBOs. However, online re-training of just two layers of this NN using ensemble Kalman inversion and only time-averaged QBO statistics leads to parameterizations that yield realistic QBOs. Fourier analysis of these three NNs’ kernels suggests how/why re-training works and reveals that these NNs primarily learn low-pass, high-pass, and a combination of band-pass Gabor filters, consistent with the importance of both local and non-local dynamics in GW propagation/dissipation. These strategies/findings apply to data-driven parameterizations of other climate processes generally.

Authors: Hamid Alizadeh Pahlavan (Rice University); Pedram Hassanzadeh (Rice University); M. Joan Alexander (NorthWest Research Associates)

Hybrid Physical Models Unsupervised & Semi-Supervised Learning
NeurIPS 2023 Southern Ocean Dynamics Under Climate Change: New Knowledge Through Physics-Guided Machine Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Complex ocean systems such as the Antarctic Circumpolar Current play key roles in the climate, and current models predict shifts in their strength and area under climate change. However, the physical processes underlying these changes are not well understood, in part due to the difficulty of characterizing and tracking changes in ocean physics in complex models. Using the Antarctic Circumpolar Current as a case study, we extend the method Tracking global Heating with Ocean Regimes (THOR) to a mesoscale eddy permitting climate model and identify regions of the ocean characterized by similar physics, called dynamical regimes, using readily accessible fields from climate models. To this end, we cluster grid cells into dynamical regimes and train an ensemble of neural networks, allowing uncertainty quantification, to predict these regimes and track them under climate change. Finally, we leverage this new knowledge to elucidate the dynamical drivers of the identified regime shifts as noted by the neural network using the `explainability' methods SHAP and Layer-wise Relevance Propagation. A region undergoing a profound shift is where the Antarctic Circumpolar Current intersects the Pacific-Antarctic Ridge, an area important for carbon draw-down and fisheries. In this region, THOR specifically reveals a shift in dynamical regime under climate change driven by changes in wind stress and interactions with bathymetry. Using this knowledge to guide further exploration, we find that as the Antarctic Circumpolar Current shifts north under intensifying wind stress, the dominant dynamical role of bathymetry weakens and the flow intensifies.

Authors: William J Yik (Harvey Mudd College); Maike Sonnewald (University of California, Davis); Mariana Clare (ECMWF); Redouane Lguensat (IPSL)

Oceans & Marine Systems Interpretable ML
NeurIPS 2023 Prototype-oriented Unsupervised Change Detection for Disaster Management (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change has led to an increased frequency of natural disasters such as floods and cyclones. This emphasizes the importance of effective disaster monitoring. In response, the remote sensing community has explored change detection methods. These methods are primarily categorized into supervised techniques, which yield precise results but come with high labeling costs, and unsupervised techniques, which eliminate the need for labeling but involve intricate hyperparameter tuning. To address these challenges, we propose a novel unsupervised change detection method named Prototype-oriented Unsupervised Change Detection for Disaster Management (PUCD). PUCD captures changes by comparing features from pre-event, post-event, and prototype-oriented change synthesis images via a foundational model, and refines results using the Segment Anything Model (SAM). Although PUCD is an unsupervised change detection, it does not require complex hyperparameter tuning. We evaluate PUCD framework on the LEVIR-Extension dataset and the disaster dataset and it achieves state-of-the-art performance compared to other methods on the LEVIR-Extension dataset.

Authors: YoungTack Oh (SI Analytics); Minseok Seo (si-analytics); Doyi Kim (SI Analytics); Junghoon Seo (SI Analytics)

Disaster Management and Relief Earth Observation & Monitoring
NeurIPS 2023 How to Recycle: General Vision-Language Model without Task Tuning for Predicting Object Recyclability (Papers Track)
Abstract and authors: (click to expand)

Abstract: Waste segregation and recycling place a crucial role in fostering environmental sustainability. However, discerning the whether a material is recyclable or not poses a formidable challenge, primarily because of inadequate recycling guidelines to accommodate a diverse spectrum of objects and their varying conditions. We investigated the role of vision-language models in addressing this challenge. We curated a dataset consisting >1000 images across 11 disposal categories for optimal discarding and assessed the applicability of general vision-language models for recyclability classification. Our results show that Contrastive Language-Image Pre- training (CLIP) model, which is pretrained to understand the relationship between images and text, demonstrated remarkable performance in the zero-shot recycla- bility classification task, with an accuracy of 89%. Our results underscore the potential of general vision-language models in addressing real-world challenges, such as automated waste sorting, by harnessing the inherent associations between visual and textual information.

Authors: Eliot Park (Harvard Medical School); Eddy Pan (Harvard Medical School); Shreya Johri (Harvard Medical School); Pranav Rajpurkar (Harvard Medical School)

Sustainability Natural Language Processing
NeurIPS 2023 Flowering Onset Detection: Traditional Learning vs. Deep Learning Performance in a Sparse Label Context (Papers Track)
Abstract and authors: (click to expand)

Abstract: Detecting temporal shifts in plant flowering times is of increasing importance in a context of climate change, with applications in plant ecology, but also health, agriculture, and ecosystem management. However, scaling up plant-level monitoring is cost prohibitive, and flowering transitions are complex and difficult to model. We develop two sets of approaches to detect the onset of flowering at large-scale and high-resolution. Using fine grain temperature data with domain knowledge based features and traditional machine learning models provides the best performance. Using satellite data, with deep learning to deal with high dimensionality and transfer learning to overcome ground truth label sparsity, is a challenging but promising approach, as it reaches good performance with more systematically available data.

Authors: Mauricio Soroco (University of British Columbia); Joel Hempel (University of British Columbia); Xinze Xiong (University of British Columbia); Mathias Lécuyer (University of British Columbia); Joséphine Gantois (University of British Columbia)

Computer Vision & Remote Sensing Earth Observation & Monitoring
NeurIPS 2023 Zero shot microclimate prediction with deep learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: While weather station data is a valuable resource for climate prediction, its reliability can be limited in remote locations. Furthermore, making local predictions often relies on sensor data that may not be accessible for a new, unmonitored location. In response to these challenges, we introduce a novel zero-shot learning approach designed to forecast various climate measurements at new and unmonitored locations. Our method surpasses conventional weather forecasting techniques in predicting microclimate variables by leveraging knowledge extracted from other geographic locations.

Authors: Iman Deznabi (UMass); Peeyush Kumar (Microsoft Research); Madalina Fiterau (University of Massachusetts Amherst)

Meta- and Transfer Learning Climate Science & Modeling
NeurIPS 2023 Glacier Movement Prediction with Attention-based Recurrent Neural Networks and Satellite Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: Studying glacier movements is crucial because of their indications for global climate change and its effects on local land masses. Building on established methods for glacier movement prediction from Landsat-8 satellite imaging data, we develop an attention-based deep learning model for time series data prediction of glacier movements. In our approach, the Normalized Difference Snow Index is calculated from the Landsat-8 spectral reflectance bands for data of the Parvati Glacier (India) to quantify snow and ice in the scene images, which is then used for time series prediction. Based on this data, a newly developed Long-Short Term Memory Encoder-decoder neural network model is trained, incorporating a Multi-head Self Attention mechanism in the decoder. The model shows promising results, making the prediction of optical flow vectors from pure model predictions possible.

Authors: Jonas Müller (University of Tübingen); Raphael Braun (University of Tübingen); Hendrik P. A. Lensch (University of Tübingen); Nicole Ludwig (University of Tübingen)

Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2023 A Causal Discovery Approach To Learn How Urban Form Shapes Sustainable Mobility Across Continents (Papers Track)
Abstract and authors: (click to expand)

Abstract: For low carbon transport planning it's essential to grasp the location-specific cause-and-effect mechanisms that the built environment has on travel. Yet, current research falls short in representing causal relationships between the "6D" urban form variables and travel, generalizing across different regions, and modelling urban form effects at high spatial resolution. Here, we address these gaps by utilizing a causal discovery and an explainable machine learning framework to detect urban form effects on intra-city travel emissions based on high-resolution mobility data of six cities across three continents. We show that distance to center, demographics and density indirectly affect other urban form features and that location-specific influences align across cities, yet vary in magnitude. In addition, the spread of the city and the coverage of jobs across the city are the strongest determinants of travel-related emissions, highlighting the benefits of compact development and associated benefits. Our work is a starting point for location-specific analysis of urban form effects on mobility using causal discovery approaches, which is highly relevant municipalities across continents.

Authors: Felix Wagner (TU Berlin, MCC Berlin); Florian Nachtigall (MCC Berlin); Lukas B Franken (University of Edinburgh); Nikola Milojevic-Dupont (Mercator Research Institute on Global Commons and Climate Change (MCC)); Marta C. González (Berkeley); Jakob Runge (TU Berlin); Rafael Pereira (IPEA); Felix Creutzig (Mercator Research Institute on Global Commons and Climate Change (MCC))

Cities & Urban Planning Causal & Bayesian Methods
NeurIPS 2023 Unleashing the Autoconversion Rates Forecasting: Evidential Regression from Satellite Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: High-resolution simulations such as the ICOsahedral Non-hydrostatic Large-Eddy Model (ICON-LEM) can be used to understand the interactions between aerosols, clouds, and precipitation processes that currently represent the largest source of uncertainty involved in determining the radiative forcing of climate change. Nevertheless, due to the exceptionally high computing cost required, this simulation-based approach can only be employed for a short period of time within a limited area. Despite the fact that machine learning can mitigate this problem, the related model uncertainties may make it less reliable. To address this, we developed a neural network (NN) model powered with evidential learning to assess the data and model uncertainties applied to satellite observation data. Our study focuses on estimating the rate at which small droplets (cloud droplets) collide and coalesce to become larger droplets (raindrops) – autoconversion rates -- since this is one of the key processes in the precipitation formation of liquid clouds, hence crucial to better understanding cloud responses to anthropogenic aerosols. The results of estimating the autoconversion rates demonstrate that the model performs reasonably well, with the inclusion of both aleatoric and epistemic uncertainty estimation, which improves the credibility of the model and provides useful insights for future improvement.

Authors: Maria C Novitasari (University College London); Johannes Quaas (Universität Leipzig); Miguel Rodrigues (University College London)

Climate Science & Modeling Uncertainty Quantification & Robustness
NeurIPS 2023 Detailed Glacier Area Change Analysis in the European Alps with Deep Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Glacier retreat is a key indicator of climate change and requires regular updates of the glacier area. Recently, the release of a new inventory for the European Alps showed that glaciers continued to retreat at about 1.3% per year from 2003 to 2015. The outlines were produced by manually correcting the results of a semi-automatic method applied to Sentinel-2 imagery. In this work we develop a fully-automatic pipeline based on Deep Learning to investigate the evolution of the glaciers in the Alps from 2015 to present (2023). After outlier filtering, we provide individual estimates for around 1300 glaciers, representing 87% of the glacierized area. Regionally we estimate an area loss of -1.8% per year, with large variations between glaciers. Code and data are available at https://github.com/dcodrut/glacier_mapping_alps_tccml.

Authors: Codrut-Andrei Diaconu (DLR); Jonathan Bamber (Technical University of Munich)

Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2023 An LSTM-based Downscaling Framework for Australian Precipitation Projections (Papers Track)
Abstract and authors: (click to expand)

Abstract: Understanding potential changes in future rainfall and their local impacts on Australian communities can inform adaptation decisions worth billions of dollars in insurance, agriculture, and other sectors. This understanding relies on downscaling a large ensemble of coarse Global Climate Models (GCMs), our primary tool for simulating future climate. However, the prohibitively high computational cost of downscaling has been a significant barrier. In response, this study develops a cost-efficient downscaling framework for daily precipitation using Long Short-Term Memory (LSTM) models. The models are trained with ERA5 reanalysis data and a customized quantile loss function to better capture precipitation extremes. The framework is employed to downscale precipitation from a GCM member of the CMIP6 ensemble. We demonstrate the skills of the downscaling models to capture spatial and temporal characteristics of precipitation. We also explore regional future changes in precipitation extremes projected by the downscaled GCM. In general, this framework will enable the generation of a large ensemble of regional future projections for Australian rainfall. This will further enhance the assessment of likely climate risks and the quantification of their uncertainties.

Authors: Matthias Bittner (Vienna University of Technology); Sanaa Hobeichi (The University of New South Wales); Muhammad Zawish (Walton Institute, WIT); Samo DIATTA (Assane Seck University of Ziguinchor); Remigius Ozioko (University of Nigeria); Sharon Xu (Indigo Ag); Axel Jantsch (TU Wien)

Extreme Weather Time-series Analysis
NeurIPS 2023 A machine learning pipeline for automated insect monitoring (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change and other anthropogenic factors have led to a catastrophic decline in insects, endangering both biodiversity and the ecosystem services on which human society depends. Data on insect abundance, however, remains woefully inadequate. Camera traps, conventionally used for monitoring terrestrial vertebrates, are now being modified for insects, especially moths. We describe a complete, open-source machine learning-based software pipeline for automated monitoring of moths via camera traps, including object detection, moth/non-moth classification, fine-grained identification of moth species, and tracking individuals. We believe that our tools, which are already in use across three continents, represent the future of massively scalable data collection in entomology.

Authors: Aditya Jain (Mila); Fagner Cunha (Federal University of Amazonas); Michael Bunsen (Mila, eButterfly); Léonard Pasi (EPFL); Anna Viklund (Daresay); Maxim Larrivee (Montreal Insectarium); David Rolnick (McGill University, Mila)

Ecosystems & Biodiversity Computer Vision & Remote Sensing
NeurIPS 2023 Climate Variable Downscaling with Conditional Normalizing Flows (Papers Track)
Abstract and authors: (click to expand)

Abstract: Predictions of global climate models typically operate on coarse spatial scales due to the large computational costs of climate simulations. This has led to a considerable interest in methods for statistical downscaling, a similar process to super-resolution in the computer vision context, to provide more local and regional climate information. In this work, we apply conditional normalizing flows to the task of climate variable downscaling. This approach allows for a probabilistic interpretation of the predictions, while also capturing the stochasticity inherent in the relationships among fine and coarse spatial scales. We showcase its successful performance on an ERA5 water content dataset for different upsampling factors. Additionally, we show that the method allows us to assess the predictive uncertainty in terms of standard deviation from the fitted conditional distribution mean.

Authors: Christina Elisabeth Winkler (Mila); Paula Harder (Mila); David Rolnick (McGill University, Mila)

Climate Science & Modeling Generative Modeling
NeurIPS 2023 Global Coastline Evolution Forecasting from Satellite Imagery using Deep Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Coastal zones are under increasing pressures due to climate change and the increasing population densities in coastal areas around the globe. Our ability to accurately forecast the evolution of the coastal zone is of critical importance to coastal managers in the context of risk assessment and mitigation. Recent advances in artificial intelligence and remote sensing enable the development of automatic large-scale analysis methodologies based on observation data. In this work, we make use of a novel satellite-derived shoreline forecasting dataset and a variant of the common Encoder-Decoder neural network, UNet, in order to predict shoreline change based on spatio-temporal data. We analyze the importance of including the spatial context at the prediction step and we find that it greatly enhances model performance. Overall, the model presented here demonstrates significant shoreline forecasting skill around the globe, achieving a global correlation of 0.77.

Authors: Guillaume RIU (Laboratory of Spatial Geophysics and Oceanography Studies); Mahmoud AL NAJAR (Laboratory of Spatial Geophysics and Oceanography Studies); Gregoire THOUMYRE (Laboratory of Spatial Geophysics and Oceanography Studies); Rafael ALMAR (Laboratory of Spatial Geophysics and Oceanography Studies); Dennis Wilson (ISAE)

Oceans & Marine Systems Computer Vision & Remote Sensing
NeurIPS 2023 Can Reinforcement Learning support policy makers? A preliminary study with Integrated Assessment Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Governments around the world aspire to ground decision-making on evidence. Many of the foundations of policy making — e.g. sensing patterns that relate to societal needs, developing evidence-based programs, forecasting potential outcomes of policy changes, and monitoring effectiveness of policy programs — have the potential to benefit from the use of large-scale datasets or simulations together with intelligent algorithms. These could, if designed and deployed in a way that is well grounded on scientific evidence, enable a more comprehensive, faster, and rigorous approach to policy making. Integrated Assessment Models (IAM) is a broad umbrella covering scientific models that attempt to link main features of society and economy with the biosphere into one modelling framework. At present, these systems are probed by by policy makers and advisory groups in a hypothesis-driven manner. In this paper, we empirically demonstrate that modern Reinforcement Learning can be used to probe IAMs and explore the space of solutions in a more principled manner. While the implication of our results are modest since the environment is simplistic, we believe that this is a stepping stone towards more ambitious use cases, which could allow for effective exploration of policies and understanding of their consequences and limitations.

Authors: Theodore LM Wolf (Carbon Re); Nantas Nardelli (CarbonRe); John Shawe-Taylor (University College London); Maria Perez-Ortiz (University College London)

Public Policy Societal Adaptation & Resilience
NeurIPS 2023 IceCloudNet: Cirrus and mixed-phase cloud prediction from SEVIRI input learned from sparse supervision (Papers Track)
Abstract and authors: (click to expand)

Abstract: Clouds containing ice particles play a crucial role in the climate system. Yet they remain a source of great uncertainty in climate models and future climate projections. In this work, we create a new observational constraint of regime-dependent ice microphysical properties at the spatio-temporal coverage of geostationary satellite instruments and the quality of active satellite retrievals. We achieve this by training a convolutional neural network on three years of SEVIRI and DARDAR data sets. This work will enable novel research to improve ice cloud process understanding and hence, reduce uncertainties in a changing climate and help assess geoengineering methods for cirrus clouds.

Authors: Kai Jeggle (ETH Zurich); Mikolaj Czerkawski (ESA); Federico Serva (European Space Agency, Italian Space Agency); Bertrand Le Saux (European Space Agency (ESA)); David Neubauer (ETH Zurich); Ulrike Lohmann (ETH Zurich)

Climate Science & Modeling Computer Vision & Remote Sensing
NeurIPS 2023 SAM-CD: Change Detection in Remote Sensing Using Segment Anything Model (Papers Track)
Abstract and authors: (click to expand)

Abstract: In remote sensing, Change Detection (CD) refers to locating surface changes in the same area over time. Changes can occur due to man-made or natural activities, and CD is important for analyzing climate changes. The recent advancements in satellite imagery and deep learning allow the development of affordable and powerful CD solutions. The breakthroughs in computer vision Foundation Models (FMs) bring new opportunities for better and more flexible remote sensing solutions. However, solving CD using FMs has not been explored before and this work presents the first FM-based deep learning model, SAM-CD. We propose a novel model that adapts the Segment Anything Model (SAM) for solving CD. The experimental results show that the proposed approach achieves the state of the art when evaluated on two challenging benchmark public datasets LEVIR-CD and DSIFN-CD.

Authors: Faroq ALTam (Elm Company); Thariq Khalid (Elm Company); Athul Mathew (Elm Company); Andrew Carnell (Elm Company); Riad Souissi (Elm Company)

Computer Vision & Remote Sensing Cities & Urban Planning
NeurIPS 2023 Segment-then-Classify: Few-shot instance segmentation for environmental remote sensing (Papers Track)
Abstract and authors: (click to expand)

Abstract: Instance segmentation is pivotal for environmental sciences and climate change research, facilitating important tasks from land cover classification to glacier monitoring. This paper addresses the prevailing challenges associated with data scarcity when using traditional models like YOLOv8 by introducing a novel, data-efficient workflow for instance segmentation. The proposed Segment-then-Classify (STC) strategy leverages the zero-shot capabilities of the novel Segment Anything Model (SAM) to segment all objects in an image and then uses a simple classifier such as the Vision Transformer (ViT) to identify objects of interest thereafter. Evaluated on the VHR-10 dataset, our approach demonstrated convergence with merely 40 examples per class. YOLOv8 requires 3 times as much data to achieve the STC's peak performance. The highest performing class in the VHR-10 dataset achieved a near-perfect mAP@0.5 of 0.99 using the STC strategy. However, performance varied greatly across other classes due to the SAM model’s occasional inability to recognize all relevant objects, indicating a need for refining the zero-shot segmentation step. The STC workflow therefore holds promise for advancing few-shot learning for instance segmentation in environmental science.

Authors: Yang Hu (University of California, Santa Barbara); Kelly Caylor (UCSB); Anna S Boser (UCSB)

Computer Vision & Remote Sensing Earth Observation & Monitoring
NeurIPS 2023 Surrogate modeling based History Matching for an Earth system model of intermediate complexity (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate General Circulation Models (GCMs) constitute the primary tools for climate projections that inform IPCC Assessment Reports. Calibrating, or tuning the parameters of the models can significantly improve their predictions, thus their scientific and societal impacts. Unfortunately, traditional tuning techniques remain time-consuming and computationally costly, even at coarse resolution. A specific challenge for the tuning of climate models lies in the tuning of both fast and slow climatic features: while atmospheric processes adjust on hourly to weekly timescales, vegetation or ocean dynamics drive mechanisms of variability at decadal to millenial timescales. In this work, we explore whether and how History Matching, which uses machine learning based emulators to accelerate and automate the tuning process, is relevant for tuning climate models with multiple timescales. To facilitate this exploration, we work with a climate model of intermediate complexity, yet test experimental tuning protocols that can be directly applied to more complex GCMs to reduce uncertainty in climate projections.

Authors: Maya Janvier (Centrale Supélec); Redouane Lguensat (IPSL); Julie Deshayes (LOCEAN IPSL); Aurélien Quiquet (LSCE); Didier Roche (LSCE); V. Balaji (Schmidt Futures)

Climate Science & Modeling
NeurIPS 2023 FireSight: Short-Term Fire Hazard Prediction Based on Active Fire Remote Sensing Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: Wildfires are becoming unpredictable natural hazards in many regions due to climate change. However, existing state-of-the-art wildfire forecasting tools, such as the Fire Weather Index (FWI), rely solely on meteorological input parameters and have limited ability to model the increasingly dynamic nature of wildfires. In response to the escalating threat, our work addresses this shortcoming in short-term fire hazard prediction. First, we present a comprehensive and high fidelity remotely sensed active fire dataset fused from over 20 satellites. Second, we develop region-specific ML-based 3-7 day wildfire hazard prediction models for hazard South America, Australia, and Southern Europe. The different models cover pixel-wise, spatial and spatio-temporal architectures, and utilize weather, fuel and location data. We evaluate the models using time-based cross-validation and can show superior performance with a PR-AUC score up to 44 times higher compared to the baseline FWI model. Using explainable AI methods, we show that these data-driven models are also capable of learning meaningful physical patterns and inferring region-specific wildfire drivers.

Authors: Julia Gottfriedsen (OroraTech GmbH); Johanna Strebl (OroraTech GmbH); Max Berrendorf (Ludwig-Maximilians-Universität München); Martin Langer (OroraTech GmbH); Volker Tresp (Ludwig-Maximilians-Universität München)

Disaster Management and Relief Extreme Weather
NeurIPS 2023 Scaling Sodium-ion Battery Development with NLP (Papers Track)
Abstract and authors: (click to expand)

Abstract: Sodium-ion batteries (SIBs) have been gaining attention for applications like grid-scale energy storage, largely owing to the abundance of sodium and an expected favorable $/kWh figure. SIBs can leverage the well-established manufacturing knowledge of Lithium-ion Batteries (LIBs), but several materials synthesis and performance challenges for electrode materials need to be addressed. This work extracts a large database of challenges restricting the performance and synthesis of SIB cathode active materials (CAMs) and pairs them with corresponding mitigation strategies from the SIB literature by employing custom natural language processing (NLP) tools. The derived insights enable scientists in research and industry to navigate a large number of proposed strategies and focus on impactful scalability-informed mitigation strategies to accelerate the transition from lab to commercialization.

Authors: Mrigi Munjal (Massachusetts Institute of Technology); Thorben Pein (TU Munich); Vineeth Venugopal (Massachusetts Institute of Technology); Kevin Huang (Massachusetts Institute of Technology); Elsa Olivetti (Massachusetts Institute of Technology)

Chemistry & Materials Power & Energy
NeurIPS 2023 The built environment and induced transport CO2 emissions: A double machine learning approach to account for residential self-selection (Papers Track)
Abstract and authors: (click to expand)

Abstract: Understanding why travel behavior differs between residents of urban centers and suburbs is key to sustainable urban planning. Especially in light of rapid urban growth, identifying housing locations that minimize travel demand and induced CO2 emissions is crucial to mitigate climate change. While the built environment plays an important role, the precise impact on travel behavior is obfuscated by residential self-selection. To address this issue, we propose a double machine learning approach to obtain unbiased, spatially-explicit estimates of the effect of the built environment on travel-related CO2 emissions for each neighborhood by controlling for residential self-selection. We examine how socio-demographics and travel-related attitudes moderate the effect and how it decomposes across the 5Ds of the built environment. Based on a case study for Berlin and the travel diaries of 32,000 residents, we find that the built environment causes household travel-related CO2 emissions to differ by a factor of almost two between central and suburban neighborhoods in Berlin. To highlight the practical importance for urban climate mitigation, we evaluate current plans for 64,000 new residential units in terms of total induced transport CO2 emissions. Our findings underscore the significance of spatially differentiated compact development to decarbonize the transport sector.

Authors: Florian Nachtigall (Technical University of Berlin); Felix Wagner (TU Berlin, MCC Berlin); Peter Berrill (Technical University of Berlin); Felix Creutzig (Mercator Research Institute on Global Commons and Climate Change (MCC))

Cities & Urban Planning Causal & Bayesian Methods
NeurIPS 2023 Lightweight, Pre-trained Transformers for Remote Sensing Timeseries (Papers Track)
Abstract and authors: (click to expand)

Abstract: Machine learning models for parsing remote sensing data have a wide range of societally relevant applications, but labels used to train these models can be difficult or impossible to acquire. This challenge has spurred research into self-supervised learning for remote sensing data. Current self-supervised learning approaches for remote sensing data draw significant inspiration from techniques applied to natural images. However, remote sensing data has important differences from natural images -- for example, the temporal dimension is critical for many tasks and data is collected from many complementary sensors. We show we can create significantly smaller performant models by designing architectures and self-supervised training techniques specifically for remote sensing data. We introduce the Pretrained Remote Sensing Transformer (Presto), a transformer-based model pre-trained on remote sensing pixel-timeseries data. Presto excels at a wide variety of globally distributed remote sensing tasks and performs competitively with much larger models while requiring far less compute. Presto can be used for transfer learning or as a feature extractor for simple models, enabling efficient deployment at scale.

Authors: Gabriel Tseng (NASA Harvest); Ruben Cartuyvels (KULeuven); Ivan Zvonkov (University of Maryland); Mirali Purohit (Arizona State University (ASU)); David Rolnick (McGill University, Mila); Hannah R Kerner (Arizona State University)

Earth Observation & Monitoring Unsupervised & Semi-Supervised Learning
NeurIPS 2023 Top-down Green-ups: Satellite Sensing and Deep Models to Predict Buffelgrass Phenology (Papers Track)
Abstract and authors: (click to expand)

Abstract: An invasive species of grass known as "buffelgrass" contributes to severe wildfires and biodiversity loss in the Southwest United States. We tackle the problem of predicting buffelgrass "green-ups" (i.e. readiness for herbicidal treatment). To make our predictions, we explore temporal, visual and multi-modal models that combine satellite sensing and deep learning. We find that all of our neural-based approaches improve over conventional buffelgrass green-up models, and discuss how neural model deployment promises significant resource savings.

Authors: Lucas Rosenblatt (NYU); Bin Han (University of Washington); Erin Posthumus (USA NPN); Theresa Crimmins (USA NPN); Bill G Howe (University of Washington)

Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2023 Typhoon Intensity Prediction with Vision Transformer (Papers Track)
Abstract and authors: (click to expand)

Abstract: Predicting typhoon intensity accurately across space and time is crucial for issuing timely disaster warnings and facilitating emergency response. This has vast potential for minimizing life losses and property damages as well as reducing economic and environmental impacts. Leveraging satellite imagery for scenario analysis is effective but also introduces additional challenges due to the complex relations among clouds and the highly dynamic context. Existing deep learning methods in this domain rely on convolutional neural networks (CNNs), which suffer from limited per-layer receptive fields. This limitation hinders their ability to capture long-range dependencies and global contextual knowledge during inference. In response, we introduce a novel approach, namely "Typhoon Intensity Transformer" (Tint), which leverages self-attention mechanisms with global receptive fields per layer. Tint adopts a sequence-to-sequence feature representation learning perspective. It begins by cutting a given satellite image into a sequence of patches and recursively employs self-attention operations to extract both local and global contextual relations between all patch pairs simultaneously, thereby enhancing per-patch feature representation learning. Extensive experiments on a publicly available typhoon benchmark validate the efficacy of Tint in comparison with both state-of-the-art deep learning and conventional meteorological methods. Our code is available at https://github.com/chen-huanxin/Tint.

Authors: Huanxin Chen (South China University of Technology); Pengshuai Yin (South China University of Technology); Huichou Huang (City University of Hong Kong); Qingyao Wu (South China University of Technology); Ruirui Liu (Brunel University London); Xiatian Zhu (University of Surrey)

Computer Vision & Remote Sensing Climate Science & Modeling
NeurIPS 2023 Data Assimilation using ERA5, ASOS, and the U-STN model for Weather Forecasting over the UK (Papers Track)
Abstract and authors: (click to expand)

Abstract: In recent years, the convergence of data-driven machine learning models with Data Assimilation (DA) offers a promising avenue for enhancing weather forecasting. This study delves into this emerging trend, presenting our methodologies and outcomes. We harnessed the UK's local ERA5 850 hPa temperature data and refined the U-STN12 global weather forecasting model, tailoring its predictions to the UK's climate nuances. From the ASOS network, we sourced t2m data, representing ground observations across the UK. We employed the advanced kriging method with a polynomial drift term for consistent spatial resolution. Furthermore, Gaussian noise was superimposed on the ERA5 T850 data, setting the stage for ensuing multi-time step virtual observations. Probing into the assimilation impacts, the ASOS t2m data was integrated with the ERA5 T850 dataset. Our insights reveal that while global forecast models can adapt to specific regions, incorporating atmospheric data in DA significantly bolsters model accuracy. Conversely, the direct assimilation of surface temperature data tends to mitigate this enhancement, tempering the model's predictive prowess.

Authors: WENQI WANG (Imperial College London); Jacob Bieker (Open Climate Fix); Rossella Arcucci (Imperial College London); Cesar Quilodran-Casas (Imperial College London)

Earth Observation & Monitoring Uncertainty Quantification & Robustness
NeurIPS 2023 Stress-testing the coupled behavior of hybrid physics-machine learning climate simulations on an unseen, warmer climate (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurate and computationally-viable representations of clouds and turbulence are a long-standing challenge for climate model development. Traditional parameterizations that crudely but efficiently approximate these processes are a leading source of uncertainty in long-term projected warming and precipitation patterns. Machine Learning (ML)-based parameterizations have long been hailed as a promising alternative with the potential to yield higher accuracy at a fraction of the cost of more explicit simulations. However, these ML variants are often unpredictably unstable and inaccurate in online testing (i.e. in a downstream hybrid simulation task where they are dynamically coupled to the large-scale climate model). These issues are exacerbated in out-of-distribution climates. Certain design decisions such as ``climate-invariant" feature transformation, input vector expansion, and temporal history incorporation have been shown to improve online performance, but they may be insufficient for the mission-critical task of online out-of-distribution generalization. If feature selection and transformations can inoculate hybrid physics-ML climate models from non-physical out-of-distribution extrapolation in a changing climate, there is far greater potential in extrapolating from observational data. Otherwise, training on multiple simulated climates becomes an inevitable necessity. While our results show generalization benefits from these design decisions, such benefits do not sufficiently preclude the necessity of using multi-climate simulated training data.

Authors: Jerry Lin (University of California, Irvine); Mohamed Aziz Bhouri (Columbia University); Tom G Beucler (Columbia University & UCI); Sungduk Yu (University of California, Irvine); Michael Pritchard (UCI)

Climate Science & Modeling Hybrid Physical Models
NeurIPS 2023 Deploying Reinforcement Learning based Economizer Optimization at Scale (Papers Track)
Abstract and authors: (click to expand)

Abstract: Building operations account for a significant portion of global emissions, contributing approximately 28\% of global greenhouse gas emissions. With anticipated increase in cooling demand due to rising global temperatures, the optimization of rooftop units (RTUs) in buildings becomes crucial for reducing emissions. We focus on the optimization of the economizer logic within RTUs, which balances the mix of indoor and outdoor air. By effectively utilizing free outside air, economizers can significantly decrease mechanical energy usage, leading to reduced energy costs and emissions. We introduce a reinforcement learning (RL) approach that adaptively controls the economizer based on the unique characteristics of individual facilities. We have trained and deployed our solution in the real-world across a distributed building stock. We address the scaling challenges with our cloud-based RL deployment on 10K+ RTUs across 200+ sites.

Authors: Ivan Cui (Amazon); Wei Yih Yap (Amazon); Charles Prosper (Independant); Bharathan Balaji (Amazon); Jake Chen (Amazon)

Buildings Reinforcement Learning
NeurIPS 2023 The Power of Explainability in Forecast-Informed Deep Learning Models for Flood Mitigation (Papers Track)
Abstract and authors: (click to expand)

Abstract: Floods can cause horrific harm to life and property. However, they can be mitigated or even avoided by the effective use of hydraulic structures such as dams, gates, and pumps. By pre-releasing water via these structures in advance of extreme weather events, water levels are sufficiently lowered to prevent floods. In this work, we propose FIDLAR, a Forecast Informed Deep Learning Architecture, achieving flood management in watersheds with hydraulic structures in an optimal manner by balancing out flood mitigation and unnecessary wastage of water via pre-releases. We perform experiments with FIDLAR using data from the South Florida Water Management District, which manages a coastal area that is highly prone to frequent storms and floods. Results show that FIDLAR performs better than the current state-of-the-art with several orders of magnitude speedup and with provably better pre-release schedules. The dramatic speedups make it possible for FIDLAR to be used for real-time flood management. The main contribution of this paper is the effective use of tools for model explainability, allowing us to understand the contribution of the various environmental factors towards its decisions.

Authors: Jimeng Shi (Florida International University); Vitalii Stebliankin (FIU); Giri Narasimhan (Florida International University)

Disaster Management and Relief Interpretable ML
NeurIPS 2023 Surrogate Neural Networks to Estimate Parametric Sensitivity of Ocean Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Modeling is crucial to understanding the effect of greenhouse gases, warming, and ice sheet melting on the ocean. At the same time, ocean processes affect phenomena such as hurricanes and droughts. Parameters in the models that cannot be physically measured have a significant effect on the model output. For an idealized ocean model, we generate perturbed parameter ensemble data and generate surrogate neural network models. The neural surrogates accurately predicted the one-step forward dynamics, of which we then computed the parametric sensitivity.

Authors: Yixuan Sun (Argonne National Laboratory); Elizabeth Cucuzzella (Tufts University); Steven Brus (Argonne National Laboratory); Sri Hari Krishna Narayanan (Argonne National Laboratory); Balu Nadiga (Los Alamos National Lab); Luke Van Roekel (Los Alamos National Laboratory); Jan Hückelheim (Argonne National Laboratory); Sandeep Madireddy (Argonne National Laboratory)

Oceans & Marine Systems
NeurIPS 2023 Towards a spatio-temporal deep learning approach to predict malaria outbreaks using earth observation measurements in South Asia (Papers Track)
Abstract and authors: (click to expand)

Abstract: Environmental indicators can play a crucial role in forecasting infectious disease outbreaks, holding promise for community-level interventions. Yet, significant gaps exist in the literature regarding the influence of changes in environmental conditions on disease spread over time and across different regions and climates making it challenging to obtain reliable forecasts. This paper aims to propose an approach to predict malaria incidence over time and space by employing a multi-dimensional long short-term memory model (M-LSTM) to simultaneously analyse environmental indicators such as vegetation, temperature, night-time lights, urban/rural settings, and precipitation. We developed and validated a spatio-temporal data fusion approach to predict district-level malaria incidence rates for the year 2017 using spatio-temporal data from 2000 to 2016 across three South Asian countries: Pakistan, India, and Bangladesh. In terms of predictive performance the proposed M-LSTM model results in lower country-specific error rates compared to existing spatio-temporal deep learning models. The data and code have been made publicly available at the study GitHub repository.

Authors: Usman Nazir (Lahore University of Management Sciences); Ahzam Ejaz (Lahore University of Management Sciences); Muhammad Talha Quddoos (Lahore University of Management Sciences); Momin Uppal (Lahore University of Management Sciences); Sara khalid (University of Oxford)

Health Climate Science & Modeling
NeurIPS 2023 Contextual Reinforcement Learning for Offshore Wind Farm Bidding (Papers Track)
Abstract and authors: (click to expand)

Abstract: We propose a framework for applying reinforcement learning to contextual two-stage stochastic optimization and apply this framework to the problem of energy market bidding of an off-shore wind farm. Reinforcement learning could potentially be used to learn close to optimal solutions for first stage variables of a two-stage stochastic program under different contexts. Under the proposed framework, these solutions would be learned without having to solve the full two-stage stochastic program. We present initial results of training using the DDPG algorithm and present intended future steps to improve performance.

Authors: David Cole (University of Wisconsin-Madison); Himanshu Sharma (Pacific Northwest National Laboratory); Wei Wang (Pacific Northwest National Laboratory)

Power & Energy Reinforcement Learning
NeurIPS 2023 CityTFT: Temporal Fusion Transformer for Urban Building Energy Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: Urban Building Energy Modeling (UBEM) is an emerging method to investigate urban design and energy systems against the increasing energy demand at urban and neighborhood levels. However, current UBEM methods are mostly physic-based and time-consuming in multiple climate change scenarios. This work proposes CityTFT, a data-driven UBEM framework, to accurately model the energy demands in urban environments. With the empowerment of the underlying TFT framework and an augmented loss function, CityTFT could predict heating and cooling triggers in unseen climate dynamics with an F1 score of 99.98 \% while RMSE of loads of 13571.3750 Wh.

Authors: Ting-Yu Dai (The University of Texas at Austin); Dev Niyogi (The University of Texas at Austin); Zoltan Nagy (The University of Texas at Austin)

Buildings Time-series Analysis
NeurIPS 2023 A Scalable Network-Aware Multi-Agent Reinforcement Learning Framework for Distributed Converter-based Microgrid Voltage Control (Papers Track)
Abstract and authors: (click to expand)

Abstract: Renewable energy plays a crucial role in mitigating climate change. With the rising use of distributed energy resources (DERs), microgrids (MGs) have emerged as a solution to accommodate high DER penetration. However, controlling MGs' voltage during islanded operation is challenging due to system's nonlinearity and stochasticity. Although multi-agent reinforcement learning (MARL) methods have been applied to distributed MG voltage control, they suffer from bad scalability and are found difficult to control the MG with a large number of DGs due to the well-known curse of dimensionality. To address this, we propose a scalable network-aware reinforcement learning framework which exploits network structure to truncate the critic's Q-function to achieve scalability. Our experiments show effective control of a MG with up to 84 DGs, surpassing the existing maximum of 40 agents in the existing literature. We also compare our framework with state-of-the-art MARL algorithms to show the superior scalability of our framework.

Authors: Han Xu (Tsinghua University); Guannan Qu (Carnegie Mellon University)

Power & Energy Reinforcement Learning
NeurIPS 2023 Reinforcement Learning in agent-based modeling to reduce carbon emissions in transportation (Papers Track)
Abstract and authors: (click to expand)

Abstract: This paper explores the integration of reinforcement learning (RL) into transportation simulations to explore system interventions to reduce greenhouse gas emissions. The study leverages the Behavior, Energy, Automation, and Mobility (BEAM) transportation simulation framework in conjunction with the Berkeley Integrated System for Transportation Optimization (BISTRO) for scenario development. The main objective is to determine optimal parameters for transportation simulations to increase public transport usage and reduce individual vehicle reliance. Initial experiments were conducted on a simplified transportation scenario, and results indicate that RL can effectively find system interventions that increase public transit usage and decrease transportation emissions.

Authors: Yuhao Yuan (UC Berkeley); Felipe Leno da Silva (Lawrence Livermore National Laboratory); Ruben Glatt (Lawrence Livermore National Laboratory)

Transportation Reinforcement Learning
NeurIPS 2023 Continuous Convolutional Neural Networks for Disruption Prediction in Nuclear Fusion Plasmas (Papers Track)
Abstract and authors: (click to expand)

Abstract: Grid decarbonization for climate change requires dispatchable carbon-free energy like nuclear fusion. The tokamak concept offers a promising path for fusion, but one of the foremost challenges in implementation is the occurrence of energetic plasma disruptions. In this study, we delve into Machine Learning approaches to predict plasma state outcomes. Our contributions are twofold: (1) We present a novel application of Continuous Convolutional Neural Networks for disruption prediction and (2) We examine the advantages and disadvantages of continuous models over discrete models for disruption prediction by comparing our model with the previous, discrete state of the art, and show that continuous models offer significantly better performance (Area Under the Receiver Operating Characteristic Curve = 0.974 v.s. 0.799) with fewer parameters.

Authors: William F Arnold (KAIST); Lucas Spangher (MIT PSFC); Cristina Rea (MIT PSFC)

Time-series Analysis Hybrid Physical Models
NeurIPS 2023 Comparing Data-Driven and Mechanistic Models for Predicting Phenology in Deciduous Broadleaf Forests (Papers Track)
Abstract and authors: (click to expand)

Abstract: Understanding the future climate is crucial for informed policy decisions on climate change prevention and mitigation. Earth system models play an important role in predicting future climate, requiring accurate representation of complex sub- processes that span multiple time scales and spatial scales. One such process that links seasonal and interannual climate variability to cyclical biological events is tree phenology in deciduous forests. Phenological dates, such as the start and end of the growing season, are critical for understanding the exchange of carbon and water between the biosphere and the atmosphere. Mechanistic prediction of these dates is challenging. Hybrid modelling, which integrates data-driven approaches into complex models, offers a solution. In this work, as a first step towards this goal, train a deep neural network to predict a phenological index from meteorological time series. We find that this approach outperforms traditional process-based models. This highlights the potential of data-driven methods to improve climate predictions. We also analyze which variables and aspects of the time series influence the predicted onset of the season, in order to gain a better understanding of the advantages and limitations of our model.

Authors: Christian Reimers (Department of Biogeochemical Integration, Max Planck Institute for Biogeochemistry); David Hafezi Rachti (Department of Biogeochemical Integration, Max Planck Institute for Biogeochemistry); Guohua Liu (Department of Biogeochemical Integration, Max Planck Institute for Biogeochemistry); Alexander Winkler (Department of Biogeochemical Integration, Max Planck Institute for Biogeochemistry)

Climate Science & Modeling Interpretable ML
NeurIPS 2023 RMM-VAE: a machine learning method for identifying probabilistic weather regimes targeted to a local-scale impact variable (Papers Track)
Abstract and authors: (click to expand)

Abstract: Identifying large-scale atmospheric patterns that modulate extremes in local-scale variables such as precipitation has the potential to improve long-term climate projections as well as extended-range forecasting skill. This paper proposes a novel probabilistic machine learning method, RMM-VAE, based on a variational autoencoder architecture for identifying weather regimes targeted to a local-scale impact variable. The new method is compared to three existing methods in the task of identifying robust weather regimes that are predictive of precipitation over Morocco while capturing the full phase space of atmospheric dynamics over the Mediterranean. RMM-VAE performs well across these different objectives, outperforming linear methods in reconstructing the full phase space and predicting the target variable, highlighting the potential benefit of applying the method to various climate applications such as downscaling and extended-range forecasting.

Authors: Fiona R Spuler (University of Reading); Marlene Kretschmer (Universität Leipzig); Yevgeniya Kovalchuck (University College London); Magdalena Balmaseda (ECMWF); Ted Shepherd (University of Reading)

Generative Modeling Climate Science & Modeling
NeurIPS 2023 Accelerating GHG Emissions Inference: A Lagrangian Particle Dispersion Model Emulator Using Graph Neural Networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: Inverse modelling systems relying on Lagrangian Particle Dispersion Models (LPDMs) are a popular way to quantify greenhouse gas (GHG) emissions using atmospheric observations, providing independent validation to countries' self-reported emissions. However, the increased volume of satellite measurements cannot be fully leveraged due to computational bottlenecks. Here, we propose a data-driven architecture with Graph Neural Networks that emulates the outputs of LPDMs using only meteorological inputs, and demonstrate it in application with preliminary results for satellite measurements over Brazil.

Authors: Elena Fillola (University of Bristol); Raul Santos Rodriguez (University of Bristol); Matt Rigby (University of Bristol)

Climate Science & Modeling
NeurIPS 2023 Gaussian Processes for Monitoring Air-Quality in Kampala (Papers Track)
Abstract and authors: (click to expand)

Abstract: Monitoring air pollution is of vital importance to the overall health of the population. Unfortunately, devices that can measure air quality can be expensive, and many cities in low and middle-income countries have to rely on a sparse allocation of them. In this paper, we investigate the use of Gaussian Processes for both nowcasting the current air-pollution in places where there are no sensors and forecasting the air-pollution in the future at the sensor locations. In particular, we focus on the city of Kampala in Uganda, using data from AirQo's network of sensors. We demonstrate the advantage of removing outliers, compare different kernel functions and additional inputs. We also compare two sparse approximations to allow for the large amounts of temporal data in the dataset.

Authors: Clara Stoddart (Imperial College London); Lauren Shrack (Massachusetts Institute of Technology); Usman Abdul-Ganiy (AirQo, Makerere University); Richard Sserunjogi (AirQo, Makerere University); Engineer Bainomugisha (AirQo, Makerere University); Deo Okure (AirQo, Makerere University); Ruth Misener (Imperial College London); Jose Pablo Folch (Imperial College London); Ruby Sedgwick (Imperial College London)

Causal & Bayesian Methods Health
NeurIPS 2023 Large Scale Masked Autoencoding for Reducing Label Requirements on SAR Data (Papers Track) Overall Best Paper
Abstract and authors: (click to expand)

Abstract: Satellite-based remote sensing is instrumental in the monitoring and mitigation of the effects of anthropogenic climate change. Large scale, high resolution data derived from these sensors can be used to inform intervention and policy decision making, but the timeliness and accuracy of these interventions is limited by use of optical data, which cannot operate at night and is affected by adverse weather conditions. Synthetic Aperture Radar (SAR) offers a robust alternative to optical data, but its associated complexities limit the scope of labelled data generation for traditional deep learning. In this work, we apply a self-supervised pretraining scheme, masked autoencoding, to SAR amplitude data covering 8.7\% of the Earth's land surface area, and tune the pretrained weights on two downstream tasks crucial to monitoring climate change - vegetation cover prediction and land cover classification. We show that the use of this pretraining scheme reduces labelling requirements for the downstream tasks by more than an order of magnitude, and that this pretraining generalises geographically, with the performance gain increasing when tuned downstream on regions outside the pretraining set. Our findings significantly advance climate change mitigation by facilitating the development of task and region-specific SAR models, allowing local communities and organizations to deploy tailored solutions for rapid, accurate monitoring of climate change effects.

Authors: Matthew J Allen (University of Cambridge); Francisco Dorr (Independent); Joseph A Gallego (National University Of Colombia); Laura Martínez-Ferrer (University of Valencia); Freddie Kalaitzis (University of Oxford); Raul Ramos-Pollan (Universidad de Antioquia); Anna Jungbluth (European Space Agency)

Computer Vision & Remote Sensing Earth Observation & Monitoring
NeurIPS 2023 Sim2Real for Environmental Neural Processes (Papers Track)
Abstract and authors: (click to expand)

Abstract: Machine learning (ML)-based weather models have recently undergone rapid improvements.These models are typically trained on gridded reanalysis data from numerical data assimilation systems. However, reanalysis data comes with limitations, such as assumptions about physical laws and low spatiotemporal resolution. The gap between reanalysis and reality has sparked growing interest in training ML models directly on observations such as weather stations. Modelling scattered and sparse environmental observations requires scalable and flexible ML architectures, one of which is the convolutional conditional neural process (ConvCNP). ConvCNPs can learn to condition on both gridded and off-the-grid context data to make uncertainty-aware predictions at target locations. However, the sparsity of real observations presents a challenge for data-hungry deep learning models like the ConvCNP. One potential solution is `Sim2Real': pre-training on reanalysis and fine-tuning on observational data. We analyse Sim2Real with a ConvCNP trained to interpolate surface air temperature over Germany, using varying numbers of weather stations for fine-tuning. On held-out weather stations, Sim2Real training substantially outperforms the same model trained only with reanalysis data or only with station data, showing that reanalysis data can serve as a stepping stone for learning from real observations. Sim2Real could enable more accurate models for climate change monitoring and adaptation.

Authors: Jonas Scholz (University of Cambridge)

Meta- and Transfer Learning Climate Science & Modeling
NeurIPS 2023 Methane Plume Detection with U-Net Segmentation on Sentinel-2 Image Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: Methane emissions have a significant impact on increasing global warming. Satellite-based methane detection methods can help mitigate methane emissions, as they provide a constant and global detection. The Sentinel-2 constellation, in particular, offers frequent and publicly accessible images on a global scale. We propose a deep learning approach to detect methane plumes from Sentinel-2 images. We construct a dataset of 5200 satellite images with identified methane plumes, on which we train a U-Net model. Preliminary results demonstrate that the model is able to correctly identify methane plumes on training data, although generalization to new methane plumes remains challenging. All code, data, and models are made available online.

Authors: Berenice du Baret (ISAE-Supaero); Simon Finos (ISAE-Supaero); Hugo Guiglion (ISAE-Supaero); Dennis Wilson (ISAE)

Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2023 Machine Learning Assisted Bayesian Calibration of Model Physics Parameters for Wetland Methane Emissions: A Case Study at a FLUXNET-CH4 Site (Papers Track)
Abstract and authors: (click to expand)

Abstract: Methane (CH4) possesses a notably higher warming potential than carbon dioxide despite its lower atmospheric concentration, making it integral to global climate dynamics. Wetlands stand out as the predominant natural contributor to global methane emissions. Accurate modeling of methane emissions from wetlands is crucial for understanding and predicting climate change dynamics. However, such modeling efforts are often constrained by the inherent uncertainties in model parameters. Our work leverages machine learning (ML) to calibrate five physical parameters of the Energy Exascale Earth System Model (E3SM) land model (ELM) to improve the model’s accuracy in simulating wetland methane emissions. Unlike traditional deterministic calibration methods that target a single set of optimal values for each parameter, Bayesian calibration takes a probabilistic approach and enables capturing the inherent uncertainties in complex systems and providing robust parameter distributions for reliable predictions. However, Bayesian calibration requires numerous model runs and makes it computationally expensive. We employed an ML algorithm, Gaussian process regression (GPR), to emulate the ELM’s methane model, which dramatically reduced the computational time from 6 CPU hours to just 0.72 milliseconds per simulation. We exemplified the procedure at a representative FLUXNET-CH4 site (US-PFa) with the longest continuous methane emission data. Results showed that the default values for two of the five parameters examined were not aligned well with their respective posterior distributions, suggesting that the model’s default parameter values might not always be optimal for all sites, and that site-specific analysis is warranted. In particular, analyses at sites with different vegetation types and wetland characteristics could reveal more useful insights for understanding methane emissions modeling.

Authors: Sandeep Chinta (Massachusetts Institute of Technology); Xiang Gao (Massachusetts Institute of Technology); Qing Zhu (Lawrence Berkeley National Laboratory)

Climate Science & Modeling
NeurIPS 2023 ALAS: Active Learning for Autoconversion Rates Prediction from Satellite Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: High-resolution simulations, such as the ICOsahedral Non-hydrostatic Large-Eddy Model (ICON-LEM), provide valuable insights into the complex interactions among aerosols, clouds, and precipitation, which are the major contributors to climate change uncertainty. However, due to their exorbitant computational costs, they can only be employed for a limited period and geographical area. To address this, we propose a more cost-effective method powered by an emerging machine learning approach to better understand the intricate dynamics of the climate system. Our approach involves active learning techniques by leveraging high-resolution climate simulation as an oracle that is queried based on an abundant amount of unlabeled data drawn from satellite observations. In particular, we aim to predict autoconversion rates, a crucial step in precipitation formation, while significantly reducing the need for a large number of labeled instances. In this study, we present novel methods: custom query strategy fusion for labeling instances -- weight fusion (WiFi) and merge fusion (MeFi) -- along with active feature selection based on SHAP. These methods are designed to tackle real-world challenges -- in this case, climate change, with a specific focus on the prediction of autoconversion rates -- due to their simplicity and practicality in application.

Authors: Maria C Novitasari (University College London); Johanness Quaas (Universität Leipzig); Miguel Rodrigues (University College London)

Active Learning Climate Science & Modeling
NeurIPS 2023 Towards Causal Representations of Climate Model Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate models, such as Earth system models (ESMs), are crucial for simulating future climate change based on projected Shared Socioeconomic Pathways (SSP) greenhouse gas emissions scenarios. While ESMs are sophisticated and invaluable, machine learning-based emulators trained on existing simulation data can project additional climate scenarios much faster and are computationally efficient. However, they often lack generalizability and interpretability. This work delves into the potential of causal representation learning, specifically the Causal Discovery with Single-parent Decoding (CDSD) method, which could render climate model emulation efficient and interpretable. We evaluate CDSD on multiple climate datasets, focusing on emissions, temperature, and precipitation. Our findings shed light on the challenges, limitations, and promise of using CDSD as a stepping stone towards more interpretable and robust climate model emulation.

Authors: Julien Boussard (Columbia University); Chandni Nagda (University of Illinois at Urbana-Champaign); Julia Kaltenborn (McGill University); Charlotte Lange (Mila); Yaniv Gurwicz (Intel Labs); Peer Nowack (Grantham Institute, Imperial College London. Department of Physics, Imperial College. Data Science Institute, Imperial College. School of Environmental Sciences, University of East Anglia); David Rolnick (McGill University, Mila)

Climate Science & Modeling Causal & Bayesian Methods
NeurIPS 2023 ClimateX: Do LLMs Accurately Assess Human Expert Confidence in Climate Statements? (Papers Track)
Abstract and authors: (click to expand)

Abstract: Evaluating the accuracy of outputs generated by Large Language Models (LLMs) is especially important in the climate science and policy domain. We introduce the Expert Confidence in Climate Statements (ClimateX) dataset, a novel, curated, expert-labeled dataset consisting of 8094 climate statements collected from the latest Intergovernmental Panel on Climate Change (IPCC) reports, labeled with their associated confidence levels. Using this dataset, we show that recent LLMs can classify human expert confidence in climate-related statements, especially in a few-shot learning setting, but with limited (up to 47%) accuracy. Overall, models exhibit consistent and significant over-confidence on low and medium confidence statements. We highlight implications of our results for climate communication, LLMs evaluation strategies, and the use of LLMs in information retrieval systems.

Authors: Romain Lacombe (Stanford University); Kerrie Wu (Stanford University); Eddie Dilworth (Stanford University)

Natural Language Processing Public Policy
NeurIPS 2023 Improving Flood Insights: Diffusion-based SAR to EO Image Translation (Papers Track)
Abstract and authors: (click to expand)

Abstract: Driven by the climate crisis, the frequency and intensity of flood events are on the rise. Electro-optical (EO) satellite imagery is commonly used for rapid disaster response. However, its utility in flood situations is limited by cloud cover and during nighttime. An alternative method for flood detection involves using Synthetic Aperture Radar (SAR) data. Despite SAR's advantages over EO in these situations, it has a significant drawback: human analysts often struggle to interpret SAR data. This paper proposes a novel framework, Diffusion-based SAR-to-EO Image Translation (DSE). The DSE framework converts SAR images into EO-like imagery, thereby enhancing their interpretability for human analysis. Experimental results on the Sen1Floods11 and SEN12-FLOOD datasets confirm that the DSE framework provides enhanced visual information and improves performance in all flood segmentation tests.

Authors: Minseok Seo (si-analytics); YoungTack Oh (SI Analytics); Doyi Kim (SI Analytics); Dongmin Kang (SIA); Yeji Choi (SI Analytics)

Disaster Management and Relief Earth Observation & Monitoring
NeurIPS 2023 A Wildfire Vulnerability Index for Businesses Using Machine Learning Approaches (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change and housing growth in ecologically vulnerable areas are increasing the frequency and intensity of wildfire events. These events can have profound impact on communities and the economy, but the financial and operational impacts of wildfire on businesses have not been evaluated extensively yet. This paper presents a Wildfire Vulnerability Index (WVI) that measures the risk of a business failing in the 12 months following a wildfire event. An XGBoost algorithm champion model is compared to two challenger models: 1) a model that extends the champion model by incorporating building and property characteristics and 2) a model that utilizes a neural network approach. Results show that while all models perform well in predicting business failure risk post-disaster event, the model that uses building and property characteristics performs best across all performance metrics. As the natural environment shifts under climate change and more businesses are exposed to the consequences of wildfire, the WVI can help emergency managers allocate disaster aid to businesses at the highest risk of failing and can also provide valuable risk insights for portfolio managers and loan processors.

Authors: Andrew Byrnes (Dun and Bradstreet); Lisa Stites (Dun and Bradstreet)

Disaster Management and Relief
NeurIPS 2023 Exploring Causal Relationship between Environment and Drizzle Properties using Machine Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Cloud and precipitation properties are controlled by both local and large-scale forcings. Current weather and climate models represent clouds and precipitation through parameterizations that are based on theoretical relationships between environment, clouds, and precipitation. However, these relationships vary considerably among different weather and cloud conditions, thereby leading to inaccurate simulation of cloud and precipitation properties. In this study, we use observations from a site in the Eastern North Atlantic Ocean (28W, 39.5N) to establish a potential causal relationship between large-scale environment, cloud, and precipitation properties. We estimate the structure of a directed acyclic graph (DAG) with the NOTEARS algorithm (Non-combinatorial Optimization via Trace Exponential and Augmented lagRangian for Structure learning) (Zheng et al., 2018 \cite{Zheng2018DAGsLearning}) with a multi-layer perceptron (MLP) neural network classification architecture. We classify liquid water path (LWP), rain rate, and rain drop diameter in two classes based on lower and upper quantiles to identify the governing mechanisms responsible for the two tails of the distribution. We also invoke Random Forest classification to compare our causal model results with conventional decision tree-based approaches. We hypothesize the dominant role of cloud LWP and net radiative cooling in controlling the cloud and precipitation properties. In this way, this study demonstrates the application of a causal machine learning method to identify which environmental properties potentially control cloud and precipitation development. These results will be extremely valuable to both observational and numerical modeling communities as they could help improve the current parameterizations in the weather and climate models.

Authors: Piyush Garg (Argonne National Laboratory); Virendra Ghate (Argonne National Laboratory); Maria Cadeddu (Argonne National Laboratory); Bethany Lusch (Argonne National Lab)

Climate Science & Modeling Causal & Bayesian Methods
NeurIPS 2023 Asset Bundling for Wind Power Forecasting (Papers Track)
Abstract and authors: (click to expand)

Abstract: The growing penetration of intermittent, renewable generation in US power grids results in increased operational uncertainty. In that context, accurate forecasts are critical, especially for wind generation, which exhibits large variability and is historically harder to predict. To overcome this challenge, this work proposes a novel Bundle-Predict-Reconcile (BPR) framework that integrates asset bundling, machine learning, and forecast reconciliation techniques to accurately predict wind power at the asset, bundle, and fleet level. Notably, our approach effectively introduces an auxiliary learning task (predicting the bundle-level time series) to help the main learning tasks (fleet-level time series) and proposes new asset-bundling criteria to capture the spatio-temporal dynamics of wind power time series. Extensive numerical experiments are conducted on an industry-size dataset of wind farms, demonstrating the benefits of BPR, which consistently and significantly improves forecast accuracy over the baseline approach, especially at the fleet level.

Authors: Hanyu Zhang (Georgia Institute of Technology); Mathieu Tanneau (Georgia Institute of Technology); Chaofan Huang (Georgia Institute of Technology); Roshan Joseph (Georgia Institute of Technology); Shangkun Wang (Georgia Institute of Technology); Pascal Van Hentenryck (Georgia Institute of Technology)

Power & Energy Time-series Analysis
NeurIPS 2023 Soil Organic Carbon Estimation from Climate-related Features with Graph Neural Network (Papers Track)
Abstract and authors: (click to expand)

Abstract: Soil organic carbon (SOC) plays a pivotal role in the global carbon cycle, impacting climate dynamics and necessitating accurate estimation for sustainable land and agricultural management. While traditional methods of SOC estimation face resolution and accuracy challenges, recent advancements harness remote sensing, machine learning, and high-resolution satellite mapping. Graph Neural Networks (GNNs), especially when integrated with positional encoders, offer promise in capturing intricate relationships between soil and climate. Using the LUCAS database, this study compared four GNN operators in the positional encoder framework. Results revealed that the PESAGE and PETransformer models outperformed others in SOC estimation, indicating their potential in capturing the complex interplay between SOC and climate features. Our findings underscore the potential of GNN architectures in advancing SOC prediction, paving the way for future explorations with more advanced GNN models.

Authors: Weiying Zhao (Deep Planet); Natalia Efremova (Queen Mary University London)

Carbon Capture & Sequestration Computer Vision & Remote Sensing
NeurIPS 2023 Deep Glacier Image Velocimetry: Mapping glacier velocities from Sentinel-2 imagery with deep learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Glacier systems are highly sensitive to climate change and play a pivotal role in global mean sea level rise. As such, it is important to monitor how glacier velocities and ice dynamics evolve under a changing climate. The growing wealth of satellite observations has facilitated the inference of glacier velocities from remote sensing imagery through feature tracking algorithms. At present, these rely on sparse cross-correlation estimates as well as computationally expensive optical flow solutions. Here we present a novel use of deep-learning for estimating annual glacier velocities, utilizing the recurrent optical-flow based architecture, RAFT, on consecutive pairs of optical Sentinel-2 imagery. Our results highlight that deep learning can generate dense per-pixel velocity estimates within an automated framework that utilizes Sentinel-2 images over the French Alps.

Authors: James B Tlhomole (Imperial College London); Matthew Piggott (Imperial College London); Graham Hughes (Imperial College London)

Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2023 DeepEn2023: Energy Datasets for Edge Artificial Intelligence (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change poses one of the most significant challenges to humanity. As a result of these climatic shifts, the frequency of weather, climate, and water-related disasters has multiplied fivefold over the past 50 years, resulting in over 2$ million deaths and losses exceeding U.S. $3.64 trillion. Leveraging AI-powered technologies for sustainable development and combating climate change is a promising avenue. Numerous significant publications are dedicated to using AI to improve renewable energy forecasting, enhance waste management, and monitor environmental changes in real-time. However, very few research studies focus on making AI itself environmentally sustainable. This oversight regarding the sustainability of AI within the field might be attributed to a mindset gap and the absence of comprehensive energy datasets. In addition, with the ubiquity of edge AI systems and applications, especially on-device learning, there is a pressing need to measure, analyze, and optimize their environmental sustainability, such as energy efficiency. To this end, in this paper, we propose large-scale energy datasets for edge AI, named DeepEn2023, covering a wide range of kernels, state-of-the-art deep neural network models, and popular edge AI applications. We anticipate that DeepEn2023 will enhance transparency regarding sustainability in on-device deep learning across a range of edge AI systems and applications.

Authors: XIAOLONG TU (Georgia State University); Anik Mallik (University of North Carolina at Charlotte); Jiang Xie (University of North Carolina at Charlotte); Haoxin Wang (Georgia State University)

NeurIPS 2023 Interpretable machine learning approach to understand U.S. prevented planting events and project climate change impacts (Papers Track)
Abstract and authors: (click to expand)

Abstract: Extreme weather events in 2019 prevented U.S. farmers from planting crop on a record 19.4 million acres, more than double the previous record. Insurance reports show the majority of these were intended to be corn acres and prevented due to excess soil moisture and precipitation. However, we still lack a detailed understanding of how weather and soil conditions lead to prevented planting, as well as how climate change may impact future outcomes. Machine learning provides a promising approach to this challenge. Here, we model the drivers of prevented corn planting using soil characteristics, monthly weather conditions, and geospatial information. Due to the extreme nature of events causing prevented planting, we use a novel-design zero-inflated regression (ZIR) model that predicts the occurrence of prevented planting as well as the potential severity. We identify key environmental drivers of prevented planting, including May rainfall and soil drainage class. Under climate change scenarios, the model interestingly projects future instances of prevented planting to be less frequent but more severe relative to the historical period.

Authors: Haynes Stephens (University of Chicago)

Agriculture & Food
NeurIPS 2023 GraphTransformers for Geospatial Forecasting of Hurricane Trajectories (Papers Track)
Abstract and authors: (click to expand)

Abstract: In this paper we introduce a novel framework for trajectory prediction of geospatial sequences using GraphTransformers. When viewed across several sequences, we observed that a graph structure automatically emerges between different geospatial points that is often not taken into account for such sequence modeling tasks. We show that by leveraging this graph structure explicitly, geospatial trajectory prediction can be significantly improved. Our GraphTransformer approach improves upon state-of-the-art Transformer based baseline significantly on HURDAT, a dataset where we are interested in predicting the trajectory of a hurricane on a 6 hourly basis. This helps inform evacuation efforts by narrowing down target location by 10 to 20 kilometers along both the north-south and east-west directions.

Authors: Satyaki Chakraborty (Carnegie Mellon University); Pallavi Banerjee (University of Washington)

Disaster Management and Relief Climate Science & Modeling
NeurIPS 2023 Integrating Building Survey Data with Geospatial Data: A Cluster-Based Ethical Approach (Papers Track)
Abstract and authors: (click to expand)

Abstract: This research paper delves into the unique energy challenges faced by Alaska, arising from its remote geographical location, severe climatic conditions, and heavy reliance on fossil fuels while emphasizing the shortage of comprehensive building energy data. The study introduces an ethical framework that leverages machine learning and geospatial techniques to enable the large-scale integration of data, facilitating the mapping of energy consumption data at the individual building level. Utilizing the Alaska Retrofit Information System (ARIS) and the USA Structures datasets, this framework not only identifies and acknowledges limitations inherent in existing datasets but also establishes a robust ethical foundation for data integration. This framework innovation sets a noteworthy precedent for the responsible utilization of data in the domain of climate justice research, ultimately informing the development of sustainable energy policies through an enhanced understanding of building data and advancing ongoing research agendas. Future research directions involve the incorporation of recently released datasets, which provide precise building location data, thereby further validating the proposed ethical framework and advancing efforts in addressing Alaska's intricate energy challenges.

Authors: Vidisha Chowdhury (University of Pennsylvania); Gabriela Gongora-Svartzman (Carnegie Mellon University); Erin D Trochim (University of Alaska Fairbanks); Philippe Schicker (Carnegie Mellon University)

Climate Justice Interpretable ML
NeurIPS 2023 Breeding Programs Optimization with Reinforcement Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Crop breeding is crucial in improving agricultural productivity while potentially decreasing land usage, greenhouse gas emissions, and water consumption. However, breeding programs are challenging due to long turnover times, high-dimensional decision spaces, long-term objectives, and the need to adapt to rapid climate change. This paper introduces the use of Reinforcement Learning (RL) to optimize simulated crop breeding programs. RL agents are trained to make optimal crop selection and cross-breeding decisions based on genetic information. To benchmark RL-based breeding algorithms, we introduce a suite of Gym environments. The study demonstrates the superiority of RL techniques over standard practices in terms of genetic gain when simulated in silico using real-world genomic maize data.

Authors: Omar G. Younis (ETH Zurich); Luca Corinzia (ETH Zurich - Information Science & Engineering Group); Ioannis N Athanasiadis (Wageningen University and Research); Andreas Krause (ETH Zürich); Joachim Buhmann (ETH Zurich); Matteo Turchetta (ETH Zurich)

Reinforcement Learning Agriculture & Food
NeurIPS 2023 Discovering Effective Policies for Land-Use Planning (Papers Track) Best Pathway to Impact
Abstract and authors: (click to expand)

Abstract: How areas of land are allocated for different uses, such as forests, urban, and agriculture, has a large effect on carbon balance, and therefore climate change. Based on available historical data on changes in land use and a simulation of carbon emissions/absorption, a surrogate model can be learned that makes it possible to evaluate the different options available to decision-makers efficiently. An evolutionary search process can then be used to discover effective land-use policies for specific locations. Such a system was built on the Project Resilience platform and evaluated with the Land-Use Harmonization dataset and the BLUE simulator. It generates Pareto fronts that trade off carbon impact and amount of change customized to different locations, thus providing a potentially useful tool for land-use planning.

Authors: Risto Miikkulainen (UT Austin; Cognizant Technology Solutions); Olivier Francon (Cognizant AI Labs); Daniel Young (Cognizant AI Labs); Babak Hodjat (Cognizant AI Labs); Hugo Cunha (Cognizant AI Labs); Jacob Bieker (Open Climate Fix)

Forests
NeurIPS 2023 Probabilistic land cover modeling via deep autoregressive models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Land use and land cover (LULC) modeling is a challenging task due to long-range dependencies between geographic features and distinct spatial patterns related in topography, ecology, and human development. We explore the usage of a modified Pixel Constrained CNN as applied to inpainting for categorical image data from the National Land Cover Database for producing a diverse set of land use counterfactual scenarios. We find that this approach is effective for producing a distribution of realistic image completions in certain masking configurations. However, the resulting distribution is not well-calibrated in terms of spatial summary statistics commonly used with LULC data and exhibits substantial underdispersion.

Authors: Christopher Krapu (Duke University); Ryan Calder (Virginia Tech); Mark Borsuk (Duke University)

Forests Uncertainty Quantification & Robustness
NeurIPS 2023 Simulating the Air Quality Impact of Prescribed Fires Using a Graph Neural Network-Based PM2.5 Emissions Forecasting System (Papers Track)
Abstract and authors: (click to expand)

Abstract: The increasing size and severity of wildfires across western North America have generated dangerous levels of PM2.5 pollution in recent years. In a warming climate, expanding the use of prescribed fires is widely considered to be the most robust fire mitigation strategy. However, reliably forecasting the potential air quality impact from these prescribed fires, a critical ingredient in determining the fires’ location and time, at hourly to daily time scales remains a challenging problem. This paper proposes a novel integration of prescribed fire simulation with a spatio-temporal graph neural network-based PM2.5 forecasting model. The experiments in this work focus on determining the optimal time for implementing prescribed fires in California as well as quantifying the potential air quality trade-offs involved in conducting more prescribed fires outside the fire season.

Authors: Kyleen Liao (Saratoga High School); Jatan Buch (Columbia University); Kara D. Lamb (Columbia University); Pierre Gentine (Columbia University)

Earth Observation & Monitoring Generative Modeling
NeurIPS 2023 Hyperspectral shadow removal with iterative logistic regression and latent Parametric Linear Combination of Gaussians (Papers Track)
Abstract and authors: (click to expand)

Abstract: Shadow detection and removal is a challenging problem in the analysis of hyperspectral images. Yet, this step is crucial for analyzing data for remote sensing applications like methane detection. In this work, we develop a shadow detection and removal method only based on the spectrum of each pixel and the overall distribution of spectral values. We first introduce Iterative Logistic Regression(ILR) to learn a spectral basis in which shadows can be linearly classified. We then model the joint distribution of the mean radiance and the projection coefficients of the spectra onto the above basis as a parametric linear combination of Gaussians. We can then extract the maximum likelihood mixing parameter of the Gaussians to estimate the shadow coverage and to correct the shadowed spectra. Our correction scheme reduces correction artefacts at shadow borders. The shadow detection and removal method is applied to hyperspectral images from MethaneAIR, a precursor to the satellite MethaneSAT.

Authors: Core Francisco Park (Harvard University); Maya Nasr (Harvard University); Manuel Pérez-Carrasco (University of Concepcion); Eleanor Walker (Harvard University); Douglas Finkbeiner (Harvard University); Cecilia Garraffo (AstroAI at the Center for Astrophysics, Harvard & Smitnsonian)

Earth Observation & Monitoring Uncertainty Quantification & Robustness
NeurIPS 2023 Cooperative Logistics: Can Artificial Intelligence Enable Trustworthy Cooperation at Scale? (Papers Track)
Abstract and authors: (click to expand)

Abstract: Cooperative Logistics studies the setting where logistics companies pool their resources together to improve their individual performance. Prior literature suggests carbon savings of approximately 22%. If attained globally, this equates to 480,000,000 tonnes of CO2-eq. Whilst well-studied in operations research – industrial adoption remains limited due to a lack of trustworthy cooperation. A key remaining challenge is fair and scalable gain sharing (i.e., how much should each company be fairly paid?). We propose the use of deep reinforcement learning with a neural reward model for coalition structure generation and present early findings.

Authors: Stephen Mak (University of Cambridge); Tim Pearce (Microsoft Research); Matthew Macfarlane (University of Amsterdam); Liming Xu (University of Cambridge); Michael Ostroumov (Value Chain Lab); Alexandra Brintrup (University of Cambridge)

Transportation Reinforcement Learning
NeurIPS 2023 Resource Efficient and Generalizable Representation Learning of High-Dimensional Weather and Climate Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: We study self-supervised representation learning on high-dimensional data under resource constraints. Our work is motivated by applications of vision transformers to weather and climate data. Such data frequently comes in the form of tensors that are both higher dimensional and of larger size than the RGB imagery one encounters in many computer vision experiments. This raises scaling issues and brings up the need to leverage available compute resources efficiently. Motivated by results on masked autoencoders, we show that it is possible to use sampling of subtensors as the sole augmentation strategy for contrastive learning with a sampling ratio of $\sim$1\%. This is to be compared to typical masking ratios of $75\%$ or $90\%$ for image and video data respectively. In an ablation study, we explore extreme sampling ratios and find comparable skill for ratios as low as $\sim$0.0625\%. Pursuing efficiencies, we are finally investigating whether it is possible to generate robust embeddings on dimension values which were not present at training time. We answer this question to the positive by using learnable position encoders which have continuous dependence on dimension values.

Authors: Juan Nathaniel (Columbia University); Marcus Freitag (IBM); Patrick Curran (Environment and Climate Change Canada); Isabel Ruddick (Environment and Climate Change Canada); Johannes Schmude (IBM)

Unsupervised & Semi-Supervised Learning Meta- and Transfer Learning
NeurIPS 2023 Difference Learning for Air Quality Forecasting Transport Emulation (Papers Track)
Abstract and authors: (click to expand)

Abstract: Human health is negatively impacted by poor air quality including increased risk for respiratory and cardiovascular disease. Due to a recent increase in extreme air quality events, both globally and locally in the United States, finer resolution air quality forecasting guidance is needed to effectively adapt to these events. The National Oceanic and Atmospheric Administration provides air quality forecasting guidance for the Continental United States. Their air quality forecasting model is based on a 15 km spatial resolution; however, the goal is to reach a three km spatial resolution. This is currently not feasible due in part to prohibitive computational requirements for modeling the transport of chemical species. In this work, we describe a deep learning transport emulator that is able to reduce computations while maintaining skill comparable with the existing numerical model. We show how this method maintains skill in the presence of extreme air quality events, making it a potential candidate for operational use. We also explore evaluating how well this model maintains the physical properties of the modeled transport for a given set of species.

Authors: Reed R Chen (Johns Hopkins University Applied Physics Laboratory); Christopher Ribaudo (Johns Hopkins University Applied Physics Laboratory); Jennifer Sleeman (University of Maryland, Baltimore County and Johns Hopkins University Applied Physics Laboratory); Chace Ashcraft (JHU/APL); Marisa Hughes (JHU)

Extreme Weather
NeurIPS 2023 Fusion of Physics-Based Wildfire Spread Models with Satellite Data using Generative Algorithms (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change has driven increases in wildfire prevalence, prompting development of wildfire spread models. Advancements in the use of satellites to detect fire locations provides opportunity to enhance fire spread forecasts from numerical models via data assimilation. In this work, a method is developed to infer the history of a wildfire from satellite measurements using a conditional Wasserstein Generative Adversarial Network (cWGAN), providing the information necessary to initialize coupled atmosphere-wildfire models in a physics-informed approach based on measurements. The cWGAN, trained with solutions from WRF-SFIRE, produces samples of fire arrival times (fire history) from the conditional distribution of arrival times given satellite measurements, and allows for assessment of prediction uncertainty. The method is tested on four California wildfires and predictions are compared against measured fire perimeters and reported ignition times. An average Sorensen's coefficient of 0.81 for the fire perimeters and an average ignition time error of 32 minutes suggests that the method is highly accurate.

Authors: Bryan Shaddy (University of Southern California); Deep Ray (University of Maryland); Angel Farguell (San Jose State University); Valentina Calaza (University of Southern California); Jan Mandel (University of Colorado Denver); James Haley (Cooperative Institute for Research in the Atmosphere); Kyle Hilburn (Cooperative Institute for Research in the Atmosphere); Derek Mallia (University of Utah); Adam Kochanski (San Jose State University); Assad Oberai (University of Southern California)

Generative Modeling Extreme Weather
NeurIPS 2023 A 3D super-resolution of wind fields via physics-informed pixel-wise self-attention generative adversarial network (Papers Track)
Abstract and authors: (click to expand)

Abstract: To mitigate global warming, greenhouse gas sources need to be resolved at a high spatial resolution and monitored in time to ensure the reduction and ultimately elimination of the pollution source. However, the complexity of computation in resolving high-resolution wind fields left the simulations impractical to test different time lengths and model configurations. This study presents a preliminary development of a physics-informed super-resolution (SR) generative adversarial network (GAN) that super-resolves the three-dimensional (3D) low-resolution wind fields by upscaling x9 times. We develop a pixel-wise self-attention (PWA) module that learns 3D weather dynamics via a self-attention computation followed by a 2D convolution. We also employ a loss term that regularizes the self-attention map during pretraining, capturing the vertical convection process from input wind data. The new PWA SR-GAN shows the high-fidelity super-resolved 3D wind data, learns a wind structure at the high-frequency domain, and reduces the computational cost of a high-resolution wind simulation by x 89.7 times.

Authors: Takuya Kurihana (University of Chicago); Levente Klein (IBM Research); Kyongmin Yeo (IBM Research); Daniela Szwarcman (IBM Research); Bruce G Elmegreen (IBM Research); Surya Karthik Mukkavilli (IBM Research, Zurich); Johannes Schmude (IBM)

Generative Modeling Climate Science & Modeling
NeurIPS 2023 Proof-of-concept: Using ChatGPT to Translate and Modernize an Earth System Model from Fortran to Python/JAX (Papers Track)
Abstract and authors: (click to expand)

Abstract: Earth system models (ESMs) are vital for understanding past, present, and future climate, but they suffer from legacy technical infrastructure. ESMs are primarily implemented in Fortran, a language that poses a high barrier of entry for early career scientists and lacks a GPU runtime, which has become essential for continued advancement as GPU power increases and CPU scaling slows. Fortran also lacks differentiability — the capacity to differentiate through numerical code — which enables hybrid models that integrate machine learning methods. Converting an ESM from Fortran to Python/JAX could resolve these issues. This work presents a semi-automated method for translating individual model components from Fortran to Python/JAX using a large language model (GPT-4). By translating the photosynthesis model from the Community Earth System Model (CESM), we demonstrate that the Python/JAX version results in up to 100x faster runtimes using GPU parallelization, and enables parameter estimation via automatic differentiation. The Python code is also easy to read and run and could be used by instructors in the classroom. This work illustrates a path towards the ultimate goal of making climate models fast, inclusive, and differentiable.

Authors: Anthony Zhou (Columbia University), Linnia Hawkins (Columbia University), Pierre Gentine (Columbia University)

Climate Science & Modeling Natural Language Processing
NeurIPS 2023 Learning to forecast diagnostic parameters using pre-trained weather embedding (Papers Track)
Abstract and authors: (click to expand)

Abstract: Data-driven weather prediction (DDWP) models are increasingly becoming popular for weather forecasting. However, while operational weather forecasts predict a wide variety of weather variables, DDWPs currently forecast a specific set of key prognostic variables. Non-prognostic ("diagnostic") variables are sometimes modeled separately as dependent variables of the prognostic variables (c.f. FourCastNet \cite{pathak2022fourcastnet}), or by including the diagnostic variable as a target in the DDWP. However, the cost of training and deploying bespoke models for each diagnostic variable can increase dramatically with more diagnostic variables, and limit the operational use of such models. Likewise, retraining an entire DDWP each time a new diagnostic variable is added is also cost-prohibitive. We present an two-stage approach that allows new diagnostic variables to be added to an end-to-end DDWP model without the expensive retraining. In the first stage, we train an autoencoder that learns to embed prognostic variables into a latent space. In the second stage, the autoencoder is frozen and "downstream" models are trained to predict diagnostic variables using only the latent representations of prognostic variables as input. Our experiments indicate that models trained using the two-stage approach offer accuracy comparable to training bespoke models, while leading to significant reduction in resource utilization during training and inference. This approach allows for new "downstream" models to be developed as needed, without affecting existing models and thus reducing the friction in operationalizing new models.

Authors: Peetak Mitra (Excarta); Vivek Ramavajjala (Excarta)

Climate Science & Modeling Hybrid Physical Models
NeurIPS 2023 PressureML: Modelling Pressure Waves to Generate Large-Scale Water-Usage Insights in Buildings (Papers Track)
Abstract and authors: (click to expand)

Abstract: Several studies have indicated that delivering insights and feedback on water usage has been effective in curbing water consumption, making it a pivotal component in achieving long-term sustainability objectives. Despite a significant proportion of water consumption originating from large residential and commercial buildings, there is a scarcity of cost-effective and easy-to-integrate solutions that provide water usage insights in such structures. Furthermore, existing methods for disaggregating water usage necessitate training data and rely on frequent data sampling to capture patterns, both of which pose challenges when scaling up and adapting to new environments. In this work, we aim to solve these challenges through a novel end-to-end approach which records data from pressure sensors and uses time-series classification by DNN models to determine room-wise water consumption in a building. This consumption data is then fed to a novel water disaggregation algorithm which can suggest a set of water-usage events, and has a flexible requirement of training data and sampling granularity. We conduct experiments using our approach and demonstrate its potential as a promising avenue for in-depth exploration, offering valuable insights into water usage on a large scale.

Authors: Tanmaey Gupta (Microsoft Research India); Anupam Sobti (IIT Delhi); Akshay Nambi (Microsoft Research)

Buildings Meta- and Transfer Learning
NeurIPS 2023 Facilitating Battery Swapping Services for Freight Trucks with Spatial-Temporal Demand Prediction (Papers Track)
Abstract and authors: (click to expand)

Abstract: Electrifying heavy-duty trucks offers a substantial opportunity to curtail carbon emissions, advancing toward a carbon-neutral future. However, the inherent challenges of limited battery energy and the sheer weight of heavy-duty trucks lead to reduced mileage and prolonged charging durations. Consequently, battery-swapping services emerge as an attractive solution for these trucks. This paper employs a two-fold approach to investigate the potential and enhance the efficacy of such services. Firstly, spatial-temporal demand prediction models are adopted to predict the traffic patterns for the upcoming hours. Subsequently, the prediction guides an optimization module for efficient battery allocation and deployment. Analyzing the heavy-duty truck data on a highway network spanning over 2,500 miles, our model and analysis underscore the value of prediction/machine learning in facilitating future decision-makings. In particular, we find that the initial phase of implementing battery-swapping services favors mobile battery-swapping stations, but as the system matures, fixed-location stations are preferred.

Authors: Linyu Liu (Tsinghua University); Zhen Dai (Chongqing Expressway Group Company); Shiji Song (Department of Automation, Tsinghua University); Xiaocheng Li (Imperial College London); Guanting Chen (The University of North Carolina at Chapel Hill)

Transportation
NeurIPS 2023 Domain Adaptation for Sustainable Soil Management using Causal and Contrastive Constraint Minimization (Papers Track)
Abstract and authors: (click to expand)

Abstract: Monitoring organic matter is pivotal for maintaining soil health and can help inform sustainable soil management practices. While sensor-based soil information offers higher-fidelity and reliable insights into organic matter changes, sampling and measuring sensor data is cost-prohibitive. We propose a multi-modal, scalable framework that can estimate organic matter from remote sensing data, a more readily available data source while leveraging sparse soil information for improving generalization. Using the sensor data, we preserve underlying causal relations among sensor attributes and organic matter. Simultaneously we leverage inherent structure in the data and train the model to discriminate among domains using contrastive learning. This causal and contrastive constraint minimization ensures improved generalization and adaptation to other domains. We also shed light on the interpretability of the framework by identifying attributes that are important for improving generalization. Identifying these key soil attributes that affect organic matter will aid in efforts to standardize data collection efforts.

Authors: Somya Sharma (U. Minnesota); Swati Sharma (Microsoft Research); RAFAEL PADILHA (Microsoft Research); Emre Kiciman (Microsoft Research); Ranveer Chandra (Microsoft Research)

Carbon Capture & Sequestration Causal & Bayesian Methods
NeurIPS 2023 Inference of CO2 flow patterns--a feasibility study (Papers Track)
Abstract and authors: (click to expand)

Abstract: As the global deployment of carbon capture and sequestration (CCS) technology intensifies in the fight against climate change, it becomes increasingly imperative to establish robust monitoring and detection mechanisms for potential underground CO2 leakage, particularly through pre-existing or induced faults in the storage reservoir's seals. While techniques such as history matching and time-lapse seismic monitoring of CO2 storage have been used successfully in tracking the evolution of CO2 plumes in the subsurface, these methods lack principled approaches to characterize uncertainties related to the CO2 plumes' behavior. Inclusion of systematic assessment of uncertainties is essential for risk mitigation for the following reasons: (i) CO2 plume-induced changes are small and seismic data is noisy; (ii) changes between regular and irregular (e.g., caused by leakage) flow patterns are small; and (iii) the reservoir properties that control the flow are strongly heterogeneous and typically only available as distributions. To arrive at a formulation capable of inferring flow patterns for regular and irregular flow from well and seismic data, the performance of conditional normalizing flow will be analyzed on a series of carefully designed numerical experiments. While the inferences presented are preliminary in the context of an early CO2 leakage detection system, the results do indicate that inferences with conditional normalizing flows can produce high-fidelity estimates for CO2 plumes with or without leakage. We are also confident that the inferred uncertainty is reasonable because it correlates well with the observed errors. This uncertainty stems from noise in the seismic data and from the lack of precise knowledge of the reservoir's fluid flow properties.

Authors: Abhinav Prakash Gahlot (Georgia Institute of Technology); Huseyin Tuna Erdinc (Georgia Institute of Technology); Rafael Orozco (Georgia Institute of Technology); Ziyi Yin (Georgia Institute of Technology); Felix Herrmann (Georgia Institute of Technology)

Carbon Capture & Sequestration Generative Modeling
NeurIPS 2023 Generative Nowcasting of Marine Fog Visibility in the Grand Banks area and Sable Island in Canada (Papers Track)
Abstract and authors: (click to expand)

Abstract: This study presents the application of generative deep learning techniques to evaluate marine fog visibility nowcasting using the FATIMA (Fog and turbulence interactions in the marine atmosphere) campaign observations collected during July 2022 in the North Atlantic in the Grand Banks area and vicinity of Sable Island (SI), northeast of Canada. The measurements were collected using the Vaisala Forward Scatter Sensor model FD70 and Weather Transmitter model WXT50, and Gill R3A ultrasonic anemometer mounted on the Research Vessel Atlantic Condor. To perform nowcasting, the time series of fog visibility (Vis), wind speed, dew point depression, and relative humidity with respect to water were preprocessed to have lagged time step features. Generative nowcasting of Vis time series for lead times of 30 and 60 minutes were performed using conditional generative adversarial networks (cGAN) regression at visibility thresholds of Vis < 1 km and < 10 km. Extreme gradient boosting (XGBoost) was used as a baseline method for comparison against cGAN. At the 30 min lead time, Vis was best predicted with cGAN at Vis < 1 km (RMSE = 0.151 km) and with XGBoost at Vis < 10 km (RMSE = 2.821 km). At the 60 min lead time, Vis was best predicted with XGBoost at Vis < 1 km (RMSE = 0.167 km) and Vis < 10 km (RMSE = 3.508 km), but the cGAN RMSE was similar to XGBoost. Despite nowcasting Vis at 30 min being quite difficult, the ability of the cGAN model to track the variation in Vis at 1 km suggests that there is potential for generative analysis of marine fog visibility using observational meteorological parameters.

Authors: Eren Gultepe (Southern Illinois University Edwardsville); Sen Wang (University of Notre Dame); Byron Blomquist (NOAA); Harindra Fernando (University of Notre Dame); Patrick Kreidl (University of North Florida); David Delene (University of North Dakota); Ismail Gultepe (Ontario Tech University)

Climate Science & Modeling Generative Modeling
NeurIPS 2023 Data-Driven Traffic Reconstruction and Kernel Methods for Identifying Stop-and-Go Congestion (Papers Track)
Abstract and authors: (click to expand)

Abstract: Identifying stop-and-go events (SAGs) in traffic flow presents an important avenue for advancing data-driven research for climate change mitigation and sustainability, owing to their substantial impact on carbon emissions, travel time, fuel consumption, and roadway safety. In fact, SAGs are estimated to account for 33-50\% of highway driving externalities. However, insufficient attention has been paid to precisely quantifying where, when, and how much these SAGs take place– necessary for downstream decision making, such as intervention design and policy analysis. A key challenge is that the data available to researchers and governments are typically sparse and aggregated to a granularity that obscures SAGs. To overcome such data limitations, this study thus explores the use of traffic reconstruction techniques for SAG identification. In particular, we introduce a kernel-based method for identifying spatio-temporal features in traffic and leverage bootstrapping to quantify the uncertainty of the reconstruction process. Experimental results on California highway data demonstrate the promise of the method for capturing SAGs. This work contributes to a foundation for data-driven decision making to advance sustainability of traffic systems.

Authors: Edgar Ramirez Sanchez (MIT); Shreyaa Raghavan (MIT); Cathy Wu ()

Transportation Uncertainty Quantification & Robustness
NeurIPS 2023 AI assisted Search for Atmospheric CO2 Capture (Papers Track)
Abstract and authors: (click to expand)

Abstract: Carbon capture technologies is an important tool for mitigating climate change. In recent years, polymer membrane separation methods have emerged as a promising technology for separating CO2 and other green house gases from the atmosphere. Designing new polymers for such tasks is quite difficult. In this work we look at machine learning based methods to search for new polymer designs optimized for CO2 separation. An ensemble ML models is trained on a large database of molecules to predict permeabilities of CO2/N2 and CO2/O2 pairs. We then use search based optimization to discover new polymers that surpass existing polymer designs. Simulations are then done to verify the predicted performance of the new designs. Overall result suggests that ML based search can be used to discover new polymers optimized for carbon capture.

Authors: Shivashankar Shivashankar (Student)

Carbon Capture & Sequestration Reinforcement Learning
NeurIPS 2023 Towards Recommendations for Value Sensitive Sustainable Consumption (Papers Track)
Abstract and authors: (click to expand)

Abstract: Excessive consumption can strain natural resources, harm the environment, and widen societal gaps. While adopting a more sustainable lifestyle means making significant changes and potentially compromising personal desires, balancing sustainability with personal values poses a complex challenge. This article delves into designing recommender systems using neural networks and genetic algorithms, aiming to assist consumers in shopping sustainably without disregarding their individual preferences. We approach the search for good recommendations as a problem involving multiple objectives, representing diverse sustainability goals and personal values. While using a synthetic historical dataset based on real-world sources, our evaluations reveal substantial environmental benefits without demanding drastic personal sacrifices, even if consumers accept only a fraction of the recommendations.

Authors: Thomas Asikis (University of Zurich)

Recommender Systems Societal Adaptation & Resilience
NeurIPS 2023 Towards autonomous large-scale monitoring the health of urban trees using mobile sensing (Papers Track)
Abstract and authors: (click to expand)

Abstract: Healthy urban greenery is a fundamental asset to mitigate climate change phenomenons such as extreme heat and air pollution. However, urban trees are often affected by abiotic and biotic stressors that hamper their functionality, and whenever not timely managed, even their survival. The current visual or instrumented inspection techniques often require a high amount of human labor making frequent assessments infeasible at a city-wide scale. In this work, we present the GreenScan Project, a ground-based sensing system designed to provide health assessment of urban trees at high space-time resolutions, with low costs. The system utilises thermal and multi-spectral imaging sensors, fused using computer vision models to estimate two tree health indexes, namely NDVI and CTD. Preliminary evaluation of the system was performed through data collection experiments in Cambridge, USA. Overall, this work illustrates the potential of autonomous mobile ground-based tree health monitoring on city-wide scales at high temporal resolutions with low-costs.

Authors: Akshit Gupta (Delft University of Technology); Martine Rutten (Delft University of Technology); RANGA RAO VENKATESHA PRASAD (TUDelft); Remko Uijlenhoet (Delft University of Technology)

Cities & Urban Planning Computer Vision & Remote Sensing
NeurIPS 2023 Towards Global, General-Purpose Pretrained Geographic Location Encoders (Papers Track)
Abstract and authors: (click to expand)

Abstract: Location information is essential for modeling tasks in climate-related fields ranging from ecology to the Earth system sciences. However, obtaining meaningful location representation is challenging and requires a model to distill semantic location information from available data, such as remote sensing imagery. To address this challenge, we introduce SatCLIP, a global, general-purpose geographic location encoder that provides vector embeddings summarizing the characteristics of a given location for convenient usage in diverse downstream tasks. We show that SatCLIP embeddings, pretrained on multi-spectral Sentinel-2 satellite data, can be used for various predictive out-of-domain tasks, including temperature prediction and animal recognition in imagery, and outperform existing competing approaches. SatCLIP embeddings also prove helpful in overcoming geographic domain shift. This demonstrates the potential of general-purpose location encoders and opens the door to learning meaningful representations of our planet from the vast, varied, and largely untapped modalities of geospatial data.

Authors: Konstantin Klemmer (Microsoft Research); ESTHER ROLF (Google Research); Caleb Robinson (Microsoft AI for Good Research Lab); Lester Mackey (Microsoft Research); Marc Rußwurm (École Polytechnique Fédérale de Lausanne)

Unsupervised & Semi-Supervised Learning Computer Vision & Remote Sensing
NeurIPS 2023 Spatially Regularized Graph Attention Autoencoder Framework for Detecting Rainfall Extremes (Papers Track)
Abstract and authors: (click to expand)

Abstract: We introduce a novel Graph Attention Autoencoder (GAE) with spatial regularization to address the challenge of scalable anomaly detection in spatiotemporal rainfall data across India from 1990 to 2015. Our model leverages a Graph Attention Network (GAT) to capture spatial dependencies and temporal dynamics in the data, further enhanced by a spatial regularization term ensuring geographic coherence. We construct two graph datasets employing rainfall, pressure, and temperature attributes from the Indian Meteorological Department and ERA5 Reanalysis on Single Levels, respectively. Our network operates on graph representations of the data, where nodes represent geographic locations, and edges, inferred through event synchronization, denote significant co-occurrences of rainfall events. Through extensive experiments, we demonstrate that our GAE effectively identifies anomalous rainfall patterns across the Indian landscape. Our work paves the way for sophisticated spatiotemporal anomaly detection methodologies in climate science, contributing to better climate change preparedness and response strategies.

Authors: Mihir Agarwal (IIT Gandhinagar); Progyan Das (IIT Gandhinagar); Udit Bhatia (IIT Gandhinagar)

Climate Science & Modeling
NeurIPS 2023 Antarctic Bed Topography Super-Resolution via Transfer Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: High-fidelity topography models of the bedrock underneath the thick Antarctic ice sheet can improve scientists' understanding of ice flow and its contributions to sea level rise. However, the bed topography of Antarctica is one of the most challenging surfaces on earth to map, requiring aircrafts with ice penetrating radars to survey the vast and remote continent. We propose FROST, Fusion Regression for Optimal Subglacial Topography, a method that leverages readily available surface topography data from satellites as an auxiliary input modality for bed topography super-resolution. FROST uses a non-parametric Gaussian Process model to transfer local, non-stationary covariance patterns from the ice surface to the bedrock. In a controlled topography reconstruction experiment over complex East Antarctic terrain, our proposed method outperforms bicubic interpolation at all five tested magnification factors, reducing RMSE by 67% at x2, and 25% at x6 magnification. This work demonstrates the opportunity for data fusion methods to advance downstream climate modelling and steward climate change adaptation.

Authors: Kim Bente (The University of Sydney); Roman Marchant (University of Technology Sydney); Fabio Ramos (NVIDIA, The University of Sydney)

Computer Vision & Remote Sensing Climate Science & Modeling
NeurIPS 2023 Elucidating the Relationship Between Climate Change and Poverty using Graph Neural Networks, Ensemble Models, and Remote Sensing Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate and poverty are intrinsically related: regions with extreme temperatures, large temperature variability, and recurring extreme weather events tend to be ranked among the poorest and most vulnerable to climate change. Nevertheless, there currently is no established method to directly estimate the impact of specific climate variables on poverty and to identify geographical regions at high risk of being negatively affected by climate change. In this work, we propose a new approach based on Graph Neural Networks (GNNs) to estimate the effect of climate and remote sensing variables on poverty indicators measuring Education, Health, Living Standards, and Income. Furthermore, we use the trained models and perturbation analyses to identify the geographical regions most vulnerable to the potential variations in climate variables.

Authors: Parinthapat Pengpun (Bangkok Christian International School); Alessandro Salatiello (University of Tuebingen)

Climate Finance & Economics Earth Observation & Monitoring
NeurIPS 2023 Sustainability AI copilot: Analyze & ideate at scale to enable positive impact (Papers Track)
Abstract and authors: (click to expand)

Abstract: With the advances in large scale Foundation Models, web scale access to sustainability data, planetary scale satellite data, the opportunity for larger section of the world population to create positive climate impact can be activated by empowering everyone to ideate via AI copilots. The challenge is: How to enable more people to think & take action on climate & Sustainable Development goals?. We develop AI co-pilots to engage broader community for enabling impact at scale by democratizing climate thinking & ideation tools. We demonstrated how ideating with SAI transforms any seed idea into a holistic one, given the relation between climate & social economic aspects. SAI employs Language Models to represent the voice of the often neglected vulnerable people to the brainstorming discussion for inclusive climate action. We demonstrated how SAI can even create another AI that learns geospatial insights and offers advice to prevent humanitarian disasters from climate change. In this work, we conceptualized, designed, implemented & demonstrated Sustainability AI copilot (SAI) & innovated 4 use cases:- SAI enables sustainability enthusiasts to convert early stage budding thoughts into a robust holistic idea by creatively employing a chain of Large Language Models to think with six-thinking hats ideation. SAI can enables non-experts to become geospatial analysts by generating code to analyze planetary scale satellite data. SAI also ideates in multi-modal latent space to explore climate friendly product designs. SAI also enables human right activists to create awareness about inclusion of vulnerable and persons with disability in the climate conversation. SAI even creates AI apps for persons with disability. We demonstrated working prototypes at the project website, https://sites.google.com/view/climate-copilot . Thus, SAI co-pilot empowers everyone to come together to ideate to make progress on climate and related sustainable development goals.

Authors: Rajagopal A (Indian Institute of Technology); Nirmala V (Queen Marys); Immanuel Raja (Karunya University); Arun V (NIT)

Earth Observation & Monitoring Generative Modeling
NeurIPS 2023 Causality and Explainability for Trustworthy Integrated Pest Management (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Pesticides, widely used in agriculture for pest control, contribute to the climate crisis. Integrated pest management (IPM) is preferred as a climate-smart alternative. However, low adoption rates of IPM are observed due to farmers' skepticism about its effectiveness, so we introduce an enhancing data analysis framework for IPM to combat that. Our framework provides i) robust pest population predictions across diverse environments with invariant and causal learning, ii) interpretable pest presence predictions using transparent models, iii) actionable advice through counterfactual explanations for in-season IPM interventions, iv) field-specific treatment effect estimations, and v) causal inference to assess advice effectiveness.

Authors: Ilias Tsoumas (National Observatory of Athens); Vasileios Sitokonstantinou (University of Valencia); Georgios Giannarakis (National Observatory of Athens); Evagelia Lampiri (University of Thessaly); Christos Athanassiou (University of Thessaly); Gustau Camps-Valls (Universitat de València); Charalampos Kontoes (National Observatory of Athens); Ioannis N Athanasiadis (Wageningen University and Research)

Causal & Bayesian Methods Agriculture & Food
NeurIPS 2023 Monitoring Sustainable Global Development Along Shared Socioeconomic Pathways (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Sustainable global development is one of the most prevalent challenges facing the world today, hinging on the equilibrium between socioeconomic growth and environmental sustainability. We propose approaches to monitor and quantify sustainable development along the Shared Socioeconomic Pathways (SSPs), including mathematically derived scoring algorithms, and machine learning methods. These integrate socioeconomic and environmental datasets, to produce an interpretable metric for SSP alignment. An initial study demonstrates promising results, laying the groundwork for the application of different methods to the monitoring of sustainable global development.

Authors: Michelle Wan (University of Cambridge); Jeff Clark (University of Bristol); Edward Small (Royal Melbourne Institute of Technology); Elena Fillola (University of Bristol); Raul Santos Rodriguez (University of Bristol)

Public Policy Time-series Analysis
NeurIPS 2023 Understanding Climate Legislation Decisions with Machine Learning (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Effective action is crucial in order to avert climate disaster. Key in enacting change is the swift adoption of climate positive legislation which advocates for climate change mitigation and adaptation. This is because government legislation can result in far-reaching impact, due to the relationships between climate policy, technology, and market forces. To advocate for legislation, current strategies aim to identify potential levers and obstacles, presenting an opportunity for the application of recent advances in machine learning language models. Here we propose a machine learning pipeline to analyse climate legislation, aiming to investigate the feasibility of natural language processing for the classification of climate legislation texts, to predict policy voting outcomes. By providing a model of the decision making process, the proposed pipeline can enhance transparency and aid policy advocates and decision makers in understanding legislative decisions, thereby providing a tool to monitor and understand legislative decisions towards climate positive impact.

Authors: Jeff Clark (University of Bristol); Michelle Wan (University of Cambridge); Raul Santos Rodriguez (University of Bristol)

Public Policy Natural Language Processing
NeurIPS 2023 Mapping the Landscape of Artificial Intelligence in Climate Change Research: A Meta-Analysis on Impact and Applications (Proposals Track)
Abstract and authors: (click to expand)

Abstract: This proposal advocates a comprehensive and systematic analysis aimed at mapping and characterizing the intricate landscape of Artificial Intelligence and Machine Learning applications and their impacts within the domain of climate change research, both in adaption and mitigation efforts. Notably, a significant upswing in this interdisciplinary intersection has been observed since 2020. Utilizing advanced topic clustering techniques and qualitative analysis, we have discerned 12 distinct macro areas that supplement, enrich, and expand upon those identified in prior research. The primary objective of this undertaking is to furnish a data-rich panoramic view and informative insights regarding the functions and tools of the mentioned disciplines. Our intention is to offer valuable guidance to the scholarly community and propel further research endeavors, encouraging meticulous examinations of research trends and gaps in addressing the formidable challenges posed by climate change and the climate crisis.

Authors: Christian Burmester (Osnabrück University); Teresa Scantamburlo (UniversityofVenice)

Natural Language Processing Behavioral and Social Science
NeurIPS 2023 Unlocking the Potential of Renewable Energy Through Curtailment Prediction (Proposals Track)
Abstract and authors: (click to expand)

Abstract: A significant fraction (5-15%) of renewable energy generated goes into waste in the grids around the world today due to oversupply issues and transmission constraints. Being able to predict when and where renewable curtailment occurs would improve renewable utilization. The core of this work is to enable the machine learning community to help decarbonize electricity grids by unlocking the potential of renewable energy through curtailment prediction.

Authors: Bilge Acun (Meta / FAIR); Brent Morgan (Meta); Henry Richardson (WattTime); Nat Steinsultz (WattTime); Carole-Jean Wu (Meta / FAIR)

Power & Energy
NeurIPS 2023 Assessing data limitations in ML-based LCLU (Proposals Track)
Abstract and authors: (click to expand)

Abstract: This study addresses the accuracy challenge in Global Land Use and Land Cover (LULC) maps, crucial for policy making towards climate change mitigation. We evaluate two LULC products based on advanced machine learning techniques across two representative nations, Ecuador and Germany, employing a novel accuracy metric. The analysis unveils a notable accuracy enhancement in the convolutional neural network-based approach against the random forest model used for comparison. Our findings emphasize the potential of sophisticated machine learning methodologies in advancing LULC mapping accuracy, an essential stride towards data-driven, climate-relevant land management and policy decisions.

Authors: Angel Encalada-Davila (ESPOL); Christian Tutiven (ESPOL University); Jose E Cordova-Garcia (ESPOL)

Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2023 Predicting Adsorption Energies for Catalyst Screening with Transfer Learning Using Crystal Hamiltonian Graph Neural Network (Proposals Track)
Abstract and authors: (click to expand)

Abstract: As the world moves towards a clean energy future to mitigate the risks of climate change, the discovery of new catalyst materials plays a significant role in enabling the sustainable production and transformation of energy [2]. The development and verification of fast, accurate, and efficient artificial intelligence and machine learning techniques is critical to shortening time-intensive calculations, reducing costs, and improving computational feasibility. We propose applying the Crystal Hamiltonian Graph Neural Network (CHGNet) on the OC20 dataset in order to iteratively perform structure-to-energy and forces calculations and identify the lowest energy across relaxed structures for a given adsorbate-surface combination. CHGNet's predictions will be compared and benchmarked to corresponding values calculated by density functional theory (DFT) [7] and other models to determine its efficacy.

Authors: Angelina Chen (Foothill College/Lawrence Berkeley National Lab); Hui Zheng (Lawrence Berkeley National Lab); Paula Harder (Mila)

Meta- and Transfer Learning Chemistry & Materials
NeurIPS 2023 Physics-informed DeepONet for battery state prediction (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Electrification has emerged as a pivotal trend in the energy transition to address climate change, leading to a substantial surge in the demand for batteries. Accurately predicting the internal states and performance of batteries assumes paramount significance, as it ensures the safe and stable operation of batteries and informs decision-making processes, such as optimizing battery operation for arbitrage opportunities. However, current models struggle to strike a balance between precision and computational efficiency or are limited in their applicability to specific scenarios. We aim to adopt a physics-informed deep operator network (PI-DeepONet) for internal battery state estimation based on the rigorous P2D model, which can simultaneously achieve high precision and computational efficiency. Furthermore, it exhibits promising prospects for extension beyond lithium-ion batteries to encompass various battery technologies.

Authors: Keyan Guo (Peking University)

Power & Energy Hybrid Physical Models
NeurIPS 2023 Decarbonizing Maritime Operations: A Data-Driven Revolution (Proposals Track)
Abstract and authors: (click to expand)

Abstract: The maritime industry faces an unprecedented challenge in the form of decarbonization. With strict emissions reduction targets in place, the industry is turning to machine learning-based decision support models to achieve sustainability goals. This proposal explores the transformative potential of digitalization and machine learning approaches in maritime operations, from optimizing ship speeds to enhancing supply chain management. By examining various machine learning techniques, this work provides a roadmap for reducing emissions while improving operational efficiency in the maritime sector.

Authors: Ismail Bourzak (UQAR); Loubna Benabou (UQAR); Sara El Mekkaoui (DNV); Abdelaziz Berrado (EMI Engineering School)

Oceans & Marine Systems
NeurIPS 2023 High-resolution Global Building Emissions Estimation using Satellite Imagery (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Globally, buildings account for 30% of end-use energy consumption and 27% of energy sector emissions, and yet the building sector is lacking in low-temporal-latency, high-spatial-resolution data on energy consumption and resulting emissions. Existing methods tend to either have low resolution, high latency (often a year or more), or rely on data typically unavailable at scale (such as self-reported energy consumption). We propose a machine learning based bottom-up model that combines satellite-imagery-derived features to compute Scope 1 global emissions estimates both for residential and commercial buildings at a 1 square km resolution with monthly global updates.

Authors: Paul J Markakis (Duke University); Jordan Malof (University of Montana); Trey Gowdy (Duke University); Leslie Collins (Duke University); Aaron Davitt (WattTime); Gabriela Volpato (WattTime); Kyle Bradbury (Duke University)

Buildings Computer Vision & Remote Sensing
NeurIPS 2023 Sand Mining Watch: Leveraging Earth Observation Foundation Models to Inform Sustainable Development (Proposals Track)
Abstract and authors: (click to expand)

Abstract: As the major ingredient of concrete and asphalt, sand is vital to economic growth, and will play a key role in aiding the transition to a low carbon society. However, excessive and unregulated sand mining in the Global South has high socio-economic and environmental costs, and amplifies the effects of climate change. Sand mines are characterized by informality and high temporal variability, and data on the location and extent of these mines tends to be sparse. We propose to build custom sand-mine detection tools by fine-tuning foundation models for earth observation, which leverage self supervised learning - a cost-effective and powerful approach in sparse data regimes. Our preliminary results show that these methods outperform fully supervised approaches, with the best performing model achieving an average precision score of 0.57 for this challenging task. These tools allow for real-time monitoring of sand mining activity and can enable more effective policy and regulation, to inform sustainable development.

Authors: Ando Shah (UC Berkeley); Suraj R Nair (UC Berkeley); Tom Boehnel (TU Munich); Joshua Blumenstock (University of California, Berkeley)

Earth Observation & Monitoring Unsupervised & Semi-Supervised Learning
NeurIPS 2023 Understanding Insect Range Shifts with Out-of-Distribution Detection (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Climate change is inducing significant range shifts in insects and other organisms. Large-scale temporal data on populations and distributions are essential for quantifying the effects of climate change on biodiversity and ecosystem services, providing valuable insights for both conservation and pest management. With images from camera traps, we aim to use Mahalanobis distance-based confidence scores to automatically detect new moth species in a region. We intend to make out-of-distribution detection interpretable by identifying morphological characteristics of different species using Grad-CAM. We hope this algorithm will be a useful tool for entomologists to study range shifts and inform climate change adaptation.

Authors: Yuyan Chen (McGill University, Mila); David Rolnick (McGill University, Mila)

Ecosystems & Biodiversity Computer Vision & Remote Sensing
NeurIPS 2023 Aquaculture Mapping: Detecting and Classifying Aquaculture Ponds using Deep Learning (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: Mapping aquaculture ponds is critical for restoration, conservation, and climate adaptation efforts. Aquaculture can contribute to high levels of water pollution from untreated effluent and negatively impact coastal ecosystems. Large-scale aquaculture is also a significant driver in mangrove deforestation, thus reducing the world’s carbon sinks and exacerbating the effects of climate change. However, finding and mapping these ponds on the ground can be highly labor and time-intensive. Most existing automated techniques are focused only on spatial location and do not consider production intensification, which is also crucial to understanding their impact on the surrounding ecosystem. We can classify them into two main types: a) Extensive ponds, which are large, irregularly-shaped ponds that rely on natural productivity, and b) intensive ponds which are smaller and regularly shaped. Intensive ponds use machinery such as aerators that maximize production and also result in the characteristic presence of air bubbles on the pond’s surface. The features of these two types of ponds make them distinguishable and detectable from satellite imagery. In this tutorial, we will discuss types of aquaculture ponds in detail and demonstrate how they can be detected and classified using satellite imagery. The tutorial will introduce an open dataset of human-labeled aquaculture ponds in the Philippines and Indonesia. Using this dataset, the tutorial will use semantic segmentation to map out similar ponds over an entire country and classify them as either extensive or intensive, going through the entire process of i) satellite imagery retrieval, ii) preprocessing these images into a training-ready dataset, iii) model training, and iv) finally model rollout on a sample area. Throughout, the tutorial will leverage PyTorch Lightning, a machine learning framework that provides a simplified and streamlined interface for model experimentation and deployment. This tutorial aims to discuss the relevance of aquaculture ponds in climate adaptation and equip users with the necessary inputs and tools to perform their own ML-powered earth observation projects.

Authors: John Christian G Nacpil (Thinking Machines Data Science, Inc.); Joshua Cortez (Thinking Machines Data Science)

Computer Vision & Remote Sensing Earth Observation & Monitoring
NeurIPS 2023 Zero-Emission Vehicle Intelligence (ZEVi): Effectively Charging Electric Vehicles at Scale Without Breaking Power Systems (or the Bank) (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: Transportation contributes to 29% of all greenhouse gas (GHG) emissions in the US, of which 58% are from light-duty vehicles and 28% from medium-to-heavy duty vehicles (MHDVs) [1]. Battery electric vehicles (EVs) emit 90% less life cycle GHGs than their internal combustion engine (ICEV) counterparts [2], but currently only comprise 2% of all vehicles in the U.S. EVs thus represent a crucial step in decarbonizing road transportation. One major challenge in replacing ICEVs with EVs at scale is the ability to charge a large number of EVs within the constraints of power systems in a cost-effective way. This is an especially prominent problem for MHDVs used in commercial fleets such as shuttle buses and delivery trucks, as they generally require more energy to complete assigned trips compared to light-duty vehicles. In this tutorial, we describe the myriad challenges in charging EVs at scale and define common objectives such as minimizing total load on power systems, minimizing fleet operating costs, as well as maximizing vehicle state of charge and onsite photovoltaic energy usage. We discuss common constraints such as vehicle trip energy requirements, charging station power limits, and limits on vehicles’ time to charge between trips. We survey several different methods to formulate EV charging and energy dispatch as a mathematically solvable optimization problem, using tools such as convex optimization, Markov decision process (MDP), and reinforcement learning (RL). We introduce a commercial application of model-based predictive control (MPC) algorithm, ZEVi (Zero Emission Vehicle intelligence), which solves optimal energy dispatch strategies for charging sessions of commercial EV fleets. Using a synthetic dataset modeled after a real fleet of electric school buses, we engage the audience with a hands-on exercise applying ZEVi to find the optimal charging strategy for a commercial fleet. Lastly, we briefly discuss other contexts in which methods originating from process control and deep learning, like MPC and RL, can be applied to solve problems related to climate change mitigation and adaptation. With the examples provided in this tutorial, we hope to inspire the audience to come up with their own creative ways to apply these methods in different fields within the climate domain. References [1] EPA (2023). Inventory of U.S. Greenhouse Gas Emissions and Sinks: 1990-2021. U.S. Environmental Protection Agency, EPA 430-R-23-002. [2] Verma, S., Dwivedi, G., & Verma, P. (2022). Life cycle assessment of electric vehicles in comparison to combustion engine vehicles: A review. Materials Today: Proceedings, 49, 217-222.

Authors: Shasha Lin (NextEra Mobility); Jonathan Brophy (NextEra Mobility); Tamara Monge (NextEra Mobility); Jamie Hussman (NextEra Mobility); Michelle Lee (NextEra Mobility); Sam Penrose (NextEra Mobility)

Transportation Reinforcement Learning
NeurIPS 2023 Agile Modeling for Bioacoustic Monitoring (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: Bird, insect, and other wild animal populations are rapidly declining, highlighting the need for better monitoring, understanding, and protection of Earth’s remaining wild places. However, direct monitoring of biodiversity is difficult. Passive Acoustic Monitoring (PAM) enables detection of the vocalizing species in an ecosystem, many of which can be difficult or impossible to detect by satellite or camera trap. Large-scale PAM deployments using low-cost devices allow measuring changes over time and responses to environmental changes, and targeted deployments can discover and monitor endangered or invasive species. Machine learning methods are needed to analyze the thousands or even millions of hours of audio produced by large-scale deployments. But there are a massive number of potential signals to target for bioacoustic measurement, and many of the most interesting lack training data. Many rare species are difficult to observe. Detecting specific call-types and juvenile calls can give further insight into behavior and population health, but almost no structured datasets exist for these use-cases. No single classifier can address all of these needs, so practitioners regularly need to create new classifiers to address novel problems. Soundscape annotation efforts are very expensive, and machine learning experts are scarce, creating a bottleneck on analysis. We aim to eliminate the bottleneck by providing an efficient, self-contained active learning workflow for biologists. In this tutorial, we present an integrated workflow for analyzing large unlabeled bioacoustic datasets, adapting new agile modeling techniques to audio. Our goal is to allow experts to create a new high quality classifier for a novel class with under one hour of effort. We achieve this by leveraging transfer learning from high-quality bioacoustic models, vector search over audio databases, and lightweight Python notebook UX. The workflow can begin from a single example, proceeds through an efficient active learning loop, and finally applies the produced classifier to a large mass of unlabeled data to produce insights for ecologists and land managers.

Authors: tom denton (google); Jenny Hamer (Google Research); Rob Laber (Google)

Ecosystems & Biodiversity Active Learning
ICLR 2023 Tutorial: Quantus x Climate - Applying explainable AI evaluation in climate science (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: Explainable artificial intelligence (XAI) methods shed light on the predictions of deep neural networks (DNNs). In the climate context, XAI has been applied to improve and validate deep learning (DL) methods while providing researchers with new insight into physical processes. However, the evaluation, validation and selection of XAI methods are challenging due to often lacking ground truth explanations. In this tutorial, we introduce the XAI evaluation package Quantus to the climate community. We start by providing the users with pre-processed input and output data alongside a convolutional neural network (CNN) trained to assign yearly temperature maps to classes according to their decade. We explain the network prediction of an example temperature map using five different explanation techniques Gradient GradientShap, IntegratedGradients, LRP-z and Occlusion. By visually analyzing each explanation method around the North Atlantic (NA) cooling patch 10-80W, 20-60N, we provide a motivating example that shows that different explanations may disagree in their explained evidence which subsequently can lead to different scientific interpretation and potentially, misleading conclusions. We continue by introducing Quantus including the explanation properties that can be evaluated such as robustness, faithfulness, complexity, localization and randomization. We guide the participants towards a practical understanding of XAI evaluation by demonstrating how metrics differ in their scoring and interpretation. Moreover, we teach the participants to compare and select an appropriate XAI method by performing a comprehensive XAI evaluation. Lastly, we return to the motivating example, highlighting how Quantus can facilitate well-founded XAI research in climate science.

Authors: Philine L Bommer (TU Berlin); Anna Hedström (Technische Universität Berlin); Marlene Kretschmer (University of Reading); Marina M.-C. Höhne (TU Berlin)

Interpretable ML Climate Science & Modeling
ICLR 2023 CityLearn: A Tutorial on Reinforcement Learning Control for Grid-Interactive Efficient Buildings and Communities (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: Buildings are responsible for up to 75% of electricity consumption in the United States. Grid-Interactive Efficient Buildings can provide flexibility to solve the issue of power supply-demand mismatch, particularly brought about by renewables. Their high energy efficiency and self-generating capabilities can reduce demand without affecting the building function. Additionally, load shedding and shifting through smart control of storage systems can further flatten the load curve and reduce grid ramping cost in response to rapid decrease in renewable power supply. The model-free nature of reinforcement learning control makes it a promising approach for smart control in grid-interactive efficient buildings, as it can adapt to unique building needs and functions. However, a major challenge for the adoption of reinforcement learning in buildings is the ability to benchmark different control algorithms to accelerate their deployment on live systems. CityLearn is an open source OpenAI Gym environment for the implementation and benchmarking of simple and advanced control algorithms, e.g., rule-based control, model predictive control or deep reinforcement learning control thus, provides solutions to this challenge. This tutorial leverages CityLearn to demonstrate different control strategies in grid-interactive efficient buildings. Participants will learn how to design three controllers of varying complexity for battery management using a real-world residential neighborhood dataset to provide load shifting flexibility. The algorithms will be evaluated using six energy flexibility, environmental and economic key performance indicators, and their benefits and shortcomings will be identified. By the end of the tutorial, participants will acquire enough familiarity with the CityLearn environment for extended use in new datasets or personal projects.

Authors: Kingsley E Nweye (The University of Texas at Austin); Allen Wu (The University of Texas at Austin); Hyun Park (The University of Texas at Austin); Yara Almilaify (The University of Texas at Austin); Zoltan Nagy (The University of Texas at Austin)

Buildings Cities & Urban Planning Power & Energy Reinforcement Learning Time-series Analysis
ICLR 2023 Smart Meter Data Analytics: Practical Use-Cases and Best Practices of Machine Learning Applications for Energy Data in the Residential Sector (Tutorials Track)
Abstract and authors: (click to expand)

Abstract: To cope with climate change, the energy system is undergoing a massive transformation. With the electrification of all sectors, the power grid is facing high additional demand. As a result, the digitization of the grid is becoming more of a focus. The smart grid relies heavily on the increasing deployment of smart electricity meters around the world. The corresponding smart meter data is typically a time series of power or energy measurements with a resolution of 1s to 60 min. This data provides valuable insights and opportunities for monitoring and controlling activities in the power grid. In this tutorial, we therefore provide an overview of best practices for analyzing smart meter data. We focus on machine learning applications and low resolution (15-60 minutes) energy data in a residential setting. We only use real-world datasets and cover use-cases that are highly relevant for practical applications. Although this tutorial is specifically tailored to an audience from the energy domain, we believe that anyone from the data analytics and machine learning community can benefit from it, as many techniques are applicable to any time series data. Through our tutorial, we hope to foster new ideas, contribute to an interdisciplinary exchange between different research fields, and educate people about energy use.

Authors: Tobias Brudermueller (ETH Zurich); Markus Kreft (ETH Zurich)

Power & Energy Buildings Time-series Analysis
ICLR 2023 Mitigating climate and health impact of small-scale kiln industry using multi-spectral classifier and deep learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Industrial air pollution has a direct health impact and is a major contributor to climate change. Small scale industries particularly bull-trench brick kilns are one of the major causes of air pollution in South Asia often creating hazardous levels of smog that is injurious to human health. To mitigate the climate and health impact of the kiln industry, fine-grained kiln localization at different geographic locations is needed. Kiln localization using multi-spectral remote sensing data such as vegetation index results in a noisy estimates whereas use of high-resolution imagery is infeasible due to cost and compute complexities. This paper proposes a fusion of spatio-temporal multi-spectral data with high-resolution imagery for detection of brick kilns within the "Brick-Kiln-Belt" of South Asia. We first perform classification using low-resolution spatio-temporal multi-spectral data from Sentinel-2 imagery by combining vegetation, burn, build up and moisture indices. Then orientation aware object detector: YOLOv3 (with theta value) is implemented for removal of false detections and fine-grained localization. Our proposed technique, when compared with other benchmarks, results in a 21 times improvement in speed with comparable or higher accuracy when tested over multiple countries.

Authors: Usman Nazir (Lahore University of Management Sciences); Murtaza Taj (Lahore University of Management Sciences); Momin Uppal (Lahore University of Management Sciences); Sara khalid (University of Oxford)

Earth Observation & Monitoring Health
ICLR 2023 Global Flood Prediction: a Multimodal Machine Learning Approach (Papers Track)
Abstract and authors: (click to expand)

Abstract: Flooding is one of the most destructive and costly natural disasters, and climate changes would further increase risks globally. This work presents a novel mul- timodal machine learning approach for multi-year global flood risk prediction, combining geographical information and historical natural disaster dataset. Our multimodal framework employs state-of-the-art processing techniques to extract embeddings from each data modality, including text-based geographical data and tabular-based time-series data. Experiments demonstrate that a multimodal ap- proach, that is combining text and statistical data, outperforms a single-modality approach. Our most advanced architecture, employing embeddings extracted us- ing transfer learning upon DistilBert model, achieves 75%-77% ROCAUC score in predicting the next 1-5 year flooding event in historically flooded locations. This work demonstrates the potentials of using machine learning for long-term planning in natural disaster management

Authors: Cynthia Zeng (MIT); Dimitris Bertsimas (MIT)

Extreme Weather Disaster Management and Relief
ICLR 2023 Improving global high-resolution Earth system model simulations of precipitation with generative adversarial networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: Precipitation extremes are expected to become stronger and more frequent in response to anthropogenic global warming. Accurately projecting the ecological and socioeconomic impacts is an urgent task. Impact models are developed and calibrated with observation-based data but rely on Earth system model (ESM) output for future scenarios. ESMs, however, exhibit significant biases in their output because they cannot fully resolve complex cross-scale interactions of processes that produce precipitation cannot. State-of-the-art bias correction methods only address errors in the simulated frequency distributions, locally at every individual grid cell. Improving unrealistic spatial patterns of the ESM output, which would require spatial context, has not been possible so far. Here we show that a post-processing method based on physically constrained generative adversarial networks (GANs) can correct biases of a state-of-the-art global ESM both in local frequency distributions and in the spatial patterns at once. While our method improves local frequency distributions similarly well to a gold-standard ESM bias-adjustment framework, it strongly outperforms existing methods in correcting spatial patterns. Our study highlights the importance of physical constraints in neural networks for out-of-sample predictions in the context of climate change.

Authors: Philipp Hess (Technical University of Munich)

Generative Modeling Climate Science & Modeling Extreme Weather Computer Vision & Remote Sensing Hybrid Physical Models
ICLR 2023 Machine Learning for Advanced Building Construction (Papers Track)
Abstract and authors: (click to expand)

Abstract: High-efficiency retrofits can play a key role in reducing carbon emissions associated with buildings if processes can be scaled-up to reduce cost, time, and disruption. Here we demonstrate an artificial intelligence/computer vision (AI/CV)-enabled framework for converting exterior build scans and dimensional data directly into manufacturing and installation specifications for overclad panels. In our workflow point clouds associated with LiDAR-scanned buildings are segmented into a facade feature space, vectorized features are extracted using an iterative random-sampling consensus algorithm, and from this representation an optimal panel design plan satisfying manufacturing constraints is generated. This system and the corresponding construction process is demonstrated on a test facade structure constructed at the National Renewable Energy Laboratory (NREL). We also include a brief summary of a techno-economic study designed to estimate the potential energy and cost impact of this new system.

Authors: Hilary Egan (NREL); Clement Fouquet (Trimble Inc.); Chioke Harris (NREL)

Buildings
ICLR 2023 Coregistration of Satellite Image Time Series Through Alignment of Road Networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: Due to climate change, thawing permafrost affects transportation infrastructure in northern regions. Tracking deformations over time of these structures can allow identifying the most vulnerable sections to permafrost degradation and implement climate adaptation strategies. The Sentinel-2 mission provides data well-suited for multitemporal analysis due to its high temporal resolution and multispectral coverage. However, the geometrical misalignment of Sentinel-2 imagery makes this analysis challenging. Towards the goal of estimating the deformation of linear infrastructure in northern Canada, we propose an automatic subpixel coregistration algorithm for satellite image time series based on the matching of binary masks of roads produced by a deep learning model. We demonstrate the feasibility of achieving subpixel coregistration through alignment of roads on a small dataset of high-resolution Sentinel-2 images from the region of Gillam in northern Canada. This is the first step towards training a road deformation prediction model.

Authors: Andres Felipe Perez Murcia (University of Manitoba); Pooneh Maghoul (University of Manitoba); Ahmed Ashraf (University of Manitoba)

Earth Observation & Monitoring Transportation Computer Vision & Remote Sensing
ICLR 2023 Estimating Residential Solar Potential using Aerial Data (Papers Track)
Abstract and authors: (click to expand)

Abstract: Project Sunroof estimates the solar potential of residential buildings using high quality aerial data. That is, it estimates the potential solar energy (and associated financial savings) that can be captured by buildings if solar panels were to be installed on their roofs. Unfortunately its coverage is limited by the lack of high resolution digital surface map (DSM) data. We present a deep learning approach that bridges this gap by enhancing widely available low-resolution data, thereby dramatically increasing the coverage of Sunroof. We also present some ongoing efforts to potentially improve accuracy even further by replacing certain algorithmic components of Sunroof’s processing pipeline with deep learning.

Authors: Ross Goroshin (Google); Carl Elkin (Google)

Power & Energy Computer Vision & Remote Sensing
ICLR 2023 Improving extreme weather events detection with light-weight neural networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: To advance automated detection of extreme weather events, which are increasing in frequency and intensity with climate change, we explore modifications to a novel light-weight Context Guided convolutional neural network architecture trained for semantic segmentation of tropical cyclones and atmospheric rivers in climate data. Our primary focus is on tropical cyclones, the most destructive weather events, for which current models show limited performance. We investigate feature engineering, data augmentation, learning rate modifications, alternative loss functions, and architectural changes. In contrast to previous approaches optimizing for intersection over union, we specifically seek to improve recall to penalize under-counting and prioritize identification of tropical cyclones. We report success through the use of weighted loss functions to counter class imbalance for these rare events. We conclude with directions for future research on extreme weather events detection, a crucial task for prediction, mitigation, and equitable adaptation to the impacts of climate change.

Authors: Romain Lacombe (Stanford University); Hannah Grossman (Stanford); Lucas P Hendren (Stanford University); David Ludeke (Stanford University)

Extreme Weather Climate Science & Modeling Computer Vision & Remote Sensing
ICLR 2023 CaML: Carbon Footprinting of Products with Zero-Shot Semantic Text Similarity (Papers Track)
Abstract and authors: (click to expand)

Abstract: Estimating the embodied carbon in products is a key step towards understanding their impact, and undertaking mitigation actions. Precise carbon attribution is challenging at scale, requiring both domain expertise and granular supply chain data. As a first-order approximation, standard reports use Economic Input-Output based Life Cycle Assessment (EIO-LCA) which estimates carbon emissions per dollar at an industry sector level using transactions between different parts of the economy. For EIO-LCA, an expert needs to map each product to one of upwards of 1000 potential industry sectors. We present CaML, an algorithm to automate EIO-LCA using semantic text similarity matching by leveraging the text descriptions of the product and the industry sector. CaML outperforms the previous manually intensive method, yielding a MAPE of 22% with no domain labels.

Authors: Bharathan Balaji (Amazon); Venkata Sai Gargeya Vunnava (amazon); Geoffrey Guest (Amazon); Jared Kramer (Amazon)

Natural Language Processing Supply Chains
ICLR 2023 Graph Neural Network Generated Metal-Organic Frameworks for Carbon Capture (Proposals Track)
Abstract and authors: (click to expand)

Abstract: The level of carbon dioxide (CO2) in our atmosphere is rapidly rising and is projected to double today‘s levels to reach 1,000 ppm by 2100 under certain scenarios, primarily driven by anthropogenic sources. Technology that can capture CO2 from anthropogenic sources, remove from atmosphere and sequester it at the gigaton scale by 2050 is required stop and reverse the impact of climate change. Metal-organic frameworks (MOFs) have been a promising technology in various applications including gas separation as well as CO2 capture from point-source flue gases or removal from the atmosphere. MOFs offer unmatched surface area through their highly porous crystalline structure and MOF technology has potential to become a leading adsorption-based CO2 separation technology providing high surface area, structure stability and chemical tunability. Due to its complex structure, MOF crystal structure (atoms and bonds) cannot be easily represented in tabular format for machine learning (ML) applications whereas graph neural networks (GNN) have already been explored in representation of simpler chemical molecules. In addition to difficulty in MOF data representation, an infinite number of combinations can be created for MOF crystals, which makes ML applications more suitable to alleviate dependency on subject matter experts (SME) than conventional computational methods. In this work, we propose training of GNNs in variational autoencoder (VAE) setting to create an end-to-end workflow for the generation of new MOF crystal structures directly from the data within the crystallographic information files (CIFs) and conditioned by additional CO2 performance values.

Authors: Zikri Bayraktar (Schlumberger Doll Research); Shahnawaz Molla (Schlumberger Doll Research); Sharath Mahavadi (Schlumberger Doll Research)

Generative Modeling Chemistry & Materials
ICLR 2023 Long-lead forecasts of wintertime air stagnation index in southern China using oceanic memory effects (Papers Track)
Abstract and authors: (click to expand)

Abstract: Stagnant weather condition is one of the major contributors to air pollution as it is favorable for the formation and accumulation of pollutants. To measure the atmosphere’s ability to dilute air pollutants, Air Stagnation Index (ASI) has been introduced as an important meteorological index. Therefore, making long-lead ASI forecasts is vital to make plans in advance for air quality management. In this study, we found that autumn Niño indices derived from sea surface temperature (SST) anomalies show a negative correlation with wintertime ASI in southern China, offering prospects for a prewinter forecast. We developed an LSTM-based model to predict the future wintertime ASI. Results demonstrated that multivariate inputs (past ASI and Niño indices) achieve better forecast performance than univariate input (only past ASI). The model achieves a correlation coefficient of 0.778 between the actual and predicted ASI, exhibiting a high degree of consistency.

Authors: Chenhong Zhou (Hong Kong Baptist University); Xiaorui Zhang (Hong Kong Baptist University); Meng Gao (Hong Kong Baptist University); Shanshan Liu (University of science and technology of China); Yike Guo (Hong Kong University of Science and Technology); Jie Chen (Hong Kong Baptist University)

Climate Science & Modeling Extreme Weather Time-series Analysis
ICLR 2023 Attention-based Domain Adaptation Forecasting of Streamflow in Data-Sparse Regions (Papers Track)
Abstract and authors: (click to expand)

Abstract: Streamflow forecasts are critical to guide water resource management, mitigate drought and flood effects, and develop climate-smart infrastructure and governance. Many global regions, however, have limited streamflow observations to guide evidence-based management strategies. In this paper, we propose an attention-based domain adaptation streamflow forecaster for data-sparse regions. Our approach leverages the hydrological characteristics of a data-rich source domain to induce effective 24hr lead-time streamflow prediction in a data-constrained target domain. Specifically, we employ a deep-learning framework leveraging domain adaptation techniques to simultaneously train streamflow predictions and discern between both domains using an adversarial method. Experiments against baseline cross-domain forecasting models show improved performance for 24hr lead-time streamflow forecasting.

Authors: Roland R Oruche (University of Missouri-Columbia); Fearghal O'Donncha (IBM Research)

Meta- and Transfer Learning Disaster Management and Relief Time-series Analysis
ICLR 2023 Predicting Cycling Traffic in Cities: Is bike-sharing data representative of the cycling volume? (Proposals Track)
Abstract and authors: (click to expand)

Abstract: A higher share of cycling in cities can lead to a reduction in greenhouse gas emissions, a decrease in noise pollution, and personal health benefits. Data-driven approaches to planning new infrastructure to promote cycling are rare, mainly because data on cycling volume are only available selectively. By leveraging new and more granular data sources, we predict bicycle count measurements in Berlin, using data from free-floating bike-sharing systems in addition to weather, vacation, infrastructure, and socioeconomic indicators. To reach a high prediction accuracy given the diverse data, we make use of machine learning techniques. Our goal is to ultimately predict traffic volume on all streets beyond those with counters and to understand the variance in feature importance across time and space. Results indicate that bike-sharing data are valuable to improve the predictive performance, especially in cases with high outliers, and help generalize the models to new locations.

Authors: Silke K. Kaiser (Hertie School)

Transportation
ICLR 2023 Unsupervised machine learning techniques for multi-model comparison: A case study on Antarctic Intermediate Water in CMIP6 models (Papers Track)
Abstract and authors: (click to expand)

Abstract: The Climate Model Intercomparison Project provides access to ensembles of model experiments that are widely used to better understand past, present, and future climate changes. In this study, we use Principal Component Analysis and K-means and hierarchical clustering techniques to guide identification of models in the CMIP6 dataset that are best suited for specific modelling objectives. An example is discussed here that focuses on how CMIP6 models reproduce the physical properties of Antarctic Intermediate Water, a key feature of the global oceanic circulation and of the ocean-climate system, noting that the tools and methods introduced here can readily be extended to the analysis of other features and regions.

Authors: Ophelie Meuriot (Imperial College London); Yves Plancherel (Imperial College London); Veronica Nieves (University of Valencia)

Climate Science & Modeling Unsupervised & Semi-Supervised Learning
ICLR 2023 An automatic mobile approach for Tree DBH Estimation Using a Depth Map and a Regression Convolutional Neural Network (Papers Track)
Abstract and authors: (click to expand)

Abstract: Carbon credit programs finance projects to reduce emissions, remove pollutants, improve livelihoods, and protect natural ecosystems. Ensuring the quality and integrity of such projects is essential to their success. One of the most important variables used in nature-based solutions to measure carbon sequestration is the diameter at breast height (DBH) of trees. In this paper, we propose an automatic mobile computer vision method to estimate the DBH of a tree using a single depth map on a smartphone, along with our created dataset DepthMapDBH2023. We successfully demonstrated that this dataset paired with a lightweight regression convolutional neural network is able to accurately estimate the DBH of trees distinct in appearance, shape, number of tree forks, tree density and crowding, and vine presence. Automation of these measurements will help crews in the field who are collecting data for forest inventories. Gathering as much on-the-ground data as possible is required to ensure the transparency of carbon credit projects. Access to high-quality datasets of manual measurements helps improve biomass models which are widely used in the field of ecological simulation. The code used in this paper will be publicly available on Github and the dataset on Kaggle.

Authors: Margaux Masson-Forsythe (Earthshot Labs); Margaux Masson-Forsythe (Earthshot Labs)

Computer Vision & Remote Sensing Carbon Capture & Sequestration Climate Science & Modeling Forests Land Use
ICLR 2023 Safe Multi-Agent Reinforcement Learning for Price-Based Demand Response (Papers Track)
Abstract and authors: (click to expand)

Abstract: Price-based demand response (DR) enables households to provide the flexibility required in power grids with a high share of volatile renewable energy sources. Multi-agent reinforcement learning (MARL) offers a powerful, decentralized decision-making tool for autonomous agents participating in DR programs. Unfortunately, MARL algorithms do not naturally allow one to incorporate safety guarantees, preventing their real-world deployment. To meet safety constraints, we propose a safety layer that minimally adjusts each agent's decisions. We investigate the influence of using a reward function that reflects these safety adjustments. Results show that considering safety aspects in the reward during training improves both convergence speed and performance of the MARL agents in the investigated numerical experiments.

Authors: Hannah Markgraf (Technical University of Munich); Matthias Althoff (Technical University of Munich)

Reinforcement Learning Buildings Power & Energy
ICLR 2023 BurnMD: A Fire Projection and Mitigation Modeling Dataset (Papers Track)
Abstract and authors: (click to expand)

Abstract: Today's fire projection modeling tools struggle to keep up with the rapid rate and increasing severity of climate change, leaving disaster managers dependent on tools which are increasingly unrepresentative of complex interactions between fire behavior, environmental conditions, and various mitigation options. This has consequences for equitably minimizing wildfire risks to life, property, ecology, cultural heritage, and public health. Fortunately, decades of data exist for fuel populations, weather conditions, and outcomes of significant fires in the American West and globally. The fire management community faces a lack of data standardization and validation among many competing fire models. Likewise, the machine learning community lacks curated datasets and benchmarks to develop solutions necessary to generate impact in this space. We present a novel dataset composed of 308 medium sized fires from the years 2018-2021, complete with both time series airborne based inference and ground operational estimation of fire extent, and operational mitigation data such as control line construction. As the first large wildfire dataset with mitigation information, Burn Mitigation Dataset (BurnMD) will help advance fire projection modeling, fire risk modeling, and AI generated land management policies.

Authors: Marissa Dotter (MITRE Corporation)

Forests Climate Science & Modeling Computer Vision & Remote Sensing Time-series Analysis
ICLR 2023 MAHTM: A Multi-Agent Framework for Hierarchical Transactive Microgrids (Papers Track)
Abstract and authors: (click to expand)

Abstract: Integration of variable renewable energy into the grid has posed challenges to system operators in achieving optimal trade-offs among energy availability, cost affordability, and pollution controllability. This paper proposes a multi-agent reinforcement learning framework for managing energy transactions in microgrids. The framework addresses the challenges above: it seeks to optimize the usage of available resources by minimizing the carbon footprint while benefiting all stakeholders. The proposed architecture consists of three layers of agents, each pursuing different objectives. The first layer, comprised of prosumers and consumers, minimizes the total energy cost. The other two layers control the energy price to decrease the carbon impact while balancing the consumption and production of both renewable and conventional energy. This framework also takes into account fluctuations in energy demand and supply.

Authors: Nicolas M Cuadrado (MBZUAI); Roberto Alejandro Gutierrez Guillen (MBZUAI); Yongli Zhu (Texas A&M University); Martin Takac (Mohamed bin Zayed University of Artificial Intelligence)

Power & Energy Reinforcement Learning
ICLR 2023 Improving a Shoreline Forecasting Model with Symbolic Regression (Papers Track)
Abstract and authors: (click to expand)

Abstract: Given the current context of climate change and the increasing population densities at coastal zones around the globe, there is an increasing need to be able to predict the development of our coasts. Recent advances in artificial intelligence allow for automatic analysis of observational data. Symbolic Regression (SR) is a type of Machine Learning algorithm that aims to find interpretable symbolic expressions that can explain relations in the data. In this work, we aim to study the problem of forecasting shoreline change using SR. We make use of Cartesian Genetic Programming (CGP) in order to encode and improve upon ShoreFor, a physical shoreline prediction model. During training, CGP individuals are evaluated and selected according to their predictive score at five different coastal sites. This work presents a comparison between a CGP-evolved model and the base ShoreFor model. In addition to evolution's ability to produce well-performing models, it demonstrates the usefulness of SR as a research tool to gain insight into the behaviors of shorelines in various geographical zones.

Authors: Mahmoud AL NAJAR (Laboratory of Spatial Geophysics and Oceanography Studies); Rafael ALMAR (Laboratory of Spatial Geophysics and Oceanography Studies); Erwin BERGSMA (CNES); Jean-Marc DELVIT (CNES); Dennis Wilson (ISAE)

Interpretable ML Earth Observation & Monitoring Extreme Weather Hybrid Physical Models Time-series Analysis
ICLR 2023 Remote Control: Debiasing Remote Sensing Predictions for Causal Inference (Papers Track)
Abstract and authors: (click to expand)

Abstract: Understanding and properly estimating the impacts of environmental interventions is of critical importance as we work towards achieving global climate goals. Remote sensing has become an essential tool for evaluating when and where climate policies have positive impacts on factors like greenhouse gas emissions and carbon sequestration. However, when machine learning models trained to predict outcomes using remotely sensed data simply minimize a standard loss function, the predictions that they generate can produce biased estimates in downstream causal inference. If prediction error in the outcome variable is correlated with policy variables or important confounders, as is the case for many widely used remote sensing data sets, estimates of the causal impacts of policies can be biased. In this paper, we demonstrate how this bias can arise, and we propose the use of an adversarial debiasing model (Zhang, Lemoine, and Mitchell 2018) in order to correct the issue when using satellite data to generate machine learning predictions for use in causal inference. We apply this method to a case study of the relationship between roads and tree cover in West Africa, where our results indicate that adversarial debiasing can recover a much more accurate estimate of the parameter of interest compared to when the standard approach is used.

Authors: Matthew Gordon (Yale); Megan Ayers (Yale University); Eliana Stone (Yale School of the Environment); Luke C Sanford (Yale School of the Environment)

Forests Computer Vision & Remote Sensing Land Use
ICLR 2023 A simplified machine learning based wildfire ignition model from insurance perspective (Papers Track)
Abstract and authors: (click to expand)

Abstract: In the context of climate change, wildfires are becoming more frequent, intense, and prolonged in the western US, particularly in California. Wildfires cause catastrophic socio-economic losses and are projected to worsen in the near future. Inaccurate estimates of fire risk put further pressure on wildfire (re)insurance and cause many homes to lose wildfire insurance coverage. Efficient and effective prediction of fire ignition is one step towards better fire risk assessment. Here we present a simplified machine learning-based fire ignition model at yearly scale that is well suited to the use case of one-year term wildfire (re)insurance. Our model yields a recall, precision, and the area under the precision-recall curve of 0.69, 0.86 and 0.81, respectively, for California, and significantly higher values of 0.82, 0.90 and 0.90, respectively, for the populated area, indicating its good performance. In addition, our model feature analysis reveals that power line density, enhanced vegetation index (EVI), vegetation optical depth (VOD), and distance to the wildland-urban interface stand out as the most important features determining ignitions. The framework of this simplified ignition model could easily be applied to other regions or genesis of other perils like hurricane, and it paves the road to a broader and more affordable safety net for homeowners.

Authors: Yaling Liu (OurKettle Inc); Son Le (OurKettle Inc.); Yufei Zou (Our Kettle, Inc.); mojtaba Sadgedhi (OurKettle Inc.); Yang Chen (University of California, Irvine); Niels Andela (BeZero Carbon); Pierre Gentine (Columbia University)

Earth Observation & Monitoring Climate Science & Modeling Societal Adaptation & Resilience Interpretable ML
ICLR 2023 Nested Fourier Neural Operator for Basin-Scale 4D CO2 Storage Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: Carbon capture and storage (CCS) plays an essential role in global decarbonization. Scaling up CCS requires accurate and high-resolution modeling of the storage reservoir pressure buildup and the gaseous plume migration. However, such modeling is very challenging at scale due to the high computational costs of existing numerical methods. This challenge leads to significant uncertainty in evaluating storage opportunities which can delay the pace of global CCS deployments. We introduce a machine-learning approach for dynamic basin-scale modeling that speeds up flow prediction nearly 700,000 times compared to existing methods. Our framework, Nested Fourier Neural Operator (FNO), provides a general-purpose simulator alternative under diverse reservoir conditions, geological heterogeneity, and injection schemes. It enables unprecedented real-time high-fidelity modeling to support decision-making in basin-scale CCS projects.

Authors: Gege Wen (Stanford University)

Carbon Capture & Sequestration Climate Science & Modeling
ICLR 2023 SEA LEVEL PROJECTIONS WITH MACHINE LEARNING USING ALTIMETRY AND CLIMATE MODEL ENSEMBLES (Papers Track)
Abstract and authors: (click to expand)

Abstract: Satellite altimeter observations retrieved since 1993 show that the global mean sea level is rising at an unprecedented rate (3.4mm/year). With almost three decades of observations, we can now investigate the contributions of anthropogenic climate-change signals such as greenhouse gases, aerosols, and biomass burning in this rising sea level. We use machine learning (ML) to investigate future patterns of sea level change. To understand the extent of contributions from the climate-change signals, and to help in forecasting sea level change in the future, we turn to climate model simulations. This work presents a machine learning framework that exploits both satellite observations and climate model simulations to generate sea level rise projections at a 2-degree resolution spatial grid, 30 years into the future. We train fully connected neural networks (FCNNs) to predict altimeter values through a non-linear fusion of the climate model hindcasts (for 1993-2019). The learned FCNNs are then applied to future climate model projections to predict future sea level patterns. We propose segmenting our spatial dataset into meaningful clusters and show that clustering helps to improve predictions of our ML model.

Authors: Saumya Sinha (University of Colorado, Boulder); John Fasullo (NCAR); R. Steven Nerem (Univesity of Colorado, Boulder); Claire Monteleoni (University of Colorado Boulder)

Climate Science & Modeling Oceans & Marine Systems
ICLR 2023 Global-Local Policy Search and Its Application in Grid-Interactive Building Control (Papers Track)
Abstract and authors: (click to expand)

Abstract: As the buildings sector represents over 70% of the total U.S. electricity consumption, it offers a great amount of untapped demand-side resources to tackle many critical grid-side problems and improve the overall energy system's efficiency. To help make buildings grid-interactive, this paper proposes a global-local policy search method to train a reinforcement learning (RL) based controller which optimizes building operation during both normal hours and demand response (DR) events. Experiments on a simulated five-zone commercial building demonstrate that by adding a local fine-tuning stage to the evolution strategy policy training process, the control costs can be further reduced by 7.55% in unseen testing scenarios. Baseline comparison also indicates that the learned RL controller outperforms a pragmatic linear model predictive controller (MPC), while not requiring intensive online computation.

Authors: Xiangyu Zhang (National Renewable Energy Laboratory); Yue Chen (National Renewable Energy Laboratory); Andrey Bernstein (NREL)

Buildings Power & Energy Reinforcement Learning
ICLR 2023 Accuracy is not the only Metric that matters: Estimating the Energy Consumption of Deep Learning Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Modern machine learning models have started to consume incredible amounts of energy, thus incurring large carbon footprints (Strubell et al., 2019). To address this issue, we have created an energy estimation pipeline, which allows practitioners to estimate the energy needs of their models in advance, without actually running or training them. We accomplished this, by collecting high-quality energy data and building a first baseline model, capable of predicting the energy consumption of DL models by accumulating their estimated layer-wise energies.

Authors: Johannes Getzner (Technical University of Munich); Bertrand Charpentier (Technical University of Munich); Stephan Günnemann (Technical University of Munich)

Power & Energy Data Mining
ICLR 2023 Topology Estimation from Voltage Edge Sensing for Resource-Constrained Grids (Papers Track)
Abstract and authors: (click to expand)

Abstract: Electric grids are the conduit for renewable energy delivery and will play a crucial role in mitigating climate change. To do so successfully in resource-constrained low- and middle-income countries (LMICs), increasing operational efficiency is key. Such efficiency demands evolving knowledge of the grid’s state, of which topology---how points on the network are physically connected---is fundamental. In LMICs, knowledge of distribution topology is limited and established methods for topology estimation rely on expensive sensing infrastructure, such as smart meters or PMUs, that are inaccessible at scale. This paper lays the foundation for topology estimation from more accessible data: outlet-level voltage magnitude measurements. It presents a graph-based algorithm and explanatory visualization using the Fielder vector for estimating and communicating topological proximity from this data. We demonstrate the method on a real dataset collected in Accra, Ghana, thus opening the possibility of globally accessible, cutting-edge grid monitoring through non-traditional sensing strategies coupled with ML.

Authors: Mohini S Bariya (nLine); Genevieve Flaspohler (nLine)

Unsupervised & Semi-Supervised Learning Climate Justice Time-series Analysis
ICLR 2023 Chasing Low-Carbon Electricity for Practical and Sustainable DNN Training (Papers Track)
Abstract and authors: (click to expand)

Abstract: Deep learning has experienced significant growth in recent years, resulting in increased energy consumption and carbon emission from the use of GPUs for training deep neural networks (DNNs). Answering the call for sustainability, conventional solutions have attempted to move training jobs to locations or time frames with lower carbon intensity. However, moving jobs to other locations may not always be feasible due to large dataset sizes or data regulations. Moreover, postponing training can negatively impact application service quality because the DNNs backing the service are not updated in a timely fashion. In this work, we present a practical solution that reduces the carbon footprint of DNN training without migrating or postponing jobs. Specifically, our solution observes real-time carbon intensity shifts during training and controls the energy consumption of GPUs, thereby reducing carbon footprint while maintaining training performance. Furthermore, in order to proactively adapt to shifting carbon intensity, we propose a lightweight machine learning algorithm that predicts the carbon intensity of the upcoming time frame. Our solution, Chase, reduces the total carbon footprint of training ResNet-50 on ImageNet by 13.6% while only increasing training time by 2.5%.

Authors: Zhenning Yang (University of Michigan, Ann Arbor); Luoxi Meng (University of Michigan, Ann Arbor); Jae-Won Chung (University of Michigan, Ann Arbor); Mosharaf Chowdhury (University of Michigan, Ann Arbor)

Time-series Analysis
ICLR 2023 SOLAR PANEL MAPPING VIA ORIENTED OBJECT DETECTION (Papers Track)
Abstract and authors: (click to expand)

Abstract: Maintaining the integrity of solar power plants is a vital component in dealing with the current climate crisis. This process begins with analysts creating a de- tailed map of a plant with the coordinates of every solar panel, making it possible to quickly locate and mitigate potential faulty solar panels. However, this task is extremely tedious and is not scalable for the ever increasing capacity of so- lar power across the globe. Therefore, we propose an end-to-end deep learning framework for detecting individual solar panels using a rotated object detection architecture. We evaluate our approach on a diverse dataset of solar power plants collected from across the United States and report a mAP score of 83.3%.

Authors: Conor Wallace (DroneBase); Isaac Corley (University of Texas at San Antonio); Jonathan Lwowski (DroneBase)

Computer Vision & Remote Sensing
ICLR 2023 Disentangling observation biases to monitor spatio-temporal shifts in species distributions (Proposals Track)
Abstract and authors: (click to expand)

Abstract: The accelerated pace of environmental change due to anthropogenic activities makes it more important than ever to understand current and future ecosystem dynamics at a global scale. Species observations stemming from citizen science platforms are increasingly leveraged to gather information about the geographic distributions of many species. However, their usability is limited by the strong biases inherent to these community-driven efforts. These biases in the sampling effort are often treated as noise that has to be compensated for. In this project, we posit that better modelling the sampling effort (including the usage of the different platforms across countries, local accessibility, attractiveness of the location for platform users, affinity of different user groups for different species, etc.) is the key towards improving Species Distribution Models (SDM) using observations from citizen science platforms, thus opening up the possibility of leveraging them to monitor changes in species distributions and population densities.

Authors: Diego Marcos (Inria); Christophe Botella (); Ilan Havinga (Wageningen University); Dino Ienco (INRAE); Cassio F. Dantas (TETIS, INRAE, Univ Montpellier); Pierre Alliez (INRIA Sophie-Antipolis, France); Alexis Joly (INRIA, FR)

Causal & Bayesian Methods Earth Observation & Monitoring Computer Vision & Remote Sensing Hybrid Physical Models
ICLR 2023 Mapping global innovation networks around clean energy technologies (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Reaching net zero emissions requires rapid innovation and scale-up of clean tech. In this context, clean tech innovation networks (CTINs) can play a crucial role by pooling necessary resources and competences and enabling knowledge transfers between different actors. However, existing evidence on CTINs is limited due to a lack of comprehensive data. Here, we develop a machine learning framework to identify CTINs from announcements on social media to map the global CTIN landscape. Specifically, we classify the social media announcements regarding the type of technology (e.g., hydrogen, solar), interaction type (e.g., equity investment, R\&D collaboration), and status (e.g., commencement, update). We then extract referenced organizations via entity recognition. Thereby, we generate a large-scale dataset of CTINs across different technologies, countries, and over time. This allows us to compare characteristics of CTINs, such as the geographic proximity of actors, and to investigate the association between network evolution and technology innovation and diffusion. As a direct implication, our work helps policy makers to promote CTINs by identifying current barriers and needs.

Authors: Malte Toetzke (ETH Zurich); Francesco Re (ETH Zurich); Benedict Probst (ETH Zurich); Stefan Feuerriegel (LMU Munich); Laura Diaz Anadon (University of Cambridge); Volker Hoffmann (ETH Zurich)

Climate Finance & Economics Supply Chains Natural Language Processing
ICLR 2023 Widespread increases in future wildfire risk to global forest carbon offset projects revealed by explainable AI (Papers Track)
Abstract and authors: (click to expand)

Abstract: Carbon offset programs are critical in the fight against climate change. One emerging threat to the long-term stability and viability of forest carbon offset projects is wildfires, which can release large amounts of carbon and limit the efficacy of associated offsetting credits. However, analysis of wildfire risk to forest carbon projects is challenging because existing models for forecasting long-term fire risk are limited in predictive accuracy. Therefore, we propose an explainable artificial intelligence (XAI) model trained on 7 million global satellite wildfire observations. Validation results suggest substantial potential for high resolution, enhanced accuracy projections of global wildfire risk, and the model outperforms the U.S. National Center for Atmospheric Research's leading fire model. Applied to a collection of 190 global forest carbon projects, we find that fire exposure is projected to increase 55% [37-76%] by 2080 under a mid-range scenario (SSP2-4.5). Our results indicate the large wildfire carbon project damages seen in the past decade are likely to become more frequent as forests become hotter and drier. In response, we hope the model can support wildfire managers, policymakers, and carbon market analysts to preemptively quantify and mitigate long-term permanence risks to forest carbon projects.

Authors: Tristan Ballard (Sust Inc); Gopal Erinjippurath (Sust Global); Matthew W Cooper (Sust Global); Chris Lowrie (Sust Global)

Climate Finance & Economics Forests Interpretable ML Land Use
ICLR 2023 A High-Resolution, Data-Driven Model of Urban Carbon Emissions (Papers Track) Best Pathway to Impact
Abstract and authors: (click to expand)

Abstract: Cities represent both a fundamental contributor to greenhouse (GHG) emissions and a catalyst for climate action. Many global cities have outlined sustainability and climate change mitigation plans, focusing on energy efficiency, shifting away from fossil fuels, and prioritizing environmental and social justice. To achieve broad-based and equitable carbon emissions reductions and sustainability goals, new data-driven methodologies are needed to identify and target efficiency and carbon reduction opportunities in the built environment at the building, neighborhood, and city-scale. Our methodology integrates data from numerous data sources and develops data-driven and physical models of energy use and carbon emissions from buildings and transportation to generate a high spatiotemporal resolution model of urban greenhouse gas emissions. The method and data tool are designed to support city leaders and urban policymakers with an unprecedented view of localized carbon emissions to enable data-driven and evidenced-based climate action.

Authors: Bartosz Bonczak (New York University); Boyeong Hong (New York University); Constantine E. Kontokosta (New York University)

Hybrid Physical Models Climate Science & Modeling Data Mining
ICLR 2023 ClimaX: A foundation model for weather and climate (Papers Track)
Abstract and authors: (click to expand)

Abstract: Recent data-driven approaches based on machine learning aim to directly solve a downstream forecasting or projection task by learning a data-driven functional mapping using deep neural networks. However, these networks are trained using curated and homogeneous climate datasets for specific spatiotemporal tasks, and thus lack the generality of currently used physics-informed numerical models for weather and climate modeling. We develop and demonstrate ClimaX, a flexible and generalizable deep learning model for weather and climate science that can be trained using heterogeneous datasets spanning different variables, spatiotemporal coverage, and physical groundings. ClimaX extends the Transformer architecture with novel encoding and aggregation blocks that allow effective use of available compute and data while maintaining general utility. ClimaX is pre-trained with a self-supervised learning objective on climate datasets derived from CMIP6. The pretrained ClimaX can then be fine-tuned to address a breadth of climate and weather tasks, including those that involve atmospheric variables and spatiotemporal scales unseen during pretraining. Compared to existing data-driven baselines, we show that this generality in ClimaX results in superior performance on benchmarks for weather forecasting and climate projections.

Authors: Tung Nguyen (University of California, Los Angeles); Johannes Brandstetter (Microsoft Research); Ashish Kapoor (Microsoft); Jayesh Gupta (Microsoft Research); Aditya Grover (UCLA)

Climate Science & Modeling Meta- and Transfer Learning
ICLR 2023 Learning to Communicate and Collaborate in a Competitive Multi-Agent Setup to Clean the Ocean from Macroplastics (Papers Track)
Abstract and authors: (click to expand)

Abstract: Finding a balance between collaboration and competition is crucial for artificial agents in many real-world applications. We investigate this using a Multi-Agent Reinforcement Learning (MARL) setup on the back of a high-impact problem. The accumulation and yearly growth of plastic in the ocean cause irreparable damage to many aspects of oceanic health and the marina system. To prevent further damage, we need to find ways to reduce macroplastics from known plastic patches in the ocean. Here we propose a Graph Neural Network (GNN) based communication mechanism that increases the agents' observation space. In our custom environment, agents control a plastic collecting vessel. The communication mechanism enables agents to develop a communication protocol using a binary signal. While the goal of the agent collective is to clean up as much as possible, agents are rewarded for the individual amount of macroplastics collected. Hence agents have to learn to communicate effectively while maintaining high individual performance. We compare our proposed communication mechanism with a multi-agent baseline without the ability to communicate. Results show communication enables collaboration and increases collective performance significantly. This means agents have learned the importance of communication and found a balance between collaboration and competition.

Authors: Philipp D Siedler (Aleph Alpha)

Reinforcement Learning Oceans & Marine Systems
ICLR 2023 Sub-seasonal to seasonal forecasts through self-supervised learning (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Sub-seasonal to seasonal (S2S) weather forecasts are an important decision- making tool that informs economical and logistical planning in agriculture, energy management, and disaster mitigation. They are issued on time scales of weeks to months and differ from short-term weather forecasts in two important ways: (i) the dynamics of the atmosphere on these timescales can be described only statistically and (ii) these dynamics are characterized by large-scale phenomena in both space and time. While deep learning (DL) has shown promising results in short-term weather forecasting, DL-based S2S forecasts are challenged by comparatively small volumes of available training data and large fluctuations in predictability due to atmospheric conditions. In order to develop more reliable S2S predictions that leverage current advances in DL, we propose to utilize the masked auto-encoder (MAE) framework to learn generic representations of large-scale atmospheric phenomena from high resolution global data. Besides exploring the suitability of the learned representations for S2S forecasting, we will also examine whether they account for climatic phenomena (e.g., the Madden-Julian Oscillation) that are known to increase predictability on S2S timescales.

Authors: Jannik Thuemmel (University of Tuebingen); Felix Strnad (Potsdam Institute for Climate Impact Research); Jakob Schlör (Eberhard Karls Universität Tübingen); Martin V. Butz (University of Tübingen); Bedartha Goswami (University of Tübingen)

Unsupervised & Semi-Supervised Learning Climate Science & Modeling Extreme Weather Interpretable ML
ICLR 2023 Mining Effective Strategies for Climate Change Communication (Papers Track)
Abstract and authors: (click to expand)

Abstract: With the goal of understanding effective strategies to communicate about climate change, we build interpretable models to rank tweets related to climate change with respect to the engagement they generate. Our models are based on the Bradley-Terry model of pairwise comparison outcomes and use a combination of the tweets’ topic and metadata features to do the ranking. To remove confounding factors related to author popularity and minimise noise, they are trained on pairs of tweets that are from the same author and around the same time period and have a sufficiently large difference in engagement. The models achieve good accuracy on a held-out set of pairs. We show that we can interpret the parameters of the trained model to identify the topic and metadata features that contribute to high engagement. Among other observations, we see that topics related to climate projections, human cost and deaths tend to have low engagement while those related to mitigation and adaptation strategies have high engagement. We hope the insights gained from this study will help craft effective climate communication to promote engagement, thereby lending strength to efforts to tackle climate change.

Authors: Aswin Suresh (EPFL); Lazar Milikic (EPFL); Francis Murray (EPFL); Yurui Zhu (EPFL); Matthias Grossglauser (École Polytechnique Fédérale de Lausanne (EPFL))

Natural Language Processing Behavioral and Social Science Public Policy Societal Adaptation & Resilience Data Mining Interpretable ML Unsupervised & Semi-Supervised Learning
ICLR 2023 Graph-Based Deep Learning for Sea Surface Temperature Forecasts (Papers Track)
Abstract and authors: (click to expand)

Abstract: Sea surface temperature (SST) forecasts help with managing the marine ecosystem and the aquaculture impacted by anthropogenic climate change. Numerical dynamical models are resource intensive for SST forecasts; machine learning (ML) models could reduce high computational requirements and have been in the focus of the research community recently. ML models normally require a large amount of data for training. Environmental data are collected on regularly-spaced grids, so early work mainly used grid-based deep learning (DL) for prediction. However, both grid data and the corresponding DL approaches have inherent problems. As geometric DL has emerged, graphs as a more generalized data structure and graph neural networks (GNNs) have been introduced to the spatiotemporal domains. In this work, we preliminarily explored graph re-sampling and GNNs for global SST forecasts, and GNNs show better one month ahead SST prediction than the persistence model in most oceans in terms of root mean square errors.

Authors: Ding Ning (University of Canterbury); Varvara Vetrova (University of Canterbury); Karin Bryan (University of Waikato)

Data Mining Climate Science & Modeling Oceans & Marine Systems Time-series Analysis
ICLR 2023 Activity-Based Recommendations for the Reduction of CO2 Emissions in Private Households (Papers Track)
Abstract and authors: (click to expand)

Abstract: This paper proposes an activity prediction framework for a multi-agent recommendation system to tackle the energy-efficiency problem in residential buildings. Our system generates an activity-shifting schedule based on the social practices from the users’ domestic life. We further provide a utility option for the recommender system to focus on saving CO2 emissions or energy costs, or both. The empirical results show that while focusing on the reduction of CO2 emissions, the system provides an average of 12% of emission savings and 7% of electricity cost savings. When concentrating on energy costs, 6% of emission savings and 20% of electricity cost savings are possible for the studied households.

Authors: Alona Zharova (Humboldt University of Berlin); Laura Löschmann (Humboldt University of Berlin)

Recommender Systems Buildings Power & Energy
ICLR 2023 Data-driven mean-variability optimization of PV portfolios with automatic differentiation (Papers Track)
Abstract and authors: (click to expand)

Abstract: Increasing PV capacities has a crucial role to reach carbon-neutral energy systems. To promote PV expansion, policy designs have been developed which rely on energy yield maximization to increase the total PV energy supply in energy systems. Focusing on yield maximization, however, ignores negative side-effects such as an increased variability due to similar-orientated PV systems at clustered regions. This can lead to costly ancillary services and thereby reduces the acceptance of renewable energy. This paper suggests to rethink PV portfolio designs by deriving mean-variability hedged PV portfolios with smartly orientated tilt and azimuth angles. Based on a data-driven method inspired from modern portfolio theory, we formulate the problem as a biobjective, non-convex optimization problem which is solved based on automatically differentiating the physical PV conversion model subject to individual tilt and azimuth angles. To illustrate the performance of the proposed method, a case study is designed to derive efficient frontiers in the mean-variability spectrum of Germany's PV portfolio based on representative grid points. The proposed method allows decision-makers to hedge between variability and yield in PV portfolio design decisions. This is the first study highlighting the problem of ignoring variability in PV portfolio expansion schemes and introduces a way to tackle this issue using modern methods inspired by Machine Learning.

Authors: Matthias Zech (German Aerospace Center (DLR), Institute of Networked Energy Systems); Lueder von Bremen (German Aerospace Center (DLR), Institute of Networked Energy Systems)

Power & Energy Climate Finance & Economics Interpretable ML
ICLR 2023 DiffESM: Conditional Emulation of Earth System Models with Diffusion Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Earth System Models (ESMs) are essential tools for understanding the impact of human actions on Earth's climate. One key application of these models is studying extreme weather events, such as heat waves or dry spells, which have significant socioeconomic and environmental consequences. However, the computational demands of running a sufficient number of simulations to analyze the risks are often prohibitive. In this paper we demonstrate that diffusion models -- a class of generative deep learning models -- can effectively emulate the spatio-temporal trends of ESMs under previously unseen climate scenarios, while only requiring a small fraction of the computational resources. We present a diffusion model that is conditioned on monthly averages of temperature or precipitation on a 96x96 global grid, and produces daily values that are both realistic and consistent with those averages. Our results show that the output from our diffusion model closely matches the spatio-temporal behavior of the ESM it emulates in terms of the frequency of phenomena such as heat waves, dry spells, or rainfall intensity.

Authors: Seth Bassetti (Western Washington University); Brian Hutchinson (Western Washington University); Claudia Tebaldi (Joint Global Change Research Institute); Ben Kravitz (Indiana University)

Climate Science & Modeling Generative Modeling
ICLR 2023 Emission-Constrained Optimization of Gas Systems with Input-Convex Neural Networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: Planning optimization of gas networks under emission constraints prioritizes gas supply with the smallest emission footprint. As this problem includes complex gas flow physical laws, standard optimization solvers cannot guarantee convergence to a feasible solution, especially under strict emission constraints. To address this issue, we develop an input-convex neural network (ICNN) aided optimization routine which incorporates a set of trained ICNNs approximating the gas flow equations with high precision. Numerical tests on the Belgium gas network demonstrate that the ICNN-aided optimization dominates non-convex and relaxation-based solvers, with larger optimality gains pertaining to stricter emission targets.

Authors: Vladimir Dvorkin (Massachusetts Institute of Technology); Samuel C Chevalier (Technical University of Denmark); Spyros Chatzivasileiadis (Technical University of Denmark)

Power & Energy Supply Chains
ICLR 2023 Deep ensembles to improve uncertainty quantification of statistical downscaling models under climate change conditions (Papers Track)
Abstract and authors: (click to expand)

Abstract: Recently, deep learning has emerged as a promising tool for statistical downscaling, the set of methods for generating high-resolution climate fields from coarse low-resolution variables. Nevertheless, their ability to generalize to climate change conditions remains questionable, mainly due to the stationarity assumption. We propose deep ensembles as a simple method to improve the uncertainty quantification of statistical downscaling models. By better capturing uncertainty, statistical downscaling models allow for superior planning against extreme weather events, a source of various negative social and economic impacts. Since no observational future data exists, we rely on a pseudo reality experiment to assess the suitability of deep ensembles for quantifying the uncertainty of climate change projections. Deep ensembles allow for a better risk assessment, highly demanded by sectoral applications to tackle climate change.

Authors: Jose González-Abad (Instituto de Fı́sica de Cantabria (IFCA), CSIC-Universidad de Cantabria); Jorge Baño-Medina (Institute of Physics of Cantabria)

Uncertainty Quantification & Robustness Climate Science & Modeling
ICLR 2023 Bayesian Inference of Severe Hail in Australia (Papers Track)
Abstract and authors: (click to expand)

Abstract: Severe hailstorms are responsible for some of the most costly insured weather events in Australia and can cause significant damage to homes, businesses, and agriculture. However their response to climate change remains uncertain, in large part due to the challenges of observing severe hailstorms. We propose a novel Bayesian approach which explicitly models known biases and uncertainties of current hail observations to produce more realistic estimates of severe hail risk from existing observations. Training this model on data from south-east Queensland, Australia, suggests that previous analyses of severe hail that did not account for this uncertainty may produce poorly calibrated risk estimates. Preliminary evaluation on withheld data confirms that our model produces well-calibrated probabilities and is applicable out of sample. Whilst developed for hail, we highlight also the generality of our model and its potential applications to other severe weather phenomena and areas of climate change adaptation and mitigation.

Authors: Isabelle C Greco (University of New South Wales); Steven Sherwood (University of New South Wales); Timothy Raupach (University of New South Wales); Gab Abramowitz (University of New South Wales)

Causal & Bayesian Methods Climate Science & Modeling Disaster Management and Relief Earth Observation & Monitoring Extreme Weather Uncertainty Quantification & Robustness
ICLR 2023 Exploring the potential of neural networks for Species Distribution Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: Species distribution models (SDMs) relate species occurrence data with environmental variables and are used to understand and predict species distributions across landscapes. While some machine learning models have been adopted by the SDM community, recent advances in neural networks may have untapped potential in this field. In this work, we compare the performance of multi-layer perceptron (MLP) neural networks to well-established SDM methods on a benchmark dataset spanning 225 species in six geographical regions. We also compare the performance of MLPs trained separately for each species to an equivalent model trained on a set of species and performing multi-label classification. Our results show that MLP models achieve comparable results to state-of-the-art SDM methods, such as MaxEnt. We also find that multi-species MLPs perform slightly better than single-species MLPs. This study indicates that neural networks, along with all their convenient and valuable characteristics, are worth considering for SDMs.

Authors: Robin Zbinden (EPFL); Nina van Tiel (EPFL); Benjamin Kellenberger (Yale University); Lloyd H Hughes (EPFL); Devis Tuia (EPFL)

Ecosystems & Biodiversity
ICLR 2023 Modelling Atmospheric Dynamics with Spherical Fourier Neural Operators (Papers Track)
Abstract and authors: (click to expand)

Abstract: Fourier Neural Operators (FNOs) have established themselves as an efficient method for learning resolution-independent operators in a wide range of scientific machine learning applications. This can be attributed to their ability to effectively model long-range dependencies in spatio-temporal data through computationally ef- ficient global convolutions. However, the use of discrete Fourier transforms (DFTs) in FNOs leads to spurious artifacts and pronounced dissipation when applied to spherical coordinates, due to the incorrect assumption of flat geometry. To ad- dress the issue, we introduce Spherical FNOs (SFNOs), which use the generalized Fourier transform for learning operators on spherical geometries. We demonstrate the effectiveness of the method for forecasting atmospheric dynamics, producing stable auto-regressive results for a simulated time of one year (1,460 steps) while retaining physically plausible dynamics. This development has significant implica- tions for machine learning-based climate dynamics emulation, which could play a crucial role in accelerating our response to climate change.

Authors: Boris Bonev (NVIDIA); Thorsten Kurth (Nvidia); Christian Hundt (NVIDIA AI Technology Center); Jaideep Pathak (NVIDIA Corporation); Maximilian Baust (NVIDIA); Karthik Kashinath (NVIDIA); Anima Anandkumar (NVIDIA/Caltech)

Hybrid Physical Models Climate Science & Modeling Extreme Weather
ICLR 2023 Distributed Reinforcement Learning for DC Open Energy Systems (Papers Track)
Abstract and authors: (click to expand)

Abstract: The direct current open energy system (DCOES) enables the production, storage, and exchange of renewable energy within local communities, which is helpful, especially in isolated villages and islands where centralized power supply is unavailable or unstable. As solar and wind energy production varies in time and space depending on the weather and the energy usage patterns differ for different households, how to store and exchange energy is an important research issue. In this work, we explore the use of deep reinforcement learning (DRL) for adaptive control of energy storage in local batteries and energy sharing through DC grids. We extend the Autonomous Power Interchange System (APIS) emulator from SonyCSL to combine it with reinforcement learning algorithms in each house. We implemented deep Q-network (DQN) and prioritized DQN to dynamically set the parameters of the real-time energy exchange protocol of APIS and tested it using the actual data collected from the DCOES in the faculty houses of Okinawa Institute of Science and Technology (OIST). The simulation results showed that RL agents outperformed the hand-tuned control strategy. Sharing average energy production, storage, and usage within the local community further improved efficiency. The implementation of DRL methods for adaptive energy storage and exchange can help reducing carbon emission and positively impact the climate.

Authors: Qiong Huang (Okinawa Institute of Science and Technology Graduate University); Kenji Doya (Okinawa Institute of Science and Technology)

Reinforcement Learning Power & Energy
ICLR 2023 Uncovering the Spatial and Temporal Variability of Wind Resources in Europe: A Web-Based Data-Mining Tool (Papers Track)
Abstract and authors: (click to expand)

Abstract: We introduce REmap-eu.app, a web-based data-mining visualization tool of the spatial and temporal variability of wind resources. It uses the latest open-access dataset of the daily wind capacity factor in 28 European countries between 1979 and 2019 and proposes several user-configurable visualizations of the temporal and spatial variations of the wind power capacity factor. The platform allows for a deep analysis of the distribution, the cross-country correlation, and the drivers of low wind power events. It offers an easy-to-use interface that makes it suitable for the needs of researchers and stakeholders. The tool is expected to be useful in identifying areas of high wind potential and possible challenges that may impact the large-scale deployment of wind turbines in Europe. Particular importance is given to the visualization of low wind power events and to the potential of cross-border cooperations in mitigating the variability of wind in the context of increasing reliance on weather-sensitive renewable energy sources.

Authors: Alban Puech (École Polytechnique); Jesse Read (Ecole Polytechnique)

Data Mining Climate Science & Modeling Power & Energy Time-series Analysis
ICLR 2023 Understanding forest resilience to drought with Shapley values (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Increases in drought frequency, intensity, and duration due to climate change are threatening forests around the world. Climate-driven tree mortality is associated with devastating ecological and societal consequences, including the loss of carbon sequestration, habitat provisioning, and water filtration services. A spatially fine-grained understanding of the site characteristics making forests more resilient to drought is still lacking. Furthermore, the complexity of drought effects on forests, which can be cumulative and delayed, demands investigation of the most appropriate drought indices. In this study, we aim to gain a better understanding of the temporal and spatial drivers of drought-induced changes in forest vitality using Shapley values, which allow for the relevance of predictors to be quantified locally. A better understanding of the contribution of meteorological and environmental factors to trees’ response to drought can support forest managers aiming to make forests more climate-resilient.

Authors: Stenka Vulova (Technische Universität Berlin); Alby Duarte Rocha (Technische Universität Berlin); Akpona Okujeni (Humboldt-Universität zu Berlin); Johannes Vogel (Freie Universität Berlin); Michael Förster (Technische Universität Berlin); Patrick Hostert (Humboldt-Universität zu Berlin); Birgit Kleinschmit (Technische Universität Berlin)

Forests Earth Observation & Monitoring Climate Science & Modeling Ecosystems & Biodiversity Extreme Weather Interpretable ML
ICLR 2023 Improving the spatial accuracy of extreme tropical cyclone rainfall in ERA5 using deep learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: We propose a novel method for the bias adjustment and post-processing of gridded rainfall data products. Our method uses U-Net (a deep convolutional neural network) as a backbone, and a novel loss function given by the combination of a pixelwise bias component (Mean Absolute Error) and a spatial accuracy component (Fractions Skill Score). We evaluate the proposed approach by adjusting extreme rainfall from the popular ERA5 reanalysis dataset, using the multi-source observational dataset MSWEP as a target. We focus on a sample of extreme rainfall events induced by tropical cyclones and show that the proposed method significantly reduces both the MAE (by 16\%) and FSS (by 53\%) of ERA5.

Authors: Guido Ascenso (Politecnico di Milano); Andrea Ficchì (Politecnico di Milano); Matteo Giuliani (Politecnico di Milano); Leone Cavicchia (Fondazione Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC)); Enrico Scoccimarro (Fondazione Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC)); Andrea Castelletti (Politecnico di Milano)

Extreme Weather Climate Science & Modeling Computer Vision & Remote Sensing
ICLR 2023 XAI for transparent wind turbine power curve models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurate wind turbine power curve models, which translate ambient conditions into turbine power output, are crucial for wind energy to scale and fulfill its proposed role in the global energy transition. While machine learning (ML) methods have shown significant advantages over parametric, physics-informed approaches, they are often criticized for being opaque "black boxes", which hinders their application in practice. We apply Shapley values, a popular explainable artificial intelligence (XAI) method, and the latest findings from XAI for regression models, to uncover the strategies ML models have learned from operational wind turbine data. Our findings reveal that the trend towards ever larger model architectures, driven by a focus on test set performance, can result in physically implausible model strategies. Therefore, we call for a more prominent role of XAI methods in model selection. Moreover, we propose a practical approach to utilize explanations for root cause analysis in the context of wind turbine performance monitoring. This can help to reduce downtime and increase the utilization of turbines in the field.

Authors: Simon Letzgus (Technische Universität Berlin)

Power & Energy Interpretable ML
ICLR 2023 Green AutoML for Plastic Litter Detection (Papers Track)
Abstract and authors: (click to expand)

Abstract: The world’s oceans are polluted with plastic waste and the detection of it is an important step toward removing it. Wolf et al. (2020) created a plastic waste dataset to develop a plastic detection system. Our work aims to improve the machine learning model by using Green Automated Machine Learning (AutoML). One aspect of Green-AutoML is to search for a machine learning pipeline, while also minimizing the carbon footprint. In this work, we train five standard neural architectures for image classification on the aforementioned plastic waste dataset. Subsequently, their performance and carbon footprints are compared to an Efficient Neural Architecture Search as a well-known AutoML approach. We show the potential of Green-AutoML by outperforming the original plastic detection system by 1.1% in accuracy and using 33 times fewer floating point operations at inference, and only 29% of the carbon emissions of the best-known baseline. This shows the large potential of AutoML on climate-change relevant applications and at the same time contributes to more efficient modern Deep Learning systems, saving substantial resources and reducing the carbon footprint.

Authors: Daphne Theodorakopoulos (German Research Centre for Artificial Intelligence (DFKI), Marine Perception Department and Leibniz University Hannover, Institute of Artificial Intelligence); Christoph Manß (German Research Centre for Artificial Intelligence (DFKI), Marine Perception Department); Frederic Stahl (German Research Centre for Artificial Intelligence (DFKI), Marine Perception Department); Marius Lindauer (Leibniz University Hannover)

Oceans & Marine Systems Computer Vision & Remote Sensing
ICLR 2023 Robustly modeling the nonlinear impact of climate change on agriculture by combining econometrics and machine learning (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Climate change is expected to have a dramatic impact on agricultural production; however, due to natural complexity, the exact avenues and relative strengths by which this will happen are still unknown. The development of accurate forecasting models is thus of great importance to enable policy makers to design effective interventions. To date, most machine learning methods aimed at tackling this problem lack a consideration of causal structure, thereby making them unreliable for the types of counterfactual analysis necessary when making policy decisions. Econometrics has developed robust techniques for estimating cause-effect relations in time-series, specifically through the use of cointegration analysis and Granger causality. However, these methods are frequently limited in flexibility, especially in the estimation of nonlinear relationships. In this work, we propose to integrate the non-linear function approximators with the robust causal estimation methods to ultimately develop an accurate agricultural forecasting model capable of robust counterfactual analysis. This method would be a valuable new asset for government and industrial stakeholders to understand how climate change impacts agricultural production.

Authors: Benedetta Francesconi (Independent Researcher); Ying-Jung C Deweese (Descartes Labs / Georgia Insititute of Technology)

Agriculture & Food Climate Science & Modeling Public Policy Societal Adaptation & Resilience Time-series Analysis
ICLR 2023 Towards Green, Accurate, and Efficient AI Models Through Multi-Objective Optimization (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Machine learning is one of the fastest growing services in modern hyperscale data centers. While AI’s exponential scaling has enabled unprecedented modeling capabilities across computer vision, natural language processing, protein modeling, personalized recommendation, it comes at the expense of significant energy and environmental footprints. This work aims to co-optimize machine learning models in terms of their accuracy, compute efficiency, and environmental sustainability by using multi-objective bayesian optimization. We aim to extend current multi-objective optimization frameworks, such as the openly available Ax (adaptive experimentation) platform to balance accuracy, efficiency, and environmental sustainability of deep neural networks. In order to optimize for environmental sustainability we will consider the impact across AI model life cycles (e.g., training, inference) and hardware life cycles (e.g., manufacturing, operational use). Given this is a research proposal, we expect to demonstrate that designing for sustainable AI models yields distinct optimal neural network architectures than ones designed for accuracy and efficiency given the external impacts of varying renewable energy and tradeoffs between compute and storage for embodied carbon overheads.

Authors: Udit Gupta (Harvard University); Daniel R Jiang (Meta); Maximilian Balandat (Facebook); Carole-Jean Wu (Meta AI)

ICLR 2023 EfficientTempNet: Temporal Super-Resolution of Radar Rainfall (Papers Track)
Abstract and authors: (click to expand)

Abstract: Rainfall data collected by various remote sensing instruments such as radars or satellites has different space-time resolutions. This study aims to improve the temporal resolution of radar rainfall products to help with more accurate climate change modeling and studies. In this direction, we introduce a solution based on EfficientNetV2, namely EfficientTempNet, to increase the temporal resolution of radar-based rainfall products from 10 minutes to 5 minutes. We tested EfficientRainNet over a dataset for the state of Iowa, US, and compared its performance to three different baselines to show that EfficientTempNet presents a viable option for better climate change monitoring.

Authors: Bekir Z Demiray (University of Iowa); Muhammed A Sit (The University of Iowa); Ibrahim Demir (University of Iowa)

Earth Observation & Monitoring Climate Science & Modeling Extreme Weather Computer Vision & Remote Sensing Time-series Analysis
ICLR 2023 Bird Distribution Modelling using Remote Sensing and Citizen Science data (Papers Track) Overall Best Paper
Abstract and authors: (click to expand)

Abstract: Climate change is a major driver of biodiversity loss, changing the geographic range and abundance of many species. However, there remain significant knowl- edge gaps about the distribution of species, due principally to the amount of effort and expertise required for traditional field monitoring. We propose an approach leveraging computer vision to improve species distribution modelling, combining the wide availability of remote sensing data with sparse on-ground citizen science data from .We introduce a novel task and dataset for mapping US bird species to their habitats by predicting species encounter rates from satellite images, along with baseline models which demonstrate the power of our approach. Our methods open up possibilities for scalably modelling ecosystems properties worldwide.

Authors: Mélisande Teng (Mila, Université de Montréal); Amna Elmustafa (African Institute for Mathematical Science); Benjamin Akera (McGill University); Hugo Larochelle (UdeS); David Rolnick (McGill University, Mila)

Ecosystems & Biodiversity Computer Vision & Remote Sensing
ICLR 2023 Efficient HVAC Control with Deep Reinforcement Learning and EnergyPlus (Papers Track)
Abstract and authors: (click to expand)

Abstract: Heating and cooling comprise a significant fraction of the energy consumed by buildings, which in turn account for a significant fraction of society’s energy use. Most building heating, ventilation, and air conditioning (HVAC) systems use standard control schemes that meet basic operating constraints and comfort requirements but with suboptimal efficiency. Deep reinforcement learning (DRL) has shown immense potential for high-performing control in a variety of simulated settings, but has not been widely deployed for real-world control. Here we provide two contributions toward increasing the viability of real-world, DRL-based HVAC control, leveraging the EnergyPlus building simulator. First, we use the new EnergyPlus Python API to implement a first-of-its-kind, purely Python-based EnergyPlus DRL learning framework capable of generalizing to a wide variety of building configurations and weather scenarios. Second, we demonstrate an approach to constrained learning for this setting, removing the requirement to tune reward functions in order to maximize energy efficiency given temperature constraints. We tested our framework on realistic building models of a data center, an office building, and a secondary school. In each case, trained agents maintained temperature control while achieving energy savings relative to standard approaches.

Authors: Jared Markowitz (Johns Hopkins University Applied Physics Laboratory); Nathan Drenkow (Johns Hopkins University Applied Physics Laboratory)

Buildings Reinforcement Learning
ICLR 2023 Fourier Neural Operators for Arbitrary Resolution Climate Data Downscaling (Papers Track)
Abstract and authors: (click to expand)

Abstract: Running climate simulations informs us of future climate change. However, it is computationally expensive to resolve complex climate processes numerically. As one way to speed up climate simulations, neural networks have been used to downscale climate variables from fast-running low-resolution simulations. So far, all neural network downscaling models can only downscale input samples with a pre-defined upsampling factor. In this work, we propose a Fourier neural operator downscaling model. It trains with data of a small upsampling factor and then can zero-shot downscale its input to arbitrary unseen high-resolutions. Evaluated on Navier-Stokes equation solution data and ERA5 water content data, our downscaling model demonstrates better performance than widely used convolutional and adversarial generative super-resolution models in both learned and zero-shot downscaling. Our model's performance is further boosted when a constraint layer is applied. In the end, we show that by combining our downscaling model with a low-resolution numerical PDE solver, the downscaled solution outperforms the solution of the state-of-the-art high-resolution data-driven solver. Our model can be used to cheaply and accurately generate arbitrarily high-resolution climate simulation data with fast-running low-resolution simulation as input.

Authors: Qidong Yang (New York University); Paula Harder (Fraunhofer ITWM); Venkatesh Ramesh (University of Montreal, Mila); Alex Hernandez-Garcia (Mila - Quebec AI Institute); Daniela Szwarcman (IBM Research); Prasanna Sattigeri (IBM Research); Campbell D Watson (IBM Reserch); David Rolnick (McGill University, Mila)

Climate Science & Modeling Earth Observation & Monitoring Computer Vision & Remote Sensing Generative Modeling
ICLR 2023 Data-driven multiscale modeling of subgrid parameterizations in climate models (Papers Track) Best ML Innovation
Abstract and authors: (click to expand)

Abstract: Subgrid parameterizations that represent physical processes occurring below the resolution of current climate models are an important component in producing accurate, long-term predictions for the climate. A variety of approaches have been tested to design these components, including deep learning methods. In this work, we evaluate a proof of concept illustrating a multiscale approach to this prediction problem. We train neural networks to predict subgrid forcing values on a testbed model and examine improvements in prediction accuracy which can be obtained by using additional information in both fine-to-coarse and coarse-to-fine directions.

Authors: Karl Otness (New York University); Laure Zanna (NYU); Joan Bruna (NYU)

Climate Science & Modeling Hybrid Physical Models
ICLR 2023 Decision-aware uncertainty-calibrated deep learning for robust energy system operation (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Decision-making under uncertainty is an important problem that arises in many domains. Achieving robustness guarantees requires well-calibrated uncertainties, which can be difficult to achieve in high-capacity prediction models such as deep neural networks. This paper proposes an end-to-end approach for learning uncertainty-calibrated deep learning models that directly optimizes a downstream decision-making objective with provable robustness. We also propose two concrete applications in energy system operations, including a grid scheduling task as well as an energy storage arbitrage task. As renewable wind and solar generation increasingly proliferate and their variability penetrates the energy grid, learning uncertainty-aware predictive models becomes increasingly crucial for maintaining efficient and reliable grid operation.

Authors: Christopher Yeh (California Institute of Technology); Nicolas Christianson (California Institute of Technology); Steven Low (California Institute of Technology); Adam Wierman (California Institute of Technology); Yisong Yue (Caltech)

Power & Energy Reinforcement Learning
ICLR 2023 Multi-Agent Deep Reinforcement Learning for Solar-Battery System to Mitigate Solar Curtailment in Real-Time Electricity Market (Papers Track)
Abstract and authors: (click to expand)

Abstract: The increased uptake of solar energy in the energy transition towards decarbonization has caused the issue of solar photovoltaic (PV) curtailments, resulting in significant economic losses and hindering the energy transition. To overcome this issue, battery energy storage systems (BESS) can serve as onsite backup sources for solar farms. However, the backup role of the BESS significantly limits its economic value, disincentivizing the BESS deployment due to high investment costs. Hence, it is essential to effectively reduce solar curtailment while ensuring viable operations of the BESS.

Authors: Jinhao Li (Monash University); Changlong Wang (Monash University); Hao Wang (Monash University)

Power & Energy Reinforcement Learning
ICLR 2023 Projecting the climate penalty on pm2.5 pollution with spatial deep learning (Proposals Track)
Abstract and authors: (click to expand)

Abstract: The climate penalty measures the effects of a changing climate on air quality due to the interaction of pollution with climate factors, independently of future changes in emissions. This work introduces a statistical framework for estimating the climate penalty on soot pollution (PM 2.5), which has been linked to respiratory and cardiovascular diseases and premature mortality. The framework evaluates the disparities in future PM 2.5 exposure across racial/ethnic and income groups---an important step towards informing mitigation public health policy and promoting environmental equity in addressing the effects of climate change. The proposed methodology aims to improve existing statistical-based methods for estimating the climate penalty using an expressive and scalable predictive model based on spatial deep learning with spatiotemporal trend estimation. The proposed approach will (1) use higher-resolution climate inputs, which current statistical methods to estimate the climate penalty approaches cannot accommodate; (2) integrate additional predictive data sources such as demographics, geology, and land use; (3) consider regional dependencies and synoptic weather patterns influencing PM 2.5, deconvolving the effects of climate change from increasing air quality regulations and other sources of unmeasured spatial heterogeneity.

Authors: Mauricio Tec (Harvard University); Riccardo Cadei (Harvard University); Francesca Dominici (Harvard University); Corwin Zigler (University of Texas at Austin)

Climate Justice Health Computer Vision & Remote Sensing Time-series Analysis
ICLR 2023 On the impact of small-data diversity on forecasts: evidence from meteorologically-driven electricity demand in Mediterranean zones. (Papers Track)
Abstract and authors: (click to expand)

Abstract: In this paper, we compare the improvement of probabilistic electricity demand forecasts for three specific coastal and island regions using raw and pre-computed meteorological features based on empirically-tested formulations drawn from climate science literature. Typically for the general task of time-series forecasting with strong weather/climate drivers, go-to models like the Autoregressive Integrated Moving Average (ARIMA) model are built with assumptions of how independent variables will affect a dependent one and are at best encoded with a handful of exogenous features with known impact. Depending on the geographical region and/or cultural practices of a population, such a selection process may yield a non-optimal feature set which would ultimately drive a weak impact on underline demand forecasts. The aim of this work is to assess the impact of a documented set of meteorological features on electricity demand using deep learning models in comparative studies. Leveraging the defining computational architecture of the Temporal Fusion Transformer (TFT), we discover the unimportance of weather features for improving probabilistic forecasts for the targeted regions. However, through experimentation, we discover that the more stable electricity demand of the coastal Mediterranean regions, the Ceuta and Melilla autonomous cities in Morocco, improved the forecast accuracy of the strongly tourist-driven electricity demand for the Balearic islands located in Spain during the time of travel restrictions (i.e., during COVID19 (2020))--a root mean squared error (RMSE) from ~0.090 to ~0.012 with a substantially improved 10th/90th quantile bounding.

Authors: Reginald Bryant (IBM Research - Africa); Julian Kuehnert (IBM Research)

Time-series Analysis Cities & Urban Planning Climate Science & Modeling Extreme Weather Societal Adaptation & Resilience Uncertainty Quantification & Robustness
ICLR 2023 Artificial Intelligence in Tropical Cyclone Forecasting (Proposals Track)
Abstract and authors: (click to expand)

Abstract: Tropical cyclones (TC) in Bangladesh and other developing nations harm property and human lives. Theoretically, artificial intelligence (AI) can anticipate TC using tracking, intensity, and cyclone aftereffect phenomena. Although AI has a significant impact on predicting, poorer nations have struggled to adjust to its real-world applications. The interpretability of such a solution from an AI-based solution is the main factor in that situation, together with the infrastructure. Explainable AI has been extensively employed in the medical field because the outcome is so important. We believe that using explainable AI in TC forecasting is equally important as one large forecast can cause the thought of life loss. Additionally, it will improve the long-term viability of the AI-based weather forecasting system. To be more specific, we want to employ explainable AI in every way feasible in this study to address the problems of TC forecasting, intensity estimate, and tracking. We'll do this by using the graph neural network. The adoption of AI-based solutions in underdeveloped nations will be aided by this solution, which will boost their acceptance. With this effort, we also hope to tackle the challenge of sustainable AI in order to tackle the issue of climate change on a larger scale. However, Cyclone forecasting might be transformed by sustainable AI, guaranteeing precise and early predictions to lessen the effects of these deadly storms. The examination of vast volumes of meteorological data to increase forecasting accuracy is made possible by the combination of AI algorithms and cutting-edge technologies like machine learning and big data analytics. Improved accuracy is one of the main advantages of sustainable AI for cyclone prediction. To provide more precise forecasts, AI systems can evaluate a lot of meteorological data, including satellite imagery and ocean temperature readings.

Authors: Dr. Nusrat Sharmin (Military Institute of Science and Technology); Professor Dr. Md. Mahbubur Rahman Rahman (Military Institute of Science and Technology (MIST)); Sabbir Rahman (Military Institute of Science and Technology); Mokhlesur Rahman (Military Institute of Science and Technology)

Climate Science & Modeling Disaster Management and Relief Interpretable ML
NeurIPS 2022 Function Approximations for Reinforcement Learning Controller for Wave Energy Converters (Papers Track)
Abstract and authors: (click to expand)

Abstract: Waves are a more consistent form of clean energy than wind and solar and the latest Wave Energy Converters (WEC) platforms like CETO 6 have evolved into complex multi-generator designs with a high energy capture potential for financial viability. Multi-Agent Reinforcement Learning (MARL) controller can handle these complexities and control the WEC optimally unlike the default engineering controllers like Spring Damper which suffer from lower energy capture and mechanical stress from the spinning yaw motion. In this paper, we look beyond the normal hyper-parameter and MARL agent tuning and explored the most suitable architecture for the neural network function approximators for the policy and critic networks of MARL which act as its brain. We found that unlike the commonly used fully connected network (FCN) for MARL, the sequential models like transformers and LSTMs can model the WEC system dynamics better. Our novel transformer architecture, Skip Transformer-XL (STrXL), with several gated residual connections in and around the transformer block performed better than the state-of-the-art with faster training convergence. STrXL boosts energy efficiency by an average of 25% to 28% over the existing spring damper (SD) controller for waves at different angles and almost eliminated the mechanical stress from the rotational yaw motion, saving costly maintenance on open seas, and thus reducing the Levelized Cost of wave energy (LCOE). Demo: https://tinyurl.com/4s4mmb9v

Authors: Soumyendu Sarkar (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Alexander Shmakov (UC Irvine); Sahand Ghorbanpour (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Alexandre Pichard (Carnegie Clean Energy); Mathieu Cocho (Carnegie Clean Energy)

Power & Energy Reinforcement Learning
NeurIPS 2022 Image-Based Soil Organic Carbon Estimation from Multispectral Satellite Images with Fourier Neural Operator and Structural Similarity (Papers Track)
Abstract and authors: (click to expand)

Abstract: Soil organic carbon (SOC) sequestration is the transfer and storage of atmospheric carbon dioxide in soils, which plays an important role in climate change mitigation. SOC concentration can be improved by proper land use, thus it is beneficial if SOC can be estimated at a regional or global scale. As multispectral satellite data can provide SOC-related information such as vegetation and soil properties at a global scale, estimation of SOC through satellite data has been explored as an alternative to manual soil sampling. Although existing works show promising results, most studies are based on pixel-based approaches with traditional machine learning methods, and convolutional neural networks (CNNs) are seldom used. To study the advantages of using CNNs on SOC remote sensing, in this paper, we propose the FNO-DenseNet based on the state-of-the-art Fourier neural operator (FNO). By combining the advantages of the FNO and DenseNet, the FNO-DenseNet outperformed the FNO in our experiments with hundreds of times fewer parameters. The FNO-DenseNet also outperformed a pixel-based random forest by 18% in the mean absolute percentage error. To the best of our knowledge, this is the first work of applying the FNO on SOC remote sensing.

Authors: Ken C. L. Wong (IBM Research – Almaden Research Center); Levente Klein (IBM Research); Ademir Ferreira da Silva (IBM Research); Hongzhi Wang (IBM Almaden Research Center); Jitendra Singh (IBM Research - India); Tanveer Syeda-Mahmood (IBM Research)

Carbon Capture & Sequestration Computer Vision & Remote Sensing
NeurIPS 2022 SolarDK: A high-resolution urban solar panel image classification and localization dataset (Papers Track)
Abstract and authors: (click to expand)

Abstract: The body of research on classification of solar panel arrays from aerial imagery is increasing, yet there are still not many public benchmark datasets. This paper introduces two novel benchmark datasets for classifying and localizing solar panel arrays in Denmark: A human annotated dataset for classification and segmentation, as well as a classification dataset acquired using self-reported data from the Danish national building registry. We explore the performance of prior works on the new benchmark dataset, and present results after fine-tuning models using a similar approach as recent works. Furthermore, we train models of newer architectures and provide benchmark baselines to our datasets in several scenarios. We believe the release of these datasets may improve future research in both local and global geospatial domains for identifying and mapping of solar panel arrays from aerial imagery. The data is accessible at https://osf.io/aj539/.

Authors: Maxim MK Khomiakov (DTU); Julius Radzikowski (DTU); Carl Schmidt (DTU); Mathias Sørensen (DTU); Mads Andersen (DTU); Michael Andersen (Technical University of Denmark); Jes Frellsen (Technical University of Denmark)

Buildings Power & Energy Computer Vision & Remote Sensing
NeurIPS 2022 Bayesian inference for aerosol vertical profiles (Papers Track)
Abstract and authors: (click to expand)

Abstract: Aerosol-cloud interactions constitute the largest source of uncertainty in assessments of the anthropogenic climate change. This uncertainty arises in part from the difficulty in measuring the vertical distributions of aerosols. We often have to settle for less informative vertically aggregated proxies such as aerosol optical depth (AOD). In this work, we develop a framework to infer vertical aerosol profiles using AOD and readily available vertically resolved meteorological predictors such as temperature or relative humidity. We devise a simple Gaussian process prior over aerosol vertical profiles and update it with AOD observations. We validate our approach using ECHAM-HAM aerosol-climate model data. Our results show that, while simple, our model is able to reconstruct realistic extinction profiles with well-calibrated uncertainty. In particular, the model demonstrates a faithful reconstruction of extinction patterns arising from aerosol water uptake in the boundary layer.

Authors: Shahine Bouabid (University of Oxford); Duncan Watson-Parris (University of Oxford); Dino Sejdinovic (University of Adelaide)

Climate Science & Modeling Causal & Bayesian Methods
NeurIPS 2022 Optimizing toward efficiency for SAR image ship detection (Papers Track)
Abstract and authors: (click to expand)

Abstract: The detection and prevention of illegal fishing is critical to maintaining a healthy and functional ecosystem. Recent research on ship detection in satellite imagery has focused exclusively on performance improvements, disregarding detection efficiency. However, the speed and compute cost of vessel detection are essential for a timely intervention to prevent illegal fishing. Therefore, we investigated optimization methods that lower detection time and cost with minimal performance loss. We trained an object detection model based on a convolutional neural network (CNN) using a dataset of satellite images. Then, we designed two efficiency optimizations that can be applied to the base CNN or any other base model. The optimizations consist of a fast, cheap classification model and a statistical algorithm. The integration of the optimizations with the object detection model leads to a trade-off between speed and performance. We studied the trade-off using metrics that give different weight to execution time and performance. We show that by using a classification model the average precision of the detection model can be approximated to 99.5% in 44% of the time or to 92.7% in 25% of the time.

Authors: Arthur Van Meerbeeck (KULeuven); Ruben Cartuyvels (KULeuven); Jordy Van Landeghem (KULeuven); Sien Moens (KU Leuven)

Computer Vision & Remote Sensing Ecosystems & Biodiversity Oceans & Marine Systems
NeurIPS 2022 AutoML-based Almond Yield Prediction and Projection in California (Papers Track)
Abstract and authors: (click to expand)

Abstract: Almonds are one of the most lucrative products of California, but are also among the most sensitive to climate change. In order to better understand the relationship between climatic factors and almond yield, an automated machine learning framework is used to build a collection of machine learning models. The prediction skill is assessed using historical records. Future projections are derived using 17 downscaled climate outputs. The ensemble mean projection displays almond yield changes under two different climate scenarios, along with two technology development scenarios, where the role of technology development is highlighted. The mean projections and distributions provide insightful results to stakeholders and can be utilized by policymakers for climate adaptation.

Authors: Shiheng Duan (Lawrence Livermore National Laboratory); Shuaiqi Wu (University of California, Davis); Erwan Monier (University of California, Davis); Paul Ullrich (University of California, Davis)

Agriculture & Food
NeurIPS 2022 Attention-Based Scattering Network for Satellite Imagery (Papers Track)
Abstract and authors: (click to expand)

Abstract: Multi-channel satellite imagery, from stacked spectral bands or spatiotemporal data, have meaningful representations for various atmospheric properties. Combining these features in an effective manner to create a performant and trustworthy model is of utmost importance to forecasters. Neural networks show promise, yet suffer from unintuitive computations, fusion of high-level features, and may be limited by the quantity of available data. In this work, we leverage the scattering transform to extract high-level features without additional trainable parameters and introduce a separation scheme to bring attention to independent input channels. Experiments show promising results on estimating tropical cyclone intensity and predicting the occurrence of lightning from satellite imagery.

Authors: Jason Stock (Colorado State University); Charles Anderson (Colorado State University)

Climate Science & Modeling Earth Observation & Monitoring Interpretable ML
NeurIPS 2022 Discovering Interpretable Structural Model Errors in Climate Models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Inaccuracies in the models of the Earth system, i.e., structural and parametric model errors, lead to inaccurate climate change projections. Errors in the model can originate from unresolved phenomena due to a low numerical resolution, as well as misrepresentations of physical phenomena or boundaries (e.g., orography). Therefore, such models lead to inaccurate short--term forecasts of weather and extreme events, and more importantly, long term climate projections. While calibration methods have been introduced to address for parametric uncertainties, e.g., by better estimation of system parameters from observations, addressing structural uncertainties, especially in an interpretable manner, remains a major challenge. Therefore, with increases in both the amount and frequency of observations of the Earth system, algorithmic innovations are required to identify interpretable representations of the model errors from observations. We introduce a flexible, general-purpose framework to discover interpretable model errors, and show its performance on a canonical prototype of geophysical turbulence, the two--level quasi--geostrophic system. Accordingly, a Bayesian sparsity--promoting regression framework is proposed, that uses a library of kernels for discovery of model errors. As calculating the library from noisy and sparse data (e.g., from observations) using convectional techniques leads to interpolation errors, here we use a coordinate-based multi--layer embedding to impute the sparse observations. We demonstrate the importance of alleviating spectral bias, and propose a random Fourier feature layer to reduce it in the proposed embeddings, and subsequently enable an accurate discovery. Our framework is demonstrated to successfully identify structural model errors due to linear and nonlinear processes (e.g., radiation, surface friction, advection), as well as misrepresented orography.

Authors: Rambod Mojgani (Rice University); Ashesh K Chattopadhyay (Rice University); Pedram Hassanzadeh (Rice University)

Climate Science & Modeling Earth Observation & Monitoring Interpretable ML
NeurIPS 2022 Aboveground carbon biomass estimate with Physics-informed deep network (Papers Track)
Abstract and authors: (click to expand)

Abstract: The global carbon cycle is a key process to understand how our climate is changing. However, monitoring the dynamics is difficult because a high-resolution robust measurement of key state parameters including the aboveground carbon biomass (AGB) is required. We use deep neural network to generate a wall-to-wall map of AGB within the Continental USA (CONUS) with 30-meter spatial resolution for the year 2021. We combine radar and optical hyperspectral imagery, with a physical climate parameter of solar-induced chlorophyll fluorescence (SIF)-based gross primary productivity (GPP). Validation results show that a masked variation of UNet has the lowest validation RMSE of 37.93 ± 1.36 Mg C/ha, as compared to 52.30 ± 0.03 Mg C/ha for random forest algorithm. Furthermore, models that learn from SIF-based GPP in addition to radar and optical imagery reduce validation RMSE by almost 10% and the standard deviation by 40%. Finally, we apply our model to measure losses in AGB from the recent 2021 Caldor wildfire in California, and validate our analysis with Sentinel-based burn index.

Authors: Juan Nathaniel (Columbia University); Levente Klein (IBM Research); Campbell D Watson (IBM Reserch); Gabrielle Nyirjesy (Columbia University); Conrad M Albrecht (IBM Research)

Computer Vision & Remote Sensing Forests Hybrid Physical Models
NeurIPS 2022 Improving the predictions of ML-corrected climate models with novelty detection (Papers Track)
Abstract and authors: (click to expand)

Abstract: While previous works have shown that machine learning (ML) can improve the prediction accuracy of coarse-grid climate models, these ML-augmented methods are more vulnerable to irregular inputs than the traditional physics-based models they rely on. Because ML-predicted corrections feed back into the climate model’s base physics, the ML-corrected model regularly produces out of sample data, which can cause model instability and frequent crashes. This work shows that adding semi-supervised novelty detection to identify out-of-sample data and disable the ML-correction accordingly stabilizes simulations and sharply improves the quality of predictions. We design an augmented climate model with a one-class support vector machine (OCSVM) novelty detector that provides better temperature and precipitation forecasts in a year-long simulation than either a baseline (no-ML) or a standard ML-corrected run. By improving the accuracy of coarse-grid climate models, this work helps make accurate climate models accessible to researchers without massive computational resources.

Authors: Clayton H Sanford (Columbia); Anna Kwa (Allen Institute for Artificial Intelligence); Oliver Watt-Meyer (Allen Institute for AI); Spencer Clark (Allen Institute for AI); Noah Brenowitz (Allen Institute for AI); Jeremy McGibbon (Allen Institute for AI); Christopher Bretherton (Allen Institute for AI)

Climate Science & Modeling Hybrid Physical Models Uncertainty Quantification & Robustness Unsupervised & Semi-Supervised Learning
NeurIPS 2022 Levee protected area detection for improved flood risk assessment in global hydrology models (Papers Track)
Abstract and authors: (click to expand)

Abstract: Precise flood risk assessment is needed to reduce human societies vulnerability as climate change increases hazard risk and exposure related to floods. Levees are built to protect people and goods from flood, which alters river hydrology, but are still not accounted for by global hydrological model. Detecting and integrating levee structures to global hydrological simulations is thus expected to enable more precise flood simulation and risk assessment, with important consequences for flood risk mitigation. In this work, we propose a new formulation to the problem of identifying levee structures: instead of detecting levees themselves, we focus on segmenting the region of the floodplain they protect. This formulation allows to better identify protected areas, to leverage the structure of hydrological data, and to simplify the integration of levee information to global hydrological models.

Authors: Masato Ikegawa (Kobe University); Tristan E.M Hascoet (Kobe University); Victor Pellet (Observatoire de Paris); Xudong Zhou (The University of Tokyo); Tetsuya Takiguchi (Kobe University); Dai Yamazaki (The University of Tokyo)

Disaster Management and Relief
NeurIPS 2022 Scene-to-Patch Earth Observation: Multiple Instance Learning for Land Cover Classification (Papers Track)
Abstract and authors: (click to expand)

Abstract: Land cover classification (LCC), and monitoring how land use changes over time, is an important process in climate change mitigation and adaptation. Existing approaches that use machine learning with Earth observation data for LCC rely on fully-annotated and segmented datasets. Creating these datasets requires a large amount of effort, and a lack of suitable datasets has become an obstacle in scaling the use of LCC. In this study, we propose Scene-to-Patch models: an alternative LCC approach utilising Multiple Instance Learning (MIL) that requires only high-level scene labels. This enables much faster development of new datasets whilst still providing segmentation through patch-level predictions, ultimately increasing the accessibility of using LCC for different scenarios. On the DeepGlobe-LCC dataset, our approach outperforms non-MIL baselines on both scene- and patch-level prediction. This work provides the foundation for expanding the use of LCC in climate change mitigation methods for technology, government, and academia.

Authors: Joseph Early (University of Southampton); Ying-Jung C Deweese (Georgia Insititute of Technology); Christine Evers (University of Southampton); Sarvapali Ramchurn (University of Southampton)

Land Use Earth Observation & Monitoring Agriculture & Food Cities & Urban Planning Forests Interpretable ML Unsupervised & Semi-Supervised Learning
NeurIPS 2022 Deep learning for downscaling tropical cyclone rainfall (Papers Track)
Abstract and authors: (click to expand)

Abstract: Flooding is often the leading cause of mortality and damages from tropical cyclones. With rainfall from tropical cyclones set to rise under global warming, better estimates of extreme rainfall are required to better support resilience efforts. While high resolution climate models capture tropical cyclone statistics well, they are computationally expensive leading to a trade-off between accuracy and generating enough ensemble members to generate sufficient high impact, low probability events. Often, downscaling models are used as a computationally cheaper alternative. Here, we develop and evaluate a set of deep learning models for downscaling tropical cyclone rainfall for more robust risk analysis.

Authors: Emily Vosper (University of Bristol); Lucy Harris (University of Oxford); Andrew McRae (University of Oxford); Laurence Aitchison (University of Bristol); Peter Watson (Bristol); Raul Santos Rodriguez (University of Bristol); Dann Mitchell (University of Bristol)

Extreme Weather Climate Science & Modeling Interpretable ML
NeurIPS 2022 Short-term Prediction and Filtering of Solar Power Using State-Space Gaussian Processes (Papers Track)
Abstract and authors: (click to expand)

Abstract: Short-term forecasting of solar photovoltaic energy (PV) production is important for powerplant management. Ideally these forecasts are equipped with error bars, so that downstream decisions can account for uncertainty. To produce predictions with error bars in this setting, we consider Gaussian processes (GPs) for modelling and predicting solar photovoltaic energy production in the UK. A standard application of GP regression on the PV timeseries data is infeasible due to the large data size and non-Gaussianity of PV readings. However, this is made possible by leveraging recent advances in scalable GP inference, in particular, by using the state-space form of GPs, combined with modern variational inference techniques. The resulting model is not only scalable to large datasets but can also handle continuous data streams via Kalman filtering.

Authors: So Takao (UCL); Sean Nassimiha (UCL); Peter Dudfield (Open Climate Fix); Jack Kelly (Open Climate Fix); Marc Deisenroth (University College London)

Time-series Analysis Climate Science & Modeling Power & Energy Causal & Bayesian Methods Uncertainty Quantification & Robustness
NeurIPS 2022 Identifying latent climate signals using sparse hierarchical Gaussian processes (Papers Track)
Abstract and authors: (click to expand)

Abstract: Extracting latent climate signals from multiple climate model simulations is important to estimate future climate change. To tackle this we develop a sparse hierarchical Gaussian process (SHGP), which probabilistically learns a latent distribution from a set of vectors. We use this to predict the latent surface temperature change globally and for central England from an ensemble of climate models, in a scalable manner and with robust uncertainty propagation.

Authors: Matt Amos (Lancaster University); Thomas Pinder (Lancaster University); Paul Young (Lancaster University)

Climate Science & Modeling Causal & Bayesian Methods Uncertainty Quantification & Robustness
NeurIPS 2022 Towards dynamical stability analysis of sustainable power grids using Graph Neural Networks (Papers Track)
Abstract and authors: (click to expand)

Abstract: To mitigate climate change, the share of renewable needs to be increased. Renewable energies introduce new challenges to power grids due to decentralization, reduced inertia and volatility in production. The operation of sustainable power grids with a high penetration of renewable energies requires new methods to analyze the dynamical stability. We provide new datasets of dynamical stability of synthetic power grids, and find that graph neural networks (GNNs) are surprisingly effective at predicting the highly non-linear target from topological information only. To illustrate the potential to scale to real-sized power grids, we demonstrate the successful prediction on a Texan power grid model.

Authors: Christian Nauck (PIK); Michael Lindner (PIK); Konstantin Schürholt (University of St. Gallen); Frank Hellmann (PIK)

Power & Energy
NeurIPS 2022 Detecting Methane Plumes using PRISMA: Deep Learning Model and Data Augmentation (Papers Track)
Abstract and authors: (click to expand)

Abstract: The new generation of hyperspectral imagers, such as PRISMA, has improved significantly our detection capability of methane (CH4) plumes from space at high spatial resolution (∼30m). We present here a complete framework to identify CH4 plumes using images from the PRISMA satellite mission and a deep learning technique able to automatically detect plumes over large areas. To compensate for the sparse database of PRISMA images, we trained our model by transposing high resolution plumes from Sentinel-2 to PRISMA. Our methodology avoids computationally expensive synthetic plume from Large Eddy Simulations while generating a broad and realistic training database, and paves the way for large-scale detection of methane plumes using future hyperspectral sensors (EnMAP, EMIT, CarbonMapper).

Authors: Alexis Groshenry (Kayrros); Clément Giron (Kayrros); Alexandre d'Aspremont (CNRS, DI, Ecole Normale Supérieure; Kayrros); Thomas Lauvaux (University of Reims Champagne Ardenne, GSMA, UMR 7331); Thibaud Ehret (Centre Borelli)

Computer Vision & Remote Sensing Earth Observation & Monitoring Climate Science & Modeling Meta- and Transfer Learning
NeurIPS 2022 Probabilistic forecasting of regional photovoltaic power production based on satellite-derived cloud motion (Papers Track)
Abstract and authors: (click to expand)

Abstract: Solar energy generation drastically increased in the last years, and it is expected to grow even more in the next decades. So, accurate intra-day forecasts are needed to improve the predictability of the photovoltaic power production and associated balancing measures to increase the shares of renewable energy in the power grid. Most forecasting methods require numerical weather predictions, which are slow to compute, or long-term datasets to run the forecast. These issues make the models difficult to implement in an operational setting. To overcome these problems, we propose a novel regional intraday probabilistic PV power forecasting model able to exploit only 2 hours of satellite-derived cloudiness maps to produce the ensemble forecast. The model is easy to implement in an operational setting as it is based on Pysteps, an already-operational Python library for precipitation nowcasting. With few adaptations of the Steps algorithm, we reached state-of-the-art performance, reaching a 71% lower RMSE than the Persistence model and a 50% lower CRPS than the Persistence Ensemble model for forecast lead times up to 4 hours.

Authors: Alberto Carpentieri (Bern University of Applied Science); Doris Folini (Institute for Atmospheric and Climate Science, ETH Zurich); Martin Wild (Institute for Atmospheric and Climate Science, ETH Zurich); Angela Meyer (Bern University of Applied Science)

Power & Energy Computer Vision & Remote Sensing Uncertainty Quantification & Robustness
NeurIPS 2022 Robustifying machine-learned algorithms for efficient grid operation (Papers Track)
Abstract and authors: (click to expand)

Abstract: We propose a learning-augmented algorithm, RobustML, for operation of dispatchable generation that exploits the good performance of a machine-learned algorithm while providing worst-case guarantees on cost. We evaluate the algorithm on a realistic two-generator system, where it exhibits robustness to distribution shift while enabling improved efficiency as renewable penetration increases.

Authors: Nicolas Christianson (California Institute of Technology); Christopher Yeh (California Institute of Technology); Tongxin Li (The Chinese University of Hong Kong (Shenzhen)); Mahdi Torabi Rad (Beyond Limits); Azarang Golmohammadi (Beyond Limits, Inc.); Adam Wierman (California Institute of Technology)

Uncertainty Quantification & Robustness Power & Energy Reinforcement Learning
NeurIPS 2022 Deep Hydrology: Hourly, Gap-Free Flood Maps Through Joint Satellite and Hydrologic Modelling (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change-driven weather disasters are rapidly increasing in both frequency and magnitude. Floods are the most damaging of these disasters, with approximately 1.46 billion people exposed to inundation depths of over 0.15m, a significant life and livelihood risk. Accurate knowledge of flood-extent for ongoing and historical events facilitates climate adaptation in flood-prone communities by enabling near real-time disaster monitoring to support planning, response, and relief during these extreme events. Satellite observations can be used to derive flood-extent maps directly; however, these observations are impeded by cloud and canopy cover, and can be very infrequent and hence miss the flood completely. In contrast, physically-based inundation models can produce spatially complete event maps but suffer from high uncertainty if not frequently calibrated with expensive land and infrastructure surveys. In this study, we propose a deep learning approach to reproduce satellite-observed fractional flood-extent maps given dynamic state variables from hydrologic models, fusing information contained within the states with direct observations from satellites. Our model has an hourly temporal resolution, contains no cloud-gaps, and generalizes to watersheds across the continental United States with a 6% error on held-out areas that never flooded before. We further demonstrate through a case study in Houston, Texas that our model can distinguish tropical cyclones that caused flooding from those that did not within two days of landfall, thereby providing a reliable source for flood-extent maps that can be used by disaster monitoring services.

Authors: Tanya Nair (Cloud To Street); Veda Sunkara (Cloud to Street); Jonathan Frame (Cloud to Street); Philip Popien (Cloud to Street); Subit Chakrabarti (Cloud To Street)

Computer Vision & Remote Sensing Climate Justice Climate Science & Modeling Disaster Management and Relief Extreme Weather Hybrid Physical Models
NeurIPS 2022 Machine learning emulation of a local-scale UK climate model (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change is causing the intensification of rainfall extremes. Precipitation projections with high spatial resolution are important for society to prepare for these changes, e.g. to model flooding impacts. Physics-based simulations for creating such projections are very computationally expensive. This work demonstrates the effectiveness of diffusion models, a form of deep generative models, for generating much more cheaply realistic high resolution rainfall samples for the UK conditioned on data from a low resolution simulation. We show for the first time a machine learning model that is able to produce realistic high-resolution rainfall predictions based on a physical model that resolves atmospheric convection, a key process behind extreme rainfall. By adding self-learnt, location-specific information to low resolution relative vorticity, quantiles and time-mean of the samples match well their counterparts from the high-resolution simulation.

Authors: Henry Addison (University of Bristol); Elizabeth Kendon (Met Office Hadley Centre); Suman Ravuri (DeepMind); Laurence Aitchison (University of Bristol); Peter Watson (Bristol)

Climate Science & Modeling Extreme Weather Generative Modeling
NeurIPS 2022 Bridging the Microwave Data Gap; Using Bayesian Deep Learning to “See” the Unseen (Papers Track)
Abstract and authors: (click to expand)

Abstract: Having microwave data with the spatial and temporal resolution of infrared data would provide a large positive impact on many climate and weather applications. We demonstrate that Bayesian deep learning is a promising technique for both creating and improving synthetic microwave data from infrared data. We report 0.7% mean absolute percentage error for 183+/-3 GHz microwave brightness temperature and uncertainty metrics and find that more training data is needed to achieve improved performance at 166 GHz, 37 GHz, and 23 GHz. Analysis of the spatial distribution of uncertainty reveals that additional cloud data will provide the greatest increase in skill, which will potentially allow for generation of many secondary products derived from microwave data in the future.

Authors: Pedro Ortiz (Naval Postgraduate School); Eleanor Casas (Naval Postgraduate School); Marko Orescanin (Naval Postgraduate School); Scott Powell (Naval Postgraduate School)

Earth Observation & Monitoring Climate Science & Modeling Extreme Weather Causal & Bayesian Methods Uncertainty Quantification & Robustness
NeurIPS 2022 Learning evapotranspiration dataset corrections from water cycle closure supervision (Papers Track)
Abstract and authors: (click to expand)

Abstract: Evapotranspiration (ET) is one of the most uncertain components of the global water cycle. Improving global ET estimates is needed to better our understanding of the global water cycle so as to forecast the consequences of climate change on the future of global water resource distribution. This work presents a methodology to derive monthly corrections of global ET datasets at 0.25 degree resolution. We use ML to generalize sparse catchment-level water cycle closure residual information to global and dense pixel-level residuals. Our model takes a probabilistic view on ET datasets and their correction that we use to regress catchment-level residuals using a sum-aggregated supervision. Using four global ET datasets, we show that our learned model has learned ET corrections that accurately generalize its water cycle-closure results to unseen catchments.

Authors: Tristan E.M Hascoet (Kobe University); Victor Pellet (LERMA); Filipe Aires (LERMA)

Earth Observation & Monitoring Climate Science & Modeling Causal & Bayesian Methods
NeurIPS 2022 Convolutional Neural Processes for Inpainting Satellite Images: Application to Water Body Segmentation (Papers Track)
Abstract and authors: (click to expand)

Abstract: The widespread availability of satellite images has allowed researchers to monitor the impact of climate on socio-economic and environmental issues through examples like crop and water body classification to measure food scarcity and risk of flooding. However, a common issue of satellite images is missing values due to measurement defects, which render them unusable by existing methods without data imputation. To repair the data, inpainting methods can be employed, which are based on classical PDEs or interpolation methods. Recently, deep learning approaches have shown promise in this realm, however many of these methods do not explicitly take into account the inherent spatio-temporal structure of satellite images. In this work, we cast satellite image inpainting as a meta-learning problem, and implement Convolutional Neural Processes (ConvNPs) in which we frame each satellite image as its own task or 2D regression problem. We show that ConvNPs outperform classical methods and state-of-the-art deep learning inpainting models on a scanline problem for LANDSAT 7 satellite images, assessed on a variety of in- and out-of-distribution images. Our results successfully match the performance of clean images on a downstream water body segmentation task in Canada.

Authors: Alexander Pondaven (Imperial College London); Mart Bakler (Imperial College London); Donghu Guo (Imperial College London); Hamzah Hashim (Imperial College London); Martin G Ignatov (Imperial college London); Samir Bhatt (Imperial College London); Seth Flaxman (Oxford); Swapnil Mishra (Imperial College London); Elie Alhajjar (USMA); Harrison Zhu (Imperial College London)

Computer Vision & Remote Sensing Earth Observation & Monitoring Generative Modeling Meta- and Transfer Learning
NeurIPS 2022 A POMDP Model for Safe Geological Carbon Sequestration (Papers Track)
Abstract and authors: (click to expand)

Abstract: Geological carbon capture and sequestration (CCS), where CO2 is stored in subsurface formations, is a promising and scalable approach for reducing global emissions.However, if done incorrectly, it may lead to earthquakes and leakage of CO2 back to the surface, harming both humans and the environment. These risks are exacerbated by the large amount of uncertainty in the structure of the storage formation. For these reasons, we propose that CCS operations be modeled as a partially observable Markov decision process (POMDP) and decisions be informed using automated planning algorithms. To this end, we develop a simplified model of CCS operations based on a 2D spillpoint analysis that retains many of the challenges and safety considerations of the real-world problem. We show how off-the-shelf POMDP solvers outperform expert baselines for safe CCS planning. This POMDP model can be used as a test bed to drive the development of novel decision-making algorithms for CCS operations.

Authors: Anthony Corso (Stanford University); Yizheng Wang (Stanford Univerity); Markus Zechner (Stanford University); Jef Caers (Stanford University); Mykel J Kochenderfer (Stanford University)

Carbon Capture & Sequestration Reinforcement Learning
NeurIPS 2022 Optimizing Japanese dam reservoir inflow forecast for efficient operation (Papers Track)
Abstract and authors: (click to expand)

Abstract: Despite a climate and topology favorable to hydropower (HP) generation, HP only accounts for 4% of today’s Japanese primary energy consumption mix. In recent years, calls for improving the efficiency of Japanese HP towards achieving a more sustainable energy mix have emerged from prominent voices in the Ministry of Land, Infrastructure, Transport and Tourism (MILT). Among potential optimizations, data-driven dam operation policies using accurate river discharge forecasts have been advocated for. In the meantime, Machine Learning (ML) has recently made important strides in hydrological modeling, with forecast accuracy improvements demonstrated on both precipitation nowcasting and river discharge prediction. We are motivated by the convergence of these societal and technological contexts: our final goal is to provide scientific evidence and actionable insights for dam infrastructure managers and policy makers to implement more energy-efficient and flood-resistant dam operation policies on a national scale. Towards this goal this work presents a preliminary study of ML-based dam inflow forecasts on a dataset of 127 Japanese public dams we assembled. We discuss our preliminary results and lay out a path for future studies.

Authors: Keisuke Yoshimi (Kobe University); Tristan E.M Hascoet (Kobe University); Rousslan F. Julien Dossa (Kobe University); Ryoichi Takashima (Kobe University); Tetsuya Takiguchi (Kobe University); Satoru Oishi (Kobe University)

Climate Science & Modeling Time-series Analysis
NeurIPS 2022 Deep Climate Change: A Dataset and Adaptive domain pre-trained Language Models for Climate Change Related Tasks (Papers Track)
Abstract and authors: (click to expand)

Abstract: The quantity and quality of literature around climate change (CC) and its impacts are increasing yearly. Yet, this field has received limited attention in the Natural Language Processing (NLP) community. With the help of large Language Models (LMs) and transfer learning, NLP can support policymakers, researchers, and climate activists in making sense of large-scale and complex CC-related texts. CC-related texts include specific language that general language models cannot represent accurately. Therefore we collected a climate change corpus consisting of over 360 thousand abstracts of top climate scientists' articles from trustable sources covering large temporal and spatial scales. Comparison of the performance of GPT2 LM and our 'climateGPT2 models', fine-tuned on the CC-related corpus, on claim generation (text generation) and fact-checking, downstream tasks show the better performance of the climateGPT2 models compared to the GPT2. The climateGPT2 models decrease the validation loss to 1.08 for claim generation from 43.4 obtained by GPT2. We found that climateGPT2 models improved the masked language model objective for the fact-checking task by increasing the F1 score from 0.67 to 0.72.

Authors: Saeid Vaghefi (University of Zürich); Veruska Muccione (University of Zürich); Christian Huggel (University of Zürich); Hamed Khashehchi (2w2e GmbH); Markus Leippold (University of Zurich)

Natural Language Processing Meta- and Transfer Learning
NeurIPS 2022 Data-Driven Optimal Solver for Coordinating a Sustainable and Stable Power Grid (Papers Track)
Abstract and authors: (click to expand)

Abstract: With today's pressing climate change concerns, the widespread integration of low-carbon technologies such as sustainable generation systems (e.g. photovoltaics, wind turbines, etc.) and flexible consumer devices (e.g. storage, electric vehicles, smart appliances, etc.) into the electric grid is vital. Although these power entities can be deployed at large, these are highly variable in nature and must interact with the existing grid infrastructure without violating electrical limits so that the system continues to operate in a stable manner at all times. In order to ensure the integrity of grid operations while also being economical, system operators will need to rapidly solve the optimal power flow (OPF) problem in order to adapt to these fluctuations. Inherent non-convexities in the OPF problem do not allow traditional model-based optimization techniques to offer guarantees on optimality, feasibility and convergence. In this paper, we propose a data-driven OPF solver built on information-theoretic and semi-supervised machine learning constructs. We show that this solver is able to rapidly compute solutions (i.e. in sub-second range) that are within 3\% of optimality with guarantees on feasibility on a benchmark IEEE 118-bus system.

Authors: Junfei Wang (York University); Pirathayini Srikantha (York University)

Power & Energy Climate Finance & Economics Unsupervised & Semi-Supervised Learning
NeurIPS 2022 Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation (Papers Track)
Abstract and authors: (click to expand)

Abstract: How can we learn from all available data when training machine-learnt climate models, without incurring any extra cost at simulation time? Typically, the training data comprises coarse-grained high-resolution data. But only keeping this coarse-grained data means the rest of the high-resolution data is thrown out. We use a transfer learning approach, which can be applied to a range of machine learning models, to leverage all the high-resolution data. We use three chaotic systems to show it stabilises training, gives improved generalisation performance and results in better forecasting skill. Our code is at https://github.com/raghul-parthipan/dont_waste_data

Authors: Raghul Parthipan (University of Cambridge); Damon Wischik (Univeristy of Cambridge)

Climate Science & Modeling Generative Modeling Meta- and Transfer Learning Time-series Analysis
NeurIPS 2022 Explainable Multi-Agent Recommendation System for Energy-Efficient Decision Support in Smart Homes (Papers Track)
Abstract and authors: (click to expand)

Abstract: Transparent, understandable, and persuasive recommendations support the electricity consumers’ behavioral change to tackle the energy efficiency problem. This paper proposes an explainable multi-agent recommendation system for load shifting for household appliances. First, we extend a novel multi-agent approach by designing an Explainability Agent that provides explainable recommendations for optimal appliance scheduling in a textual and visual manner. Second, we enhance the predictive capacity of other agents by including weather data and applying state-of-the-art models (i.e., k-nearest-neighbours, extreme gradient boosting, adaptive boosting, random forest, logistic regression, and explainable boosting machines). Since we want to help the user understand a single recommendation, we focus on local explainability approaches. In particular, we apply post-model approaches LIME (local, interpretable, model-agnostic explanation) and SHAP (Shapley additive explanations) as model-agnostic tools that can explain the predictions of the chosen classifiers. We further provide an overview of the predictive and explainability performance. Our results show a substantial improvement in the performance of the multi-agent system while at the same time opening up the “black box” of recommendations. To show the pathway to positive impact regarding climate change, we provide a discussion on the potential impact of the suggested approach.

Authors: Alona Zharova (Humboldt University of Berlin); Annika Boer (Humboldt University of Berlin); Julia Knoblauch (Humboldt University of Berlin); Kai Ingo Schewina (Humboldt University of Berlin); Jana Vihs (Humboldt University of Berlin)

Recommender Systems Buildings Power & Energy Interpretable ML
NeurIPS 2022 FIRO: A Deep-neural Network for Wildfire Forecast with Interpretable Hidden States (Papers Track)
Abstract and authors: (click to expand)

Abstract: Several wildfire danger systems have emerged from decades of research. One such system is the National Fire-Danger Rating System (NFDRS), which is used widely across the United States and is a key predictor in the Global ECMWF Fire Forecasting (GEFF) model. The NFDRS is composed of over 100 equations relating wildfire risk to weather conditions, climate and land cover characteristics, and fuel. These equations and the corresponding 130+ parameters were developed via field and lab experiments. These parameters, which are fixed in the standard NFDRS and GEFF implementations, may not be the most appropriate for a climate-changing world. In order to adjust the NFDRS parameters to current climate conditions and specific geographical locations, we recast NFDRS in PyTorch to create a new deep learning-based Fire Index Risk Optimizer (FIRO). FIRO predicts the ignition component, or the probability a wildfire would require suppression in the presence of a firebrand, and calibrates the uncertain parameters for a specific region and climate conditions by training on observed fires. Given the rare occurrence of wildfires, we employed the extremal dependency index (EDI) as the loss function. Using ERA5 reanalysis and MODIS burned area data, we trained FIRO models for California, Texas, Italy, and Madagascar. Across these four geographies, the average EDI improvement was 175% above the standard NFDRS implementation

Authors: Eduardo R Rodrigues (MSR); Campbell D Watson (IBM Reserch); Bianca Zadrozny (IBM Research); Gabrielle Nyirjesy (Columbia University)

Disaster Management and Relief Forests Interpretable ML
NeurIPS 2022 Towards a spatially transferable super resolution model for downscaling Antarctic surface melt (Papers Track)
Abstract and authors: (click to expand)

Abstract: Surface melt on the Antarctic Ice Sheet is an important climate indicator, yet the spatial scale of modeling and observing surface melt is insufficient to capture crucial details and understand local processes. High-resolution climate models could provide a solution, but they are computationally expensive and require finetuning for some model parameters. An alternative method, pioneering in geophysics, is single-image super resolution (SR) applied on lower-resolution model output. However, often input and output of such SR models are available on the same, fixed spatial domain. High-resolution model simulations over Antarctica are available only in some regions. To be able to apply an SR model elsewhere, we propose to make the single-image SR model physics-aware, using surface albedo and elevation as additional input. Our results show a great improvement in the spatial transferability of the conventional SR model. Although issues with the input satellite-derived albedo remain, adding physics awareness paves a way toward a spatially transferable SR model for downscaling Antarctic surface melt.

Authors: Zhongyang Hu (IMAU); Yao Sun (TUM); Peter Kuipers Munneke (IMAU); Stef Lhermitte (TU Delft); Xiaoxiang Zhu (Technical University of Munich,Germany)

Climate Science & Modeling Computer Vision & Remote Sensing
NeurIPS 2022 Forecasting European Ozone Air Pollution With Transformers (Papers Track)
Abstract and authors: (click to expand)

Abstract: Surface ozone is an air pollutant that contributes to hundreds of thousands of premature deaths annually. Accurate short-term ozone forecasts may allow improved policy to reduce the risk to health, such as air quality warnings. However, forecasting ozone is a difficult problem, as surface ozone concentrations are controlled by a number of physical and chemical processes which act on varying timescales. Accounting for these temporal dependencies appropriately is likely to provide more accurate ozone forecasts. We therefore deploy a state-of-the-art transformer-based model, the Temporal Fusion Transformer, trained on observational station data from three European countries. In four-day test forecasts of daily maximum 8-hour ozone, the novel approach is highly skilful (MAE = 4.6 ppb, R2 = 0.82), and generalises well to two European countries unseen during training (MAE = 4.9 ppb, R2 = 0.79). The model outperforms standard machine learning models on our data, and compares favourably to the published performance of other deep learning architectures tested on different data. We illustrate that the model pays attention to physical variables known to control ozone concentrations, and that the attention mechanism allows the model to use relevant days of past ozone concentrations to make accurate forecasts.

Authors: Seb Hickman (University of Cambridge); Paul Griffiths (University of Cambridge); Alex Archibald (University of Cambridge); Peer Nowack (Imperial College London); Elie Alhajjar (USMA)

Climate Science & Modeling Extreme Weather Health Public Policy Societal Adaptation & Resilience Interpretable ML Time-series Analysis
NeurIPS 2022 Stability Constrained Reinforcement Learning for Real-Time Voltage Control (Papers Track)
Abstract and authors: (click to expand)

Abstract: This paper is a summary of a recently submitted work. Deep Reinforcement Learning (DRL) has been recognized as a promising tool to address the challenges in real-time control of power systems. However, its deployment in real-world power systems has been hindered by a lack of explicit stability and safety guarantees. In this paper, we propose a stability constrained reinforcement learning method for real-time voltage control in both single-phase and three-phase distribution grids. The key idea underlying our approach is an explicitly constructed Lyapunov function that certifies stability. We demonstrate the effectiveness of our approach with IEEE test feeders, where the proposed method achieves the best overall performance, while always achieving voltage stability. In contrast, standard RL methods often fail to achieve voltage stability.

Authors: Jie Feng (UCSD); Yuanyuan Shi (University of California San Diego); Guannan Qu (Carnegie Mellon University); Steven Low (California Institute of Technology); Animashree Anandkumar (Caltech); Adam Wierman (California Institute of Technology)

Power & Energy Reinforcement Learning
NeurIPS 2022 Land Use Prediction using Electro-Optical to SAR Few-Shot Transfer Learning (Papers Track)
Abstract and authors: (click to expand)

Abstract: Satellite image analysis has important implications for land use, urbanization, and ecosystem monitoring. Deep learning methods can facilitate the analysis of different satellite modalities, such as electro-optical (EO) and synthetic aperture radar (SAR) imagery, by supporting knowledge transfer between the modalities to compensate for individual shortcomings. Recent progress has shown how distributional alignment of neural network embeddings can produce powerful transfer learning models by employing a sliced Wasserstein distance (SWD) loss. We analyze how this method can be applied to Sentinel-1 and -2 satellite imagery and develop several extensions toward making it effective in practice. In an application to few-shot Local Climate Zone (LCZ) prediction, we show that these networks outperform multiple common baselines on datasets with a large number of classes. Further, we provide evidence that instance normalization can significantly stabilize the training process and that explicitly shaping the embedding space using supervised contrastive learning can lead to improved performance.

Authors: Marcel Hussing (University of Pennsylvania); Karen Li (University of Pennsylvania); Eric Eaton (University of Pennsylvania)

Land Use Meta- and Transfer Learning Cities & Urban Planning Earth Observation & Monitoring
NeurIPS 2022 Exploring Randomly Wired Neural Networks for Climate Model Emulation (Papers Track)
Abstract and authors: (click to expand)

Abstract: Exploring the climate impacts of various anthropogenic emissions scenarios is key to making informed decisions for climate change mitigation and adaptation. State-of-the-art Earth system models can provide detailed insight into these impacts, but have a large associated computational cost on a per-scenario basis. This large computational burden has driven recent interest in developing cheap machine learning models for the task of climate model emulation. In this manuscript, we explore the efficacy of randomly wired neural networks for this task. We describe how they can be constructed and compare them to their standard feedforward counterparts using the ClimateBench dataset. Specifically, we replace the dense layers in multilayer perceptrons, convolutional neural networks, and convolutional long short-term memory networks with randomly wired ones and assess the impact on model performance for models with 1 million and 10 million parameters. We find average performance improvements of 4.2% across model complexities and prediction tasks, with substantial performance improvements of up to 16.4% in some cases. Furthermore, we find no significant difference in prediction speed between networks with standard feedforward dense layers and those with randomly wired layers. These findings indicate that randomly wired neural networks may be suitable direct replacements for traditional dense layers in many standard models.

Authors: William J Yik (Harvey Mudd College); Sam J Silva (The University of Southern California); Andrew Geiss (Pacific Northwest National Laboratory); Duncan Watson-Parris (University of Oxford)

Climate Science & Modeling Earth Observation & Monitoring Extreme Weather Computer Vision & Remote Sensing Time-series Analysis
NeurIPS 2022 Closing the Domain Gap -- Blended Synthetic Imagery for Climate Object Detection (Papers Track)
Abstract and authors: (click to expand)

Abstract: Object detection models have great potential to increase both the frequency and cost-efficiency of assessing climate-relevant infrastructure in satellite imagery. However, model performance can suffer when models are applied to stylistically different geographies. We propose a technique to generate synthetic imagery using minimal labeled examples of the target object at a low computational cost. Our technique blends example objects onto unlabeled images of the target domain. We show that including these synthetic images improves the average precision of a YOLOv3 object detection model when compared to a baseline and other popular domain adaptation techniques.

Authors: Caleb Kornfein (Duke University); Frank Willard (Duke University); Caroline Tang (Duke University); Yuxi Long (Duke University); Saksham Jain (Duke University); Jordan Malof (Duke University); Simiao Ren (Duke University); Kyle Bradbury (Duke University)

Computer Vision & Remote Sensing Meta- and Transfer Learning
NeurIPS 2022 SustainGym: A Benchmark Suite of Reinforcement Learning for Sustainability Applications (Papers Track)
Abstract and authors: (click to expand)

Abstract: The lack of standardized benchmarks for reinforcement learning (RL) in sustainability applications has made it difficult to both track progress on specific domains and identify bottlenecks for researchers to focus their efforts on. In this paper, we present SustainGym, a suite of two environments designed to test the performance of RL algorithms on realistic sustainability tasks. The first environment simulates the problem of scheduling decisions for a fleet of electric vehicle (EV) charging stations, and the second environment simulates decisions for a battery storage system bidding in an electricity market. We describe the structure and features of the environments and show that standard RL algorithms have significant room for improving performance. We discuss current challenges in introducing RL to real-world sustainability tasks, including physical constraints and distribution shift.

Authors: Christopher Yeh (California Institute of Technology); Victor Li (California Institute of Technology); Rajeev Datta (California Institute of Technology); Yisong Yue (Caltech); Adam Wierman (California Institute of Technology)

Reinforcement Learning Power & Energy
NeurIPS 2022 Remote estimation of geologic composition using interferometric synthetic-aperture radar in California’s Central Valley (Papers Track)
Abstract and authors: (click to expand)

Abstract: California's Central Valley is the national agricultural center, producing 1/4 of the nation’s food. However, land in the Central Valley is sinking at a rapid rate (as much as 20 cm per year) due to continued groundwater pumping. Land subsidence has a significant impact on infrastructure resilience and groundwater sustainability. In this study, we aim to identify specific regions with different temporal dynamics of land displacement and find relationships with underlying geological composition. Then, we aim to remotely estimate geologic composition using interferometric synthetic aperture radar (InSAR)-based land deformation temporal changes using machine learning techniques. We identified regions with different temporal characteristics of land displacement in that some areas (e.g., Helm) with coarser grain geologic compositions exhibited potentially reversible land deformation (elastic land compaction). We found a significant correlation between InSAR-based land deformation and geologic composition using random forest and deep neural network regression models. We also achieved significant accuracy with 1/4 sparse sampling to reduce any spatial correlations among data, suggesting that the model has the potential to be generalized to other regions for indirect estimation of geologic composition. Our results indicate that geologic composition can be estimated using InSAR-based land deformation data. In-situ measurements of geologic composition can be expensive and time consuming and may be impractical in some areas. The generalizability of the model sheds light on high spatial resolution geologic composition estimation utilizing existing measurements.

Authors: Kyongsik Yun (California Institute of Technology); Kyra Adams (California Institute of Technology); John Reager (California Institute of Technology); Zhen Liu (California Institute of Technology); Caitlyn Chavez (California Institute of Technology); Michael Turmon (California Institute of Technology); Thomas Lu (California Institute of Technology)

Earth Observation & Monitoring Interpretable ML
NeurIPS 2022 AutoML for Climate Change: A Call to Action (Papers Track)
Abstract and authors: (click to expand)

Abstract: The challenge that climate change poses to humanity has spurred a rapidly developing field of artificial intelligence research focused on climate change applications. The climate change ML (CCML) community works on a diverse, challenging set of problems which often involve physics-constrained ML or heterogeneous spatiotemporal data. It would be desirable to use automated machine learning (AutoML) techniques to automatically find high-performing architectures and hyperparameters for a given dataset. In this work, we benchmark popular Auto ML libraries on three high-leverage CCML applications: climate modeling, wind power forecasting, and catalyst discovery. We find that out-of-the-box AutoML libraries currently fail to meaningfully surpass the performance of human-designed CCML models. However, we also identify a few key weaknesses, which stem from the fact that most AutoML techniques are tailored to computer vision and NLP applications. For example, while dozens of search spaces have been designed for image and language data, none have been designed for spatiotemporal data. Addressing these key weaknesses can lead to the discovery of novel architectures that yield substantial performance gains across numerous CCML applications. Therefore, we present a call to action to the AutoML community, since there are a number of concrete, promising directions for future work in the space of AutoML for CCML. We release our code and a list of resources at https://github.com/climate-change-automl/climate-change-automl.

Authors: Renbo Tu (University of Toronto); Nicholas Roberts (University of Wisconsin-Madison); Vishak Prasad C (Indian Institute Of Technology, Bombay); Sibasis Nayak (Indian Institute of Technology, Bombay); Paarth Jain (Indian Institute of Technology Bombay); Frederic Sala (University of Wisconsin-Madison); Ganesh Ramakrishnan (IIT Bombay); Ameet Talwalkar (CMU); Willie Neiswanger (Stanford University); Colin White (Abacus.AI)

Climate Science & Modeling Chemistry & Materials Power & Energy
NeurIPS 2022 Temperature impacts on hate speech online: evidence from four billion tweets (Papers Track)
Abstract and authors: (click to expand)

Abstract: Human aggression is no longer limited to the physical space but exists in the form of hate speech on social media. Here, we examine the effect of temperature on the occurrence of hate speech on Twitter and interpret the results in the context of climate change, human behavior and mental health. Employing supervised machine learning models, we identify hate speech in a data set of four billion geolocated tweets from over 750 US cities (2014 – 2020). We statistically evaluate the changes in daily hate tweets against changes in local temperature, isolating the temperature influence from confounding factors using binned panel-regression models. We find a low prevalence of hate tweets in moderate temperatures and observe sharp increases of up to 12% for colder and up to 22% for hotter temperatures, indicating that not only hot but also cold temperatures increase aggressive tendencies. Further, we observe that for extreme temperatures hate speech also increases as a percentage of total tweeting activity, crowding out non-hate speech. The quasi-quadratic shape of the temperature-hate tweet curve is robust across varying climate zones, income groups, religious and political beliefs. The prevalence of the results across climatic and socioeconomic splits points to limits in adaptation. Our results illuminate hate speech online as an impact channel through which temperature alters societal aggression.

Authors: Annika Stechemesser (Potsdam Insitute for Climate Impact Research); Anders Levermann (Potsdam Institute for Climate Impact Research); Leonie Wenz (Potsdam Institute for Climate Impact Research)

Health Societal Adaptation & Resilience Data Mining Natural Language Processing
NeurIPS 2022 Cross Modal Distillation for Flood Extent Mapping (Papers Track)
Abstract and authors: (click to expand)

Abstract: The increasing intensity and frequency of floods is one of the many consequences of our changing climate. In this work, we explore ML techniques that improve the flood detection module of an operational early flood warning system. Our method exploits an unlabelled dataset of paired multi-spectral and Synthetic Aperture Radar (SAR) imagery to reduce the labeling requirements of a purely supervised learning method. Past attempts have used such unlabelled data by creating weak labels out of them, but end up learning the label mistakes in those weak labels. Motivated by knowledge distillation and semi supervised learning, we explore the use of a teacher to train a student with the help of a small hand labeled dataset and a large unlabelled dataset. Unlike the conventional self distillation setup, we propose a cross modal distillation framework that transfers supervision from a teacher trained on richer modality (multi-spectral images) to a student model trained on SAR imagery. The trained models are then tested on the Sen1Floods11 dataset. Our model outperforms the Sen1Floods11 SAR baselines by an absolute margin of 4.15% pixel wise Intersection-over-Union (IoU) on the test split.

Authors: Shubhika Garg (Google); Ben Feinstein (Google); Shahar Timnat (Google); Vishal V Batchu (Google); gideon dror (The Academic College of Tel-Aviv-Yaffo); Adi Gerzi Rosenthal (Google); Varun Gulshan (Google Research)

Computer Vision & Remote Sensing Disaster Management and Relief Earth Observation & Monitoring Unsupervised & Semi-Supervised Learning
NeurIPS 2022 Transformer Neural Networks for Building Load Forecasting (Papers Track)
Abstract and authors: (click to expand)

Abstract: Accurate electrical load forecasts of buildings are needed to optimize local energy storage and to make use of demand-side flexibility. We study the usage of Transformer neural networks for short-term electrical load forecasting of 296 buildings from a public dataset. Transformer neural networks trained on many buildings give the best forecasts on 115 buildings, and multi-layer perceptrons trained on a single building are better on 161 buildings. In addition, we evaluate the models on buildings that were not used for training, and find that Transformer neural networks generalize better than multi-layer perceptrons and our statistical baselines. This shows that the usage of Transformer neural networks for building load forecasting could reduce training resources due to the good generalization to unseen buildings, and they could be useful for cold-start scenarios.

Authors: Matthias Hertel (KIT); Simon Ott (KIT); Oliver Neumann (KIT); Benjamin Schäfer (KIT); Ralf Mikut (Karlsruhe Institute of Technology); Veit Hagenmeyer (Karlsruhe Institute of Technology (KIT))

Time-series Analysis Buildings
NeurIPS 2022 Estimating Chicago’s tree cover and canopy height using multi-spectral satellite imagery (Papers Track)
Abstract and authors: (click to expand)

Abstract: Information on urban tree canopies is fundamental to mitigating climate change as well as improving quality of life. Urban tree planting initiatives face a lack of up-to-date data about the horizontal and vertical dimensions of the tree canopy in cities. We present a pipeline that utilizes LiDAR data as ground-truth and then trains a multi-task machine learning model to generate reliable estimates of tree cover and canopy height in urban areas using multi-source multi-spectral satellite imagery for the case study of Chicago.

Authors: John Francis (University College London)

Cities & Urban Planning Ecosystems & Biodiversity Public Policy Computer Vision & Remote Sensing
NeurIPS 2022 Reconstruction of Grid Measurements in the Presence of Adversarial Attacks (Papers Track)
Abstract and authors: (click to expand)

Abstract: In efforts to mitigate the adverse effects of climate change, policymakers have set ambitious goals to reduce the carbon footprint of all sectors - including the electric grid. To facilitate this, sustainable energy systems like renewable generation must { be} deployed at high numbers throughout the grid. As these are highly variable in nature, the grid must be closely monitored so that system operators will have elevated situational awareness and can execute timely actions to maintain stable grid operations. With the widespread deployment of sensors like phasor measurement units (PMUs), an abundance of data is available for conducting accurate state estimation. However, due to the cyber-physical nature of the power grid, measurement data can be perturbed in an adversarial manner to enforce incorrect decision-making. In this paper, we propose a novel reconstruction method that leverages on machine learning constructs like CGAN and gradient search to recover the original states when subjected to adversarial perturbations. Experimental studies conducted on the practical IEEE 118-bus benchmark power system show that the proposed method can reduce errors due to perturbation by large margins (i.e. up to 100%).

Authors: Amirmohammad Naeini (York University); Samer El Kababji (Western University); Pirathayini Srikantha (York University)

Generative Modeling Unsupervised & Semi-Supervised Learning
NeurIPS 2022 Heat Demand Forecasting with Multi-Resolutional Representation of Heterogeneous Temporal Ensemble (Papers Track)
Abstract and authors: (click to expand)

Abstract: One of the primal challenges faced by utility companies is ensuring efficient supply with minimal greenhouse gas emissions. The advent of smart meters and smart grids provide an unprecedented advantage in realizing an optimised supply of thermal energies through proactive techniques such as load forecasting. In this paper, we propose a forecasting framework for heat demand based on neural networks where the time series are encoded as scalograms equipped with the capacity of embedding exogenous variables such as weather, and holiday/non-holiday. Subsequently, CNNs are utilized to predict the heat load multi-step ahead. Finally, the proposed framework is compared with other state-of-the-art methods, such as SARIMAX and LSTM. The quantitative results from retrospective experiments show that the proposed framework consistently outperforms the state-of-the-art baseline method with real-world data acquired from Denmark. A minimal mean error of 7.54% for MAPE and 417kW for RMSE is achieved with the proposed framework in comparison to all other methods.

Authors: Satyaki Chatterjee (Pattern Recognition Lab, FAU Erlangen-Nuremberg); Adithya Ramachandran (Pattern Recognition Lab, Friedrich Alexander University, Erlangen); Thorkil Flensmark Neergaard (Brønderslev Forsyning A/S); Andreas K Maier (Pattern Recognition Lab, FAU Erlangen-Nuremberg); Siming Bayer (Pattern Recognition Lab, Friedrich-Alexander University)

Time-series Analysis Cities & Urban Planning Power & Energy Interpretable ML
NeurIPS 2022 Generalized Ice Detection on Wind Turbine Rotor Blades with Neural Style Transfer (Papers Track)
Abstract and authors: (click to expand)

Abstract: Wind energy’s ability to liberate the world of conventional sources of energy relies on lowering the significant costs associated with the maintenance of wind turbines. Since icing events on turbine rotor blades are a leading cause of operational failures, identifying icing in advance is critical. Some recent studies focus on specific wind parks and fail to generalize to unseen scenarios (e.g. new rotor blade designs). We propose the utilisation of synthetic data augmentation via neural style transfer to improve the generalization of existing ice prediction models. We show that training models with augmented data that captures domain-invariant icing characteristics can help improve predictive performance across multiple wind parks. Through efficient identification of icing, this study can support preventive maintenance of wind energy sources by making them more reliable towards tackling climate change.

Authors: Joyjit Chatterjee (University of Hull); Maria Teresa Alvela Nieto (University of Bremen); Hannes Gelbhardt (University of Bremen); Nina Dethlefs (University of Hull); Jan Ohlendorf (University of Bremen); Klaus-Dieter Thoben (University of Bremen)

Power & Energy Computer Vision & Remote Sensing Meta- and Transfer Learning
NeurIPS 2022 Identifying Compound Climate Drivers of Forest Mortality with β-VAE (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change is expected to lead to higher rates of forest mortality. Forest mortality is a complex phenomenon driven by the interaction of multiple climatic variables at multiple temporal scales, further modulated by the current state of the forest (e.g. age, stem diameter, and leaf area index). Identifying the compound climate drivers of forest mortality would greatly improve understanding and projections of future forest mortality risk. Observation data are, however, limited in accuracy and sample size, particularly in regard to forest state variables and mortality events. In contrast, simulations with state-of-the-art forest models enable the exploration of novel machine learning techniques for associating forest mortality with driving climate conditions. Here we simulate 160,000 years of beech, pine and spruce forest dynamics with the forest model FORMIND. We then apply β-VAE to learn disentangled latent representations of weather conditions and identify those that are most likely to cause high forest mortality. The learned model successfully identifies three characteristic climate representations that can be interpreted as different compound drivers of forest mortality.

Authors: Mohit Anand (Helmholtz Centre for Environmental Research - UFZ); Lily-belle Sweet (Helmholtz Centre for Environmental Research - UFZ); Gustau Camps-Valls (Universitat de València); Jakob Zscheischler (Helmholtz Centre for Environmental Research - UFZ)

Generative Modeling Climate Science & Modeling Earth Observation & Monitoring Ecosystems & Biodiversity Extreme Weather Forests Interpretable ML Time-series Analysis
NeurIPS 2022 TCFD-NLP: Assessing alignment of climate disclosures using NLP for the financial markets (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate-related disclosure is increasing in importance as companies and stakeholders alike aim to reduce their environmental impact and exposure to climate-induced risk. Companies primarily disclose this information in annual or other lengthy documents where climate information is not the sole focus. To assess the quality of a company's climate-related disclosure, these documents, often hundreds of pages long, must be reviewed manually by climate experts. We propose a more efficient approach to assessing climate-related financial information. We construct a model leveraging TF-IDF, sentence transformers and multi-label k nearest neighbors (kNN). The developed model is capable of assessing alignment of climate disclosures at scale, with a level of granularity and transparency that will support decision-making in the financial markets with relevant climate information. In this paper, we discuss the data that enabled this project, the methodology, and how the resulting model can drive climate impact.

Authors: Rylen Sampson (Manifest Climate); Aysha Cotterill (Manifest Climate); Quoc Tien Au (Manifest Climate)

Climate Finance & Economics Natural Language Processing
NeurIPS 2022 Deep Learning for Rapid Landslide Detection using Synthetic Aperture Radar (SAR) Datacubes (Papers Track)
Abstract and authors: (click to expand)

Abstract: With climate change predicted to increase the likelihood of landslide events, there is a growing need for rapid landslide detection technologies that help inform emergency responses. Synthetic Aperture Radar (SAR) is a remote sensing technique that can provide measurements of affected areas independent from weather or lighting conditions. Usage of SAR, however, is hindered by domain knowledge that is necessary for the pre-processing steps and its interpretation requires expert knowledge. We provide simplified, pre-processed, machine-learning ready SAR datacubes for four globally located landslide events obtained from several Sentinel-1 satellite passes before and after a landslide triggering event together with segmentation maps of the landslides. From this dataset, using the Hokkaido, Japan datacube, we study the feasibility of SAR-based landslide detection with supervised deep learning (DL). Our results demonstrate that DL models can be used to detect landslides from SAR data, achieving an Area under the Precision-Recall curve exceeding 0.7. We find that additional satellite visits enhance detection performance, but that early detection is possible when SAR data is combined with terrain information from a digital elevation model. This can be especially useful for time-critical emergency interventions.

Authors: Vanessa Boehm (UC Berkeley); Wei Ji Leong (The Ohio State University); Ragini Bal Mahesh (German Aerospace Center DLR); Ioannis Prapas (National Observatory of Athens); Siddha Ganju (Nvidia); Freddie Kalaitzis (University of Oxford); Edoardo Nemni (United Nations Satellite Centre (UNOSAT)); Raul Ramos-Pollan (Universidad de Antioquia)

Disaster Management and Relief Earth Observation & Monitoring Computer Vision & Remote Sensing
NeurIPS 2022 Hybrid Recurrent Neural Network for Drought Monitoring (Papers Track)
Abstract and authors: (click to expand)

Abstract: Droughts are pervasive hydrometeorological phenomena and global hazards, whose frequency and intensity are expected to increase in the context of climate change. Drought monitoring is of paramount relevance. Here we propose a hybrid model for drought detection that integrates both climatic indices and data-driven models in a hybrid deep learning approach. We exploit time-series of multi-scale Standardized Precipitation Evapotranspiration Index together with precipitation and temperature as inputs. We introduce a dual-branch recurrent neural network with convolutional lateral connections for blending the data. Experimental and ablative results show that the proposed system outperforms both the considered drought index and purely data-driven deep learning models. Our results suggest the potential of hybrid models for drought monitoring and open the door to synergistic systems that learn from data and domain knowledge altogether.

Authors: Mengxue Zhang (Universitat de València); Miguel-Ángel Fernández-Torres (Universitat de València); Gustau Camps-Valls (Universitat de València)

Climate Science & Modeling Extreme Weather Computer Vision & Remote Sensing Hybrid Physical Models Time-series Analysis
NeurIPS 2022 Deep Learning for Global Wildfire Forecasting (Papers Track)
Abstract and authors: (click to expand)

Abstract: Climate change is expected to aggravate wildfire activity through the exacerbation of fire weather. Improving our capabilities to anticipate wildfires on a global scale is of uttermost importance for mitigating their negative effects. In this work, we create a global fire dataset and demonstrate a prototype for predicting the presence of global burned areas on a sub-seasonal scale with the use of segmentation deep learning models. Particularly, we present an open-access global analysis-ready datacube, which contains a variety of variables related to the seasonal and sub-seasonal fire drivers (climate, vegetation, oceanic indices, human-related variables), as well as the historical burned areas and wildfire emissions for 2001-2021. We train a deep learning model, which treats global wildfire forecasting as an image segmentation task and skillfully predicts the presence of burned areas 8, 16, 32 and 64 days ahead of time. Our work motivates the use of deep learning for global burned area forecasting and paves the way towards improved anticipation of global wildfire patterns.

Authors: Ioannis Prapas (National Observatory of Athens); Akanksha Ahuja (NOA); Spyros Kondylatos (National Observatory of Athens); Ilektra Karasante (National Observatory of Athens); Lazaro Alonso (Max Planck Institute for Biogeochemistry); Eleanna Panagiotou (Harokopio University of Athens); Charalampos Davalas (Harokopio University of Athens); Dimitrios Michail (Harokopio University of Athens); Nuno Carvalhais (Max Planck Institute for Biogeochemistry); Ioannis Papoutsis (National Observatory of Athens)

Earth Observation & Monitoring Climate Science & Modeling Disaster Management and Relief Extreme Weather Computer Vision & Remote Sensing
NeurIPS 2022 Causal Modeling of Soil Processes for Improved Generalization (Papers Track)
Abstract and authors: (click to expand)

Abstract: Measuring and monitoring soil organic carbon is critical for agricultural productivity and for addressing critical environmental problems. Soil organic carbon not only enriches nutrition in soil, but also has a gamut of co-benefits such as improving water storage and limiting physical erosion. Despite a litany of work in soil organic carbon estimation, current approaches do not generalize well across soil conditions and management practices. We empirically show that explicit modeling of cause-and-effect relationships among the soil processes improves the out-of-distribution generalizability of prediction models. We provide a comparative analysis of soil organic carbon estimation models where the skeleton is estimated using causal discovery methods. Our framework provide an average improvement of 81% in test mean squared error and 52% in test mean absolute error.

Authors: Somya Sharma (U. Minnesota); Swati Sharma (Microsoft Research); Emre Kiciman (Microsoft Research); Andy Neal (Rothamstead); Ranveer Chandra (Microsoft Research); John Crawford (University of Glasgow); Sara Malvar (Microsoft); Eduardo R Rodrigues (MSR)

Agriculture & Food Causal & Bayesian Methods
NeurIPS 2022 Machine Learning for Activity-Based Road Transportation Emissions Estimation (Papers Track)
Abstract and authors: (click to expand)

Abstract: Measuring and attributing greenhouse gas (GHG) emissions remains a challenging problem as the world strives towards meeting emissions reductions targets. As a significant portion of total global emissions, the road transportation sector represents an enormous challenge for estimating and tracking emissions at a global scale. To meet this challenge, we have developed a hybrid approach for estimating road transportation emissions that combines the strengths of machine learning and satellite imagery with localized emissions factors data to create an accurate, globally scalable, and easily configurable GHG monitoring framework.

Authors: Derek Rollend (JHU); Kevin Foster (JHU); Tomek Kott (JHU); Rohita Mocharla (JHU); Rodrigo Rene Rai Munoz Abujder (Johns Hopkins Applied Physics Laboratory); Neil Fendley (JHU/APL); Chace Ashcraft (JHU/APL); Frank Willard (JHU); Marisa Hughes (JHU)

Transportation Cities & Urban Planning Computer Vision & Remote Sensing
NeurIPS 2022 Bayesian State-Space SCM for Deforestation Baseline Estimation for Forest Carbon Credit (Papers Track) Best Paper: Pathway to Impact
Abstract and authors: (click to expand)

Abstract: In forest carbon credit, the concept of dynamic (or ex-post) baseline has been discussed to overcome the criticism of junk carbon credit, while an ex-ante baseline is still necessary in terms of project finance and risk assessment. We propose a Bayesian state-space SCM, which integrates both ex-ante and ex-post baseline estimation in a time-series causal inference framework. We apply the proposed model to a REDD+ project in Brazil, and show that it might have had a small, positive effect but had been over-credited and that the 90% predictive interval of the ex-ante baseline included the ex-post baseline, implying our ex-ante estimation can work effectively.

Authors: Keisuke Takahata (sustainacraft, Inc.); Hiroshi Suetsugu (sustainacraft, Inc.); Keiichi Fukaya (National Institute for Environmental Studies); Shinichiro Shirota (Hitotsubashi University)

Forests Causal & Bayesian Methods Land Use
NeurIPS 2022 Estimating Corporate Scope 1 Emissions Using Tree-Based Machine Learning Methods (Papers Track)
Abstract and authors: (click to expand)

Abstract: Companies worldwide contribute to climate change, emitting significant amounts of greenhouse gases (GHGs). Yet, most do not report their direct or Scope 1 emissions, resulting in a large data gap in corporate emissions. This study aims to fill this gap by training several decision-tree machine learning models to predict company-level Scope 1 emissions. Our results demonstrate that the Extreme Gradient Boosting and LightGBM models perform best, where the former shows a 19% improvement in prediction error over a benchmark model. Our model is also of reduced complexity and greater computational efficiency; it does not require meta-learners and is trained on a smaller number of features, for which data is more common and accessible compared to prior works. Our features are uniquely chosen based on concepts of environmental pollution in economic theory. Predicting corporate emissions with machine learning can be used as a gap-filling approach, which would allow for better GHG accounting and tracking, thus facilitating corporate decarbonization efforts in the long term. It can also impact representations of a company’s carbon performance and carbon risks, thereby helping to funnel investments towards companies with lower emissions and those making true efforts to decarbonize.

Authors: Elham Kheradmand (University of Montreal); Maida Hadziosmanovic (Concordia University); Nazim Benguettat (Concordia); H. Damon Matthews (Concordia University); Shannon M. Lloyd (Concordia University)

Climate Finance & Economics Interpretable ML
NeurIPS 2022 Analyzing Micro-Level Rebound Effects of Energy Efficient Technologies (Papers Track)
Abstract and authors: (click to expand)

Abstract: Energy preservation is central to prevent resource depletion, climate change and environment degradation. Investment in raising efficiency of appliances is among the most significant attempts to save energy. Ironically, introduction of many such energy saving appliances increased the total energy consumption instead of reducing it. This effect in literature is attributed to the inherent Jevons paradox (JP) and optimism bias (OB) in consumer behavior. However, the magnitude of these instincts vary among different people. Identification of this magnitude for each household can enable the development of appropriate policies that induce desired energy saving behaviour. Using the RECS 2015 dataset, the paper uses machine learning for each electrical appliance to determine the dependence of their total energy consumption on their energy star rating. This shows that only substitutable appliances register increase in energy demand upon boosted efficiency. Lastly, an index is noted to indicate the varying influence of JP and OB on different households.

Authors: Mayank Jain (University College Dublin); Mukta Jain (Delhi School of Economics); Tarek T. Alskaif (Wageningen University); Soumyabrata Dev (University College Dublin)

Behavioral and Social Science Cities & Urban Planning Power & Energy
NeurIPS 2022 Comparing the carbon costs and benefits of low-resource solar nowcasting (Papers Track)
Abstract and authors: (click to expand)

Abstract: Mitigating emissions in line with climate goals requires the rapid integration of low carbon energy sources, such as solar photovoltaics (PV) into the electricity grid. However, the energy produced from solar PV fluctuates due to clouds obscuring the sun's energy. Solar PV yield nowcasting is used to help anticipate peaks and troughs in demand to support grid integration. This paper compares multiple low-resource approaches to nowcasting solar PV yield. To do so, we use a dataset of UK satellite imagery and solar PV energy readings over a 1 to 4-hour time range. Our work investigates the performance of multiple nowcasting models. The paper also estimates the carbon emissions generated and averted by deploying models such as these, and finds that short-term PV forecasting may have a benefit several orders of magnitude greater than its carbon cost and that this benefit holds for small models that could be deployable in low-resource settings around the globe.

Authors: Ben Dixon (UCL); Jacob Bieker (Open Climate Fix); Maria Perez-Ortiz (University College London)

Computer Vision & Remote Sensing Climate Science & Modeling Time-series Analysis
NeurIPS 2022 Climate Policy Tracker: Pipeline for automated analysis of public climate policies (Papers Track)
Abstract and authors: (click to expand)

Abstract: The number of standardized policy documents regarding climate policy and their publication frequency is significantly increasing. The documents are long and tedious for manual analysis, especially for policy experts, lawmakers, and citizens who lack access or domain expertise to utilize data analytics tools. Potential consequences of such a situation include reduced citizen governance and involvement in climate policies and an overall surge in analytics costs, rendering less accessibility for the public. In this work, we use a Latent Dirichlet Allocation-based pipeline for the automatic summarization and analysis of 10-years of national energy and climate plans (NECPs) for the period from 2021 to 2030, established by 27 Member States of the European Union. We focus on analyzing policy framing, the language used to describe specific issues, to detect essential nuances in the way governments frame their climate policies and achieve climate goals. The methods leverage topic modeling and clustering for the comparative analysis of policy documents across different countries. It allows for easier integration in potential user-friendly applications for the development of theories and processes of climate policy. This would further lead to better citizen governance and engagement over climate policies and public policy research.

Authors: Artur Żółkowski (Warsaw University of Technology); Mateusz Krzyziński (Warsaw University of Technology); Piotr Wilczyński (Warsaw University of Technology); Stanisław Giziński (University of Warsaw); Emilia Wiśnios (University of Warsaw); Bartosz Pieliński (University of Warsaw); Julian Sienkiewicz (Warsaw University of Technology); Przemysław Biecek (Warsaw University of Technology)

Public Policy Natural Language Processing
NeurIPS 2022 Topic correlation networks inferred from open-ended survey responses reveal signatures of ideology behind carbon tax opinion (Papers Track)
Abstract and authors: (click to expand)

Abstract: Ideology can often render policy design ineffective by overriding what, at face value, are rational incentives. A timely example is carbon pricing, whose public support is strongly influenced by ideology. As a system of ideas, ideology expresses itself in the way people explain themselves and the world. As an object of study, ideology is then amenable to a generative modelling approach within the text-as-data paradigm. Here, we analyze the structure of ideology underlying carbon tax opinion using topic models. An idea, termed a topic, is operationalized as the fixed set of proportions with which words are used when talking about it. We characterize ideology through the relational structure between topics. To access this latent structure, we use the highly expressive Structural Topic Model to infer topics and the weights with which individual opinions mix topics. We fit the model to a large dataset of open-ended survey responses of Canadians elaborating on their support of or opposition to the tax. We propose and evaluate statistical measures of ideology in our data, such as dimensionality and heterogeneity. Finally, we discuss the implications of the results for transition policy in particular, and of our approach to analyzing ideology for computational social science in general.

Authors: Maximilian Puelma Touzel (Mila)

Behavioral and Social Science Climate Finance & Economics Public Policy Generative Modeling Natural Language Processing
NeurIPS 2022 Controllable Generation for Climate Modeling (Papers Track)
Abstract and authors: (click to expand)

Abstract: Recent years have seen increased interest in modeling future climate trends, especially from the point of view of accurately predicting, understanding and mitigating downstream impacts. For instance, current state-of-the-art process-based agriculture models rely on high-resolution climate data during the growing season for accurate estimation of crop yields. However, high-resolution climate data for future climates is unavailable and needs to be simulated, and that too for multiple possible climate scenarios, which becomes prohibitively expensive via traditional methods. Meanwhile, deep generative models leveraging the expressivity of neural networks have shown immense promise in modeling distributions in high dimensions. Here, we cast the problem of simulation of climate scenarios in a generative modeling framework. Specifically, we leverage the GAN (Generative Adversarial Network) framework for simulating synthetic climate scenarios. We condition the model by quantifying the degree of ``extremeness" of the observed sample, which allows us to sample from different parts of the distribution. We demonstrate the efficacy of the proposed method on the CHIRPS precipitation dataset.

Authors: Moulik Choraria (University of Illinois at Urbana-Champaign); Daniela Szwarcman (IBM Research); Bianca Zadrozny (IBM Research); Campbell D Watson (IBM Reserch); Lav Varshney (UIUC: ECE)

Climate Science & Modeling Extreme Weather Generative Modeling
NeurIPS 2022 Learn to Bid: Deep Reinforcement Learning with Transformer for Energy Storage Bidding in Energy and Contingency Reserve Markets (Papers Track)
Abstract and authors: (click to expand)

Abstract: As part of efforts to tackle climate change, grid-scale battery energy storage systems (BESS) play an essential role in facilitating reliable and secure power system operation with variable renewable energy (VRE). BESS can balance time-varying electricity demand and supply in the spot market through energy arbitrage and in the frequency control ancillary services (FCAS) market through service enablement or delivery. Effective algorithms are needed for the optimal participation of BESS in multiple markets. Using deep reinforcement learning (DRL), we present a BESS bidding strategy in the joint spot and contingency FCAS markets, leveraging a transformer-based temporal feature extractor to exploit the temporal trends of volatile energy prices. We validate our strategy on real-world historical energy prices in the Australian National Electricity Market (NEM). We demonstrate that the novel DRL-based bidding strategy significantly outperforms benchmarks. The simulation also reveals that the joint bidding in both the spot and contingency FCAS markets can yield a much higher profit than in individual markets. Our work provides a viable use case for the BESS, contributing to the power system operation with high penetration of renewables.

Authors: Jinhao Li (Monash University); Changlong Wang (Monash University); Yanru Zhang (University of Electronic Science and Technology of China); Hao Wang (Monash University)

Power & Energy