Climate Change AI Workshop Papers
On this page, we show accepted works from all our workshops on "Tackling Climate Change with Machine Learning."
Venue | Title | Subject Areas |
---|---|---|
ICLR 2023 |
Tutorial: Quantus x Climate - Applying explainable AI evaluation in climate science
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: Explainable artificial intelligence (XAI) methods shed light on the predictions of deep neural networks (DNNs). In the climate context, XAI has been applied to improve and validate deep learning (DL) methods while providing researchers with new insight into physical processes. However, the evaluation, validation and selection of XAI methods are challenging due to often lacking ground truth explanations. In this tutorial, we introduce the XAI evaluation package Quantus to the climate community. We start by providing the users with pre-processed input and output data alongside a convolutional neural network (CNN) trained to assign yearly temperature maps to classes according to their decade. We explain the network prediction of an example temperature map using five different explanation techniques Gradient GradientShap, IntegratedGradients, LRP-z and Occlusion. By visually analyzing each explanation method around the North Atlantic (NA) cooling patch 10-80W, 20-60N, we provide a motivating example that shows that different explanations may disagree in their explained evidence which subsequently can lead to different scientific interpretation and potentially, misleading conclusions. We continue by introducing Quantus including the explanation properties that can be evaluated such as robustness, faithfulness, complexity, localization and randomization. We guide the participants towards a practical understanding of XAI evaluation by demonstrating how metrics differ in their scoring and interpretation. Moreover, we teach the participants to compare and select an appropriate XAI method by performing a comprehensive XAI evaluation. Lastly, we return to the motivating example, highlighting how Quantus can facilitate well-founded XAI research in climate science. Authors: Philine L Bommer (TU Berlin); Anna Hedström (Technische Universität Berlin); Marlene Kretschmer (University of Reading); Marina M.-C. Höhne (TU Berlin) |
Interpretable ML Climate science and climate modeling Classification, regression, and supervised learning |
ICLR 2023 |
CityLearn: A Tutorial on Reinforcement Learning Control for Grid-Interactive Efficient Buildings and Communities
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: Buildings are responsible for up to 75% of electricity consumption in the United States. Grid-Interactive Efficient Buildings can provide flexibility to solve the issue of power supply-demand mismatch, particularly brought about by renewables. Their high energy efficiency and self-generating capabilities can reduce demand without affecting the building function. Additionally, load shedding and shifting through smart control of storage systems can further flatten the load curve and reduce grid ramping cost in response to rapid decrease in renewable power supply. The model-free nature of reinforcement learning control makes it a promising approach for smart control in grid-interactive efficient buildings, as it can adapt to unique building needs and functions. However, a major challenge for the adoption of reinforcement learning in buildings is the ability to benchmark different control algorithms to accelerate their deployment on live systems. CityLearn is an open source OpenAI Gym environment for the implementation and benchmarking of simple and advanced control algorithms, e.g., rule-based control, model predictive control or deep reinforcement learning control thus, provides solutions to this challenge. This tutorial leverages CityLearn to demonstrate different control strategies in grid-interactive efficient buildings. Participants will learn how to design three controllers of varying complexity for battery management using a real-world residential neighborhood dataset to provide load shifting flexibility. The algorithms will be evaluated using six energy flexibility, environmental and economic key performance indicators, and their benefits and shortcomings will be identified. By the end of the tutorial, participants will acquire enough familiarity with the CityLearn environment for extended use in new datasets or personal projects. Authors: Kingsley E Nweye (The University of Texas at Austin); Allen Wu (The University of Texas at Austin); Hyun Park (The University of Texas at Austin); Yara Almilaify (The University of Texas at Austin); Zoltan Nagy (The University of Texas at Austin) |
Buildings Cities and urban planning Power and energy systems Reinforcement learning and control Time-series analysis |
ICLR 2023 |
Smart Meter Data Analytics: Practical Use-Cases and Best Practices of Machine Learning Applications for Energy Data in the Residential Sector
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: To cope with climate change, the energy system is undergoing a massive transformation. With the electrification of all sectors, the power grid is facing high additional demand. As a result, the digitization of the grid is becoming more of a focus. The smart grid relies heavily on the increasing deployment of smart electricity meters around the world. The corresponding smart meter data is typically a time series of power or energy measurements with a resolution of 1s to 60 min. This data provides valuable insights and opportunities for monitoring and controlling activities in the power grid. In this tutorial, we therefore provide an overview of best practices for analyzing smart meter data. We focus on machine learning applications and low resolution (15-60 minutes) energy data in a residential setting. We only use real-world datasets and cover use-cases that are highly relevant for practical applications. Although this tutorial is specifically tailored to an audience from the energy domain, we believe that anyone from the data analytics and machine learning community can benefit from it, as many techniques are applicable to any time series data. Through our tutorial, we hope to foster new ideas, contribute to an interdisciplinary exchange between different research fields, and educate people about energy use. Authors: Tobias Brudermueller (ETH Zurich); Markus Kreft (ETH Zurich) |
Power and energy systems Buildings Time-series analysis |
ICLR 2023 |
Mitigating climate and health impact of small-scale kiln industry using multi-spectral classifier and deep learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Industrial air pollution has a direct health impact and is a major contributor to climate change. Small scale industries particularly bull-trench brick kilns are one of the major causes of air pollution in South Asia often creating hazardous levels of smog that is injurious to human health. To mitigate the climate and health impact of the kiln industry, fine-grained kiln localization at different geographic locations is needed. Kiln localization using multi-spectral remote sensing data such as vegetation index results in a noisy estimates whereas use of high-resolution imagery is infeasible due to cost and compute complexities. This paper proposes a fusion of spatio-temporal multi-spectral data with high-resolution imagery for detection of brick kilns within the "Brick-Kiln-Belt" of South Asia. We first perform classification using low-resolution spatio-temporal multi-spectral data from Sentinel-2 imagery by combining vegetation, burn, build up and moisture indices. Then orientation aware object detector: YOLOv3 (with theta value) is implemented for removal of false detections and fine-grained localization. Our proposed technique, when compared with other benchmarks, results in a 21 times improvement in speed with comparable or higher accuracy when tested over multiple countries. Authors: Usman Nazir (Lahore University of Management Sciences); Murtaza Taj (Lahore University of Management Sciences); Momin Uppal (Lahore University of Management Sciences); Sara khalid (University of Oxford) |
Earth observations and monitoring Health |
ICLR 2023 |
Global Flood Prediction: a Multimodal Machine Learning Approach
(Papers Track)
Abstract and authors: (click to expand)Abstract: Flooding is one of the most destructive and costly natural disasters, and climate changes would further increase risks globally. This work presents a novel mul- timodal machine learning approach for multi-year global flood risk prediction, combining geographical information and historical natural disaster dataset. Our multimodal framework employs state-of-the-art processing techniques to extract embeddings from each data modality, including text-based geographical data and tabular-based time-series data. Experiments demonstrate that a multimodal ap- proach, that is combining text and statistical data, outperforms a single-modality approach. Our most advanced architecture, employing embeddings extracted us- ing transfer learning upon DistilBert model, achieves 75%-77% ROCAUC score in predicting the next 1-5 year flooding event in historically flooded locations. This work demonstrates the potentials of using machine learning for long-term planning in natural disaster management Authors: Cynthia Zeng (MIT); Dimitris Bertsimas (MIT) |
Extreme weather Disaster management and relief |
ICLR 2023 |
Improving global high-resolution Earth system model simulations of precipitation with generative adversarial networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: Precipitation extremes are expected to become stronger and more frequent in response to anthropogenic global warming. Accurately projecting the ecological and socioeconomic impacts is an urgent task. Impact models are developed and calibrated with observation-based data but rely on Earth system model (ESM) output for future scenarios. ESMs, however, exhibit significant biases in their output because they cannot fully resolve complex cross-scale interactions of processes that produce precipitation cannot. State-of-the-art bias correction methods only address errors in the simulated frequency distributions, locally at every individual grid cell. Improving unrealistic spatial patterns of the ESM output, which would require spatial context, has not been possible so far. Here we show that a post-processing method based on physically constrained generative adversarial networks (GANs) can correct biases of a state-of-the-art global ESM both in local frequency distributions and in the spatial patterns at once. While our method improves local frequency distributions similarly well to a gold-standard ESM bias-adjustment framework, it strongly outperforms existing methods in correcting spatial patterns. Our study highlights the importance of physical constraints in neural networks for out-of-sample predictions in the context of climate change. Authors: Philipp Hess (Technical University of Munich) |
Generative modeling Earth science Extreme weather Computer vision and remote sensing Hybrid physical models |
ICLR 2023 |
Machine Learning for Advanced Building Construction
(Papers Track)
Abstract and authors: (click to expand)Abstract: High-efficiency retrofits can play a key role in reducing carbon emissions associated with buildings if processes can be scaled-up to reduce cost, time, and disruption. Here we demonstrate an artificial intelligence/computer vision (AI/CV)-enabled framework for converting exterior build scans and dimensional data directly into manufacturing and installation specifications for overclad panels. In our workflow point clouds associated with LiDAR-scanned buildings are segmented into a facade feature space, vectorized features are extracted using an iterative random-sampling consensus algorithm, and from this representation an optimal panel design plan satisfying manufacturing constraints is generated. This system and the corresponding construction process is demonstrated on a test facade structure constructed at the National Renewable Energy Laboratory (NREL). We also include a brief summary of a techno-economic study designed to estimate the potential energy and cost impact of this new system. Authors: Hilary Egan (NREL); Clement Fouquet (Trimble Inc.); Chioke Harris (NREL) |
Buildings Classification, regression, and supervised learning |
ICLR 2023 |
Coregistration of Satellite Image Time Series Through Alignment of Road Networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: Due to climate change, thawing permafrost affects transportation infrastructure in northern regions. Tracking deformations over time of these structures can allow identifying the most vulnerable sections to permafrost degradation and implement climate adaptation strategies. The Sentinel-2 mission provides data well-suited for multitemporal analysis due to its high temporal resolution and multispectral coverage. However, the geometrical misalignment of Sentinel-2 imagery makes this analysis challenging. Towards the goal of estimating the deformation of linear infrastructure in northern Canada, we propose an automatic subpixel coregistration algorithm for satellite image time series based on the matching of binary masks of roads produced by a deep learning model. We demonstrate the feasibility of achieving subpixel coregistration through alignment of roads on a small dataset of high-resolution Sentinel-2 images from the region of Gillam in northern Canada. This is the first step towards training a road deformation prediction model. Authors: Andres Felipe Perez Murcia (University of Manitoba); Pooneh Maghoul (University of Manitoba); Ahmed Ashraf (University of Manitoba) |
Earth observations and monitoring Transportation Computer vision and remote sensing |
ICLR 2023 |
Estimating Residential Solar Potential using Aerial Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Project Sunroof estimates the solar potential of residential buildings using high quality aerial data. That is, it estimates the potential solar energy (and associated financial savings) that can be captured by buildings if solar panels were to be installed on their roofs. Unfortunately its coverage is limited by the lack of high resolution digital surface map (DSM) data. We present a deep learning approach that bridges this gap by enhancing widely available low-resolution data, thereby dramatically increasing the coverage of Sunroof. We also present some ongoing efforts to potentially improve accuracy even further by replacing certain algorithmic components of Sunroof’s processing pipeline with deep learning. Authors: Ross Goroshin (Google); Carl Elkin (Google) |
Power and energy systems Computer vision and remote sensing |
ICLR 2023 |
Improving extreme weather events detection with light-weight neural networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: To advance automated detection of extreme weather events, which are increasing in frequency and intensity with climate change, we explore modifications to a novel light-weight Context Guided convolutional neural network architecture trained for semantic segmentation of tropical cyclones and atmospheric rivers in climate data. Our primary focus is on tropical cyclones, the most destructive weather events, for which current models show limited performance. We investigate feature engineering, data augmentation, learning rate modifications, alternative loss functions, and architectural changes. In contrast to previous approaches optimizing for intersection over union, we specifically seek to improve recall to penalize under-counting and prioritize identification of tropical cyclones. We report success through the use of weighted loss functions to counter class imbalance for these rare events. We conclude with directions for future research on extreme weather events detection, a crucial task for prediction, mitigation, and equitable adaptation to the impacts of climate change. Authors: Romain Lacombe (Stanford University); Hannah Grossman (Stanford); Lucas P Hendren (Stanford University); David Ludeke (Stanford University) |
Extreme weather Climate science and climate modeling Earth science Classification, regression, and supervised learning Computer vision and remote sensing |
ICLR 2023 |
CaML: Carbon Footprinting of Products with Zero-Shot Semantic Text Similarity
(Papers Track)
Abstract and authors: (click to expand)Abstract: Estimating the embodied carbon in products is a key step towards understanding their impact, and undertaking mitigation actions. Precise carbon attribution is challenging at scale, requiring both domain expertise and granular supply chain data. As a first-order approximation, standard reports use Economic Input-Output based Life Cycle Assessment (EIO-LCA) which estimates carbon emissions per dollar at an industry sector level using transactions between different parts of the economy. For EIO-LCA, an expert needs to map each product to one of upwards of 1000 potential industry sectors. We present CaML, an algorithm to automate EIO-LCA using semantic text similarity matching by leveraging the text descriptions of the product and the industry sector. CaML outperforms the previous manually intensive method, yielding a MAPE of 22% with no domain labels. Authors: Bharathan Balaji (Amazon); Venkata Sai Gargeya Vunnava (amazon); Geoffrey Guest (Amazon); Jared Kramer (Amazon) |
Natural language processing Supply chains Classification, regression, and supervised learning |
ICLR 2023 |
Graph Neural Network Generated Metal-Organic Frameworks for Carbon Capture
(Proposals Track)
Abstract and authors: (click to expand)Abstract: The level of carbon dioxide (CO2) in our atmosphere is rapidly rising and is projected to double today‘s levels to reach 1,000 ppm by 2100 under certain scenarios, primarily driven by anthropogenic sources. Technology that can capture CO2 from anthropogenic sources, remove from atmosphere and sequester it at the gigaton scale by 2050 is required stop and reverse the impact of climate change. Metal-organic frameworks (MOFs) have been a promising technology in various applications including gas separation as well as CO2 capture from point-source flue gases or removal from the atmosphere. MOFs offer unmatched surface area through their highly porous crystalline structure and MOF technology has potential to become a leading adsorption-based CO2 separation technology providing high surface area, structure stability and chemical tunability. Due to its complex structure, MOF crystal structure (atoms and bonds) cannot be easily represented in tabular format for machine learning (ML) applications whereas graph neural networks (GNN) have already been explored in representation of simpler chemical molecules. In addition to difficulty in MOF data representation, an infinite number of combinations can be created for MOF crystals, which makes ML applications more suitable to alleviate dependency on subject matter experts (SME) than conventional computational methods. In this work, we propose training of GNNs in variational autoencoder (VAE) setting to create an end-to-end workflow for the generation of new MOF crystal structures directly from the data within the crystallographic information files (CIFs) and conditioned by additional CO2 performance values. Authors: Zikri Bayraktar (Schlumberger Doll Research); Shahnawaz Molla (Schlumberger Doll Research); Sharath Mahavadi (Schlumberger Doll Research) |
Generative modeling Materials science and discovery Other |
ICLR 2023 |
Long-lead forecasts of wintertime air stagnation index in southern China using oceanic memory effects
(Papers Track)
Abstract and authors: (click to expand)Abstract: Stagnant weather condition is one of the major contributors to air pollution as it is favorable for the formation and accumulation of pollutants. To measure the atmosphere’s ability to dilute air pollutants, Air Stagnation Index (ASI) has been introduced as an important meteorological index. Therefore, making long-lead ASI forecasts is vital to make plans in advance for air quality management. In this study, we found that autumn Niño indices derived from sea surface temperature (SST) anomalies show a negative correlation with wintertime ASI in southern China, offering prospects for a prewinter forecast. We developed an LSTM-based model to predict the future wintertime ASI. Results demonstrated that multivariate inputs (past ASI and Niño indices) achieve better forecast performance than univariate input (only past ASI). The model achieves a correlation coefficient of 0.778 between the actual and predicted ASI, exhibiting a high degree of consistency. Authors: Chenhong Zhou (Hong Kong Baptist University); Xiaorui Zhang (Hong Kong Baptist University); Meng Gao (Hong Kong Baptist University); Shanshan Liu (University of science and technology of China); Yike Guo (Hong Kong University of Science and Technology); Jie Chen (Hong Kong Baptist University) |
Climate science and climate modeling Earth science Extreme weather Classification, regression, and supervised learning Time-series analysis |
ICLR 2023 |
Attention-based Domain Adaptation Forecasting of Streamflow in Data-Sparse Regions
(Papers Track)
Abstract and authors: (click to expand)Abstract: Streamflow forecasts are critical to guide water resource management, mitigate drought and flood effects, and develop climate-smart infrastructure and governance. Many global regions, however, have limited streamflow observations to guide evidence-based management strategies. In this paper, we propose an attention-based domain adaptation streamflow forecaster for data-sparse regions. Our approach leverages the hydrological characteristics of a data-rich source domain to induce effective 24hr lead-time streamflow prediction in a data-constrained target domain. Specifically, we employ a deep-learning framework leveraging domain adaptation techniques to simultaneously train streamflow predictions and discern between both domains using an adversarial method. Experiments against baseline cross-domain forecasting models show improved performance for 24hr lead-time streamflow forecasting. Authors: Roland R Oruche (University of Missouri-Columbia); Fearghal O'Donncha (IBM Research) |
Meta- and transfer learning Disaster management and relief Time-series analysis |
ICLR 2023 |
Predicting Cycling Traffic in Cities: Is bike-sharing data representative of the cycling volume?
(Proposals Track)
Abstract and authors: (click to expand)Abstract: A higher share of cycling in cities can lead to a reduction in greenhouse gas emissions, a decrease in noise pollution, and personal health benefits. Data-driven approaches to planning new infrastructure to promote cycling are rare, mainly because data on cycling volume are only available selectively. By leveraging new and more granular data sources, we predict bicycle count measurements in Berlin, using data from free-floating bike-sharing systems in addition to weather, vacation, infrastructure, and socioeconomic indicators. To reach a high prediction accuracy given the diverse data, we make use of machine learning techniques. Our goal is to ultimately predict traffic volume on all streets beyond those with counters and to understand the variance in feature importance across time and space. Results indicate that bike-sharing data are valuable to improve the predictive performance, especially in cases with high outliers, and help generalize the models to new locations. Authors: Silke K. Kaiser (Hertie School) |
Transportation Classification, regression, and supervised learning |
ICLR 2023 |
Unsupervised machine learning techniques for multi-model comparison: A case study on Antarctic Intermediate Water in CMIP6 models
(Papers Track)
Abstract and authors: (click to expand)Abstract: The Climate Model Intercomparison Project provides access to ensembles of model experiments that are widely used to better understand past, present, and future climate changes. In this study, we use Principal Component Analysis and K-means and hierarchical clustering techniques to guide identification of models in the CMIP6 dataset that are best suited for specific modelling objectives. An example is discussed here that focuses on how CMIP6 models reproduce the physical properties of Antarctic Intermediate Water, a key feature of the global oceanic circulation and of the ocean-climate system, noting that the tools and methods introduced here can readily be extended to the analysis of other features and regions. Authors: Ophelie Meuriot (Imperial College London); Yves Plancherel (Imperial College London); Veronica Nieves (University of Valencia) |
Climate science and climate modeling Unsupervised and semi-supervised learning |
ICLR 2023 |
An automatic mobile approach for Tree DBH Estimation Using a Depth Map and a Regression Convolutional Neural Network
(Papers Track)
Abstract and authors: (click to expand)Abstract: Carbon credit programs finance projects to reduce emissions, remove pollutants, improve livelihoods, and protect natural ecosystems. Ensuring the quality and integrity of such projects is essential to their success. One of the most important variables used in nature-based solutions to measure carbon sequestration is the diameter at breast height (DBH) of trees. In this paper, we propose an automatic mobile computer vision method to estimate the DBH of a tree using a single depth map on a smartphone, along with our created dataset DepthMapDBH2023. We successfully demonstrated that this dataset paired with a lightweight regression convolutional neural network is able to accurately estimate the DBH of trees distinct in appearance, shape, number of tree forks, tree density and crowding, and vine presence. Automation of these measurements will help crews in the field who are collecting data for forest inventories. Gathering as much on-the-ground data as possible is required to ensure the transparency of carbon credit projects. Access to high-quality datasets of manual measurements helps improve biomass models which are widely used in the field of ecological simulation. The code used in this paper will be publicly available on Github and the dataset on Kaggle. Authors: Margaux Masson-Forsythe (Earthshot Labs); Margaux Masson-Forsythe (Earthshot Labs) |
Computer vision and remote sensing Carbon capture and sequestration Climate science and climate modeling Forestry and other land use Classification, regression, and supervised learning |
ICLR 2023 |
Safe Multi-Agent Reinforcement Learning for Price-Based Demand Response
(Papers Track)
Abstract and authors: (click to expand)Abstract: Price-based demand response (DR) enables households to provide the flexibility required in power grids with a high share of volatile renewable energy sources. Multi-agent reinforcement learning (MARL) offers a powerful, decentralized decision-making tool for autonomous agents participating in DR programs. Unfortunately, MARL algorithms do not naturally allow one to incorporate safety guarantees, preventing their real-world deployment. To meet safety constraints, we propose a safety layer that minimally adjusts each agent's decisions. We investigate the influence of using a reward function that reflects these safety adjustments. Results show that considering safety aspects in the reward during training improves both convergence speed and performance of the MARL agents in the investigated numerical experiments. Authors: Hannah Markgraf (Technical University of Munich); Matthias Althoff (Technical University of Munich) |
Reinforcement learning and control Buildings Power and energy systems |
ICLR 2023 |
BurnMD: A Fire Projection and Mitigation Modeling Dataset
(Papers Track)
Abstract and authors: (click to expand)Abstract: Today's fire projection modeling tools struggle to keep up with the rapid rate and increasing severity of climate change, leaving disaster managers dependent on tools which are increasingly unrepresentative of complex interactions between fire behavior, environmental conditions, and various mitigation options. This has consequences for equitably minimizing wildfire risks to life, property, ecology, cultural heritage, and public health. Fortunately, decades of data exist for fuel populations, weather conditions, and outcomes of significant fires in the American West and globally. The fire management community faces a lack of data standardization and validation among many competing fire models. Likewise, the machine learning community lacks curated datasets and benchmarks to develop solutions necessary to generate impact in this space. We present a novel dataset composed of 308 medium sized fires from the years 2018-2021, complete with both time series airborne based inference and ground operational estimation of fire extent, and operational mitigation data such as control line construction. As the first large wildfire dataset with mitigation information, Burn Mitigation Dataset (BurnMD) will help advance fire projection modeling, fire risk modeling, and AI generated land management policies. Authors: Marissa Dotter (MITRE Corporation) |
Forestry and other land use Climate science and climate modeling Computer vision and remote sensing Time-series analysis |
ICLR 2023 |
MAHTM: A Multi-Agent Framework for Hierarchical Transactive Microgrids
(Papers Track)
Abstract and authors: (click to expand)Abstract: Integration of variable renewable energy into the grid has posed challenges to system operators in achieving optimal trade-offs among energy availability, cost affordability, and pollution controllability. This paper proposes a multi-agent reinforcement learning framework for managing energy transactions in microgrids. The framework addresses the challenges above: it seeks to optimize the usage of available resources by minimizing the carbon footprint while benefiting all stakeholders. The proposed architecture consists of three layers of agents, each pursuing different objectives. The first layer, comprised of prosumers and consumers, minimizes the total energy cost. The other two layers control the energy price to decrease the carbon impact while balancing the consumption and production of both renewable and conventional energy. This framework also takes into account fluctuations in energy demand and supply. Authors: Nicolas M Cuadrado (MBZUAI); Roberto Alejandro Gutierrez Guillen (MBZUAI); Yongli Zhu (Texas A&M University); Martin Takac (Mohamed bin Zayed University of Artificial Intelligence) |
Power and energy systems Reinforcement learning and control |
ICLR 2023 |
Improving a Shoreline Forecasting Model with Symbolic Regression
(Papers Track)
Abstract and authors: (click to expand)Abstract: Given the current context of climate change and the increasing population densities at coastal zones around the globe, there is an increasing need to be able to predict the development of our coasts. Recent advances in artificial intelligence allow for automatic analysis of observational data. Symbolic Regression (SR) is a type of Machine Learning algorithm that aims to find interpretable symbolic expressions that can explain relations in the data. In this work, we aim to study the problem of forecasting shoreline change using SR. We make use of Cartesian Genetic Programming (CGP) in order to encode and improve upon ShoreFor, a physical shoreline prediction model. During training, CGP individuals are evaluated and selected according to their predictive score at five different coastal sites. This work presents a comparison between a CGP-evolved model and the base ShoreFor model. In addition to evolution's ability to produce well-performing models, it demonstrates the usefulness of SR as a research tool to gain insight into the behaviors of shorelines in various geographical zones. Authors: Mahmoud AL NAJAR (Laboratory of Spatial Geophysics and Oceanography Studies); Rafael ALMAR (Laboratory of Spatial Geophysics and Oceanography Studies); Erwin BERGSMA (CNES); Jean-Marc DELVIT (CNES); Dennis Wilson (ISAE) |
Interpretable ML Earth observations and monitoring Extreme weather Hybrid physical models Time-series analysis |
ICLR 2023 |
Remote Control: Debiasing Remote Sensing Predictions for Causal Inference
(Papers Track)
Abstract and authors: (click to expand)Abstract: Understanding and properly estimating the impacts of environmental interventions is of critical importance as we work towards achieving global climate goals. Remote sensing has become an essential tool for evaluating when and where climate policies have positive impacts on factors like greenhouse gas emissions and carbon sequestration. However, when machine learning models trained to predict outcomes using remotely sensed data simply minimize a standard loss function, the predictions that they generate can produce biased estimates in downstream causal inference. If prediction error in the outcome variable is correlated with policy variables or important confounders, as is the case for many widely used remote sensing data sets, estimates of the causal impacts of policies can be biased. In this paper, we demonstrate how this bias can arise, and we propose the use of an adversarial debiasing model (Zhang, Lemoine, and Mitchell 2018) in order to correct the issue when using satellite data to generate machine learning predictions for use in causal inference. We apply this method to a case study of the relationship between roads and tree cover in West Africa, where our results indicate that adversarial debiasing can recover a much more accurate estimate of the parameter of interest compared to when the standard approach is used. Authors: Matthew Gordon (Yale); Megan Ayers (Yale University); Eliana Stone (Yale School of the Environment); Luke C Sanford (Yale School of the Environment) |
Classification, regression, and supervised learning Forestry and other land use Computer vision and remote sensing |
ICLR 2023 |
A simplified machine learning based wildfire ignition model from insurance perspective
(Papers Track)
Abstract and authors: (click to expand)Abstract: In the context of climate change, wildfires are becoming more frequent, intense, and prolonged in the western US, particularly in California. Wildfires cause catastrophic socio-economic losses and are projected to worsen in the near future. Inaccurate estimates of fire risk put further pressure on wildfire (re)insurance and cause many homes to lose wildfire insurance coverage. Efficient and effective prediction of fire ignition is one step towards better fire risk assessment. Here we present a simplified machine learning-based fire ignition model at yearly scale that is well suited to the use case of one-year term wildfire (re)insurance. Our model yields a recall, precision, and the area under the precision-recall curve of 0.69, 0.86 and 0.81, respectively, for California, and significantly higher values of 0.82, 0.90 and 0.90, respectively, for the populated area, indicating its good performance. In addition, our model feature analysis reveals that power line density, enhanced vegetation index (EVI), vegetation optical depth (VOD), and distance to the wildland-urban interface stand out as the most important features determining ignitions. The framework of this simplified ignition model could easily be applied to other regions or genesis of other perils like hurricane, and it paves the road to a broader and more affordable safety net for homeowners. Authors: Yaling Liu (OurKettle Inc); Son Le (OurKettle Inc.); Yufei Zou (Our Kettle, Inc.); mojtaba Sadgedhi (OurKettle Inc.); Yang Chen (University of California, Irvine); Niels Andela (BeZero Carbon); Pierre Gentine (Columbia University) |
Classification, regression, and supervised learning Earth observations and monitoring Earth science Societal adaptation and resilience Interpretable ML |
ICLR 2023 |
Nested Fourier Neural Operator for Basin-Scale 4D CO2 Storage Modeling
(Papers Track)
Abstract and authors: (click to expand)Abstract: Carbon capture and storage (CCS) plays an essential role in global decarbonization. Scaling up CCS requires accurate and high-resolution modeling of the storage reservoir pressure buildup and the gaseous plume migration. However, such modeling is very challenging at scale due to the high computational costs of existing numerical methods. This challenge leads to significant uncertainty in evaluating storage opportunities which can delay the pace of global CCS deployments. We introduce a machine-learning approach for dynamic basin-scale modeling that speeds up flow prediction nearly 700,000 times compared to existing methods. Our framework, Nested Fourier Neural Operator (FNO), provides a general-purpose simulator alternative under diverse reservoir conditions, geological heterogeneity, and injection schemes. It enables unprecedented real-time high-fidelity modeling to support decision-making in basin-scale CCS projects. Authors: Gege Wen (Stanford University) |
Carbon capture and sequestration Earth science Classification, regression, and supervised learning |
ICLR 2023 |
SEA LEVEL PROJECTIONS WITH MACHINE LEARNING USING ALTIMETRY AND CLIMATE MODEL ENSEMBLES
(Papers Track)
Abstract and authors: (click to expand)Abstract: Satellite altimeter observations retrieved since 1993 show that the global mean sea level is rising at an unprecedented rate (3.4mm/year). With almost three decades of observations, we can now investigate the contributions of anthropogenic climate-change signals such as greenhouse gases, aerosols, and biomass burning in this rising sea level. We use machine learning (ML) to investigate future patterns of sea level change. To understand the extent of contributions from the climate-change signals, and to help in forecasting sea level change in the future, we turn to climate model simulations. This work presents a machine learning framework that exploits both satellite observations and climate model simulations to generate sea level rise projections at a 2-degree resolution spatial grid, 30 years into the future. We train fully connected neural networks (FCNNs) to predict altimeter values through a non-linear fusion of the climate model hindcasts (for 1993-2019). The learned FCNNs are then applied to future climate model projections to predict future sea level patterns. We propose segmenting our spatial dataset into meaningful clusters and show that clustering helps to improve predictions of our ML model. Authors: Saumya Sinha (University of Colorado, Boulder); John Fasullo (NCAR); R. Steven Nerem (Univesity of Colorado, Boulder); Claire Monteleoni (University of Colorado Boulder) |
Classification, regression, and supervised learning Climate science and climate modeling Earth science Oceans and marine systems |
ICLR 2023 |
Global-Local Policy Search and Its Application in Grid-Interactive Building Control
(Papers Track)
Abstract and authors: (click to expand)Abstract: As the buildings sector represents over 70% of the total U.S. electricity consumption, it offers a great amount of untapped demand-side resources to tackle many critical grid-side problems and improve the overall energy system's efficiency. To help make buildings grid-interactive, this paper proposes a global-local policy search method to train a reinforcement learning (RL) based controller which optimizes building operation during both normal hours and demand response (DR) events. Experiments on a simulated five-zone commercial building demonstrate that by adding a local fine-tuning stage to the evolution strategy policy training process, the control costs can be further reduced by 7.55% in unseen testing scenarios. Baseline comparison also indicates that the learned RL controller outperforms a pragmatic linear model predictive controller (MPC), while not requiring intensive online computation. Authors: Xiangyu Zhang (National Renewable Energy Laboratory); Yue Chen (National Renewable Energy Laboratory); Andrey Bernstein (NREL) |
Buildings Power and energy systems Reinforcement learning and control |
ICLR 2023 |
Accuracy is not the only Metric that matters: Estimating the Energy Consumption of Deep Learning Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Modern machine learning models have started to consume incredible amounts of energy, thus incurring large carbon footprints (Strubell et al., 2019). To address this issue, we have created an energy estimation pipeline, which allows practitioners to estimate the energy needs of their models in advance, without actually running or training them. We accomplished this, by collecting high-quality energy data and building a first baseline model, capable of predicting the energy consumption of DL models by accumulating their estimated layer-wise energies. Authors: Johannes Getzner (Technical University of Munich); Bertrand Charpentier (Technical University of Munich); Stephan Günnemann (Technical University of Munich) |
Power and energy systems Classification, regression, and supervised learning Data mining |
ICLR 2023 |
Topology Estimation from Voltage Edge Sensing for Resource-Constrained Grids
(Papers Track)
Abstract and authors: (click to expand)Abstract: Electric grids are the conduit for renewable energy delivery and will play a crucial role in mitigating climate change. To do so successfully in resource-constrained low- and middle-income countries (LMICs), increasing operational efficiency is key. Such efficiency demands evolving knowledge of the grid’s state, of which topology---how points on the network are physically connected---is fundamental. In LMICs, knowledge of distribution topology is limited and established methods for topology estimation rely on expensive sensing infrastructure, such as smart meters or PMUs, that are inaccessible at scale. This paper lays the foundation for topology estimation from more accessible data: outlet-level voltage magnitude measurements. It presents a graph-based algorithm and explanatory visualization using the Fielder vector for estimating and communicating topological proximity from this data. We demonstrate the method on a real dataset collected in Accra, Ghana, thus opening the possibility of globally accessible, cutting-edge grid monitoring through non-traditional sensing strategies coupled with ML. Authors: Mohini S Bariya (nLine); Genevieve Flaspohler (nLine) |
Unsupervised and semi-supervised learning Climate justice Time-series analysis |
ICLR 2023 |
Chasing Low-Carbon Electricity for Practical and Sustainable DNN Training
(Papers Track)
Abstract and authors: (click to expand)Abstract: Deep learning has experienced significant growth in recent years, resulting in increased energy consumption and carbon emission from the use of GPUs for training deep neural networks (DNNs). Answering the call for sustainability, conventional solutions have attempted to move training jobs to locations or time frames with lower carbon intensity. However, moving jobs to other locations may not always be feasible due to large dataset sizes or data regulations. Moreover, postponing training can negatively impact application service quality because the DNNs backing the service are not updated in a timely fashion. In this work, we present a practical solution that reduces the carbon footprint of DNN training without migrating or postponing jobs. Specifically, our solution observes real-time carbon intensity shifts during training and controls the energy consumption of GPUs, thereby reducing carbon footprint while maintaining training performance. Furthermore, in order to proactively adapt to shifting carbon intensity, we propose a lightweight machine learning algorithm that predicts the carbon intensity of the upcoming time frame. Our solution, Chase, reduces the total carbon footprint of training ResNet-50 on ImageNet by 13.6% while only increasing training time by 2.5%. Authors: Zhenning Yang (University of Michigan, Ann Arbor); Luoxi Meng (University of Michigan, Ann Arbor); Jae-Won Chung (University of Michigan, Ann Arbor); Mosharaf Chowdhury (University of Michigan, Ann Arbor) |
Classification, regression, and supervised learning Other Time-series analysis |
ICLR 2023 |
SOLAR PANEL MAPPING VIA ORIENTED OBJECT DETECTION
(Papers Track)
Abstract and authors: (click to expand)Abstract: Maintaining the integrity of solar power plants is a vital component in dealing with the current climate crisis. This process begins with analysts creating a de- tailed map of a plant with the coordinates of every solar panel, making it possible to quickly locate and mitigate potential faulty solar panels. However, this task is extremely tedious and is not scalable for the ever increasing capacity of so- lar power across the globe. Therefore, we propose an end-to-end deep learning framework for detecting individual solar panels using a rotated object detection architecture. We evaluate our approach on a diverse dataset of solar power plants collected from across the United States and report a mAP score of 83.3%. Authors: Conor Wallace (DroneBase); Isaac Corley (University of Texas at San Antonio); Jonathan Lwowski (DroneBase) |
Computer vision and remote sensing |
ICLR 2023 |
Disentangling observation biases to monitor spatio-temporal shifts in species distributions
(Proposals Track)
Abstract and authors: (click to expand)Abstract: The accelerated pace of environmental change due to anthropogenic activities makes it more important than ever to understand current and future ecosystem dynamics at a global scale. Species observations stemming from citizen science platforms are increasingly leveraged to gather information about the geographic distributions of many species. However, their usability is limited by the strong biases inherent to these community-driven efforts. These biases in the sampling effort are often treated as noise that has to be compensated for. In this project, we posit that better modelling the sampling effort (including the usage of the different platforms across countries, local accessibility, attractiveness of the location for platform users, affinity of different user groups for different species, etc.) is the key towards improving Species Distribution Models (SDM) using observations from citizen science platforms, thus opening up the possibility of leveraging them to monitor changes in species distributions and population densities. Authors: Diego Marcos (Inria); Christophe Botella (); Ilan Havinga (Wageningen University); Dino Ienco (INRAE); Cassio F. Dantas (TETIS, INRAE, Univ Montpellier); Pierre Alliez (INRIA Sophie-Antipolis, France); Alexis Joly (INRIA, FR) |
Causal and Bayesian methods Earth observations and monitoring Computer vision and remote sensing Hybrid physical models |
ICLR 2023 |
Mapping global innovation networks around clean energy technologies
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Reaching net zero emissions requires rapid innovation and scale-up of clean tech. In this context, clean tech innovation networks (CTINs) can play a crucial role by pooling necessary resources and competences and enabling knowledge transfers between different actors. However, existing evidence on CTINs is limited due to a lack of comprehensive data. Here, we develop a machine learning framework to identify CTINs from announcements on social media to map the global CTIN landscape. Specifically, we classify the social media announcements regarding the type of technology (e.g., hydrogen, solar), interaction type (e.g., equity investment, R\&D collaboration), and status (e.g., commencement, update). We then extract referenced organizations via entity recognition. Thereby, we generate a large-scale dataset of CTINs across different technologies, countries, and over time. This allows us to compare characteristics of CTINs, such as the geographic proximity of actors, and to investigate the association between network evolution and technology innovation and diffusion. As a direct implication, our work helps policy makers to promote CTINs by identifying current barriers and needs. Authors: Malte Toetzke (ETH Zurich); Francesco Re (ETH Zurich); Benedict Probst (ETH Zurich); Stefan Feuerriegel (LMU Munich); Laura Diaz Anadon (University of Cambridge); Volker Hoffmann (ETH Zurich) |
Climate finance and economics Supply chains Natural language processing |
ICLR 2023 |
Widespread increases in future wildfire risk to global forest carbon offset projects revealed by explainable AI
(Papers Track)
Abstract and authors: (click to expand)Abstract: Carbon offset programs are critical in the fight against climate change. One emerging threat to the long-term stability and viability of forest carbon offset projects is wildfires, which can release large amounts of carbon and limit the efficacy of associated offsetting credits. However, analysis of wildfire risk to forest carbon projects is challenging because existing models for forecasting long-term fire risk are limited in predictive accuracy. Therefore, we propose an explainable artificial intelligence (XAI) model trained on 7 million global satellite wildfire observations. Validation results suggest substantial potential for high resolution, enhanced accuracy projections of global wildfire risk, and the model outperforms the U.S. National Center for Atmospheric Research's leading fire model. Applied to a collection of 190 global forest carbon projects, we find that fire exposure is projected to increase 55% [37-76%] by 2080 under a mid-range scenario (SSP2-4.5). Our results indicate the large wildfire carbon project damages seen in the past decade are likely to become more frequent as forests become hotter and drier. In response, we hope the model can support wildfire managers, policymakers, and carbon market analysts to preemptively quantify and mitigate long-term permanence risks to forest carbon projects. Authors: Tristan Ballard (Sust Inc); Gopal Erinjippurath (Sust Global); Matthew W Cooper (Sust Global); Chris Lowrie (Sust Global) |
Climate finance and economics Forestry and other land use Interpretable ML |
ICLR 2023 |
A High-Resolution, Data-Driven Model of Urban Carbon Emissions
(Papers Track)
Best Pathway to Impact
Abstract and authors: (click to expand)Abstract: Cities represent both a fundamental contributor to greenhouse (GHG) emissions and a catalyst for climate action. Many global cities have outlined sustainability and climate change mitigation plans, focusing on energy efficiency, shifting away from fossil fuels, and prioritizing environmental and social justice. To achieve broad-based and equitable carbon emissions reductions and sustainability goals, new data-driven methodologies are needed to identify and target efficiency and carbon reduction opportunities in the built environment at the building, neighborhood, and city-scale. Our methodology integrates data from numerous data sources and develops data-driven and physical models of energy use and carbon emissions from buildings and transportation to generate a high spatiotemporal resolution model of urban greenhouse gas emissions. The method and data tool are designed to support city leaders and urban policymakers with an unprecedented view of localized carbon emissions to enable data-driven and evidenced-based climate action. Authors: Bartosz Bonczak (New York University); Boyeong Hong (New York University); Constantine E. Kontokosta (New York University) |
Hybrid physical models Climate science and climate modeling Data mining |
ICLR 2023 |
ClimaX: A foundation model for weather and climate
(Papers Track)
Abstract and authors: (click to expand)Abstract: Recent data-driven approaches based on machine learning aim to directly solve a downstream forecasting or projection task by learning a data-driven functional mapping using deep neural networks. However, these networks are trained using curated and homogeneous climate datasets for specific spatiotemporal tasks, and thus lack the generality of currently used physics-informed numerical models for weather and climate modeling. We develop and demonstrate ClimaX, a flexible and generalizable deep learning model for weather and climate science that can be trained using heterogeneous datasets spanning different variables, spatiotemporal coverage, and physical groundings. ClimaX extends the Transformer architecture with novel encoding and aggregation blocks that allow effective use of available compute and data while maintaining general utility. ClimaX is pre-trained with a self-supervised learning objective on climate datasets derived from CMIP6. The pretrained ClimaX can then be fine-tuned to address a breadth of climate and weather tasks, including those that involve atmospheric variables and spatiotemporal scales unseen during pretraining. Compared to existing data-driven baselines, we show that this generality in ClimaX results in superior performance on benchmarks for weather forecasting and climate projections. Authors: Tung Nguyen (University of California, Los Angeles); Johannes Brandstetter (Microsoft Research); Ashish Kapoor (Microsoft); Jayesh Gupta (Microsoft Research); Aditya Grover (UCLA) |
Climate science and climate modeling Earth science Meta- and transfer learning |
ICLR 2023 |
Learning to Communicate and Collaborate in a Competitive Multi-Agent Setup to Clean the Ocean from Macroplastics
(Papers Track)
Abstract and authors: (click to expand)Abstract: Finding a balance between collaboration and competition is crucial for artificial agents in many real-world applications. We investigate this using a Multi-Agent Reinforcement Learning (MARL) setup on the back of a high-impact problem. The accumulation and yearly growth of plastic in the ocean cause irreparable damage to many aspects of oceanic health and the marina system. To prevent further damage, we need to find ways to reduce macroplastics from known plastic patches in the ocean. Here we propose a Graph Neural Network (GNN) based communication mechanism that increases the agents' observation space. In our custom environment, agents control a plastic collecting vessel. The communication mechanism enables agents to develop a communication protocol using a binary signal. While the goal of the agent collective is to clean up as much as possible, agents are rewarded for the individual amount of macroplastics collected. Hence agents have to learn to communicate effectively while maintaining high individual performance. We compare our proposed communication mechanism with a multi-agent baseline without the ability to communicate. Results show communication enables collaboration and increases collective performance significantly. This means agents have learned the importance of communication and found a balance between collaboration and competition. Authors: Philipp D Siedler (Aleph Alpha) |
Reinforcement learning and control Oceans and marine systems |
ICLR 2023 |
Sub-seasonal to seasonal forecasts through self-supervised learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Sub-seasonal to seasonal (S2S) weather forecasts are an important decision- making tool that informs economical and logistical planning in agriculture, energy management, and disaster mitigation. They are issued on time scales of weeks to months and differ from short-term weather forecasts in two important ways: (i) the dynamics of the atmosphere on these timescales can be described only statistically and (ii) these dynamics are characterized by large-scale phenomena in both space and time. While deep learning (DL) has shown promising results in short-term weather forecasting, DL-based S2S forecasts are challenged by comparatively small volumes of available training data and large fluctuations in predictability due to atmospheric conditions. In order to develop more reliable S2S predictions that leverage current advances in DL, we propose to utilize the masked auto-encoder (MAE) framework to learn generic representations of large-scale atmospheric phenomena from high resolution global data. Besides exploring the suitability of the learned representations for S2S forecasting, we will also examine whether they account for climatic phenomena (e.g., the Madden-Julian Oscillation) that are known to increase predictability on S2S timescales. Authors: Jannik Thuemmel (University of Tuebingen); Felix Strnad (Potsdam Institute for Climate Impact Research); Jakob Schlör (Eberhard Karls Universität Tübingen); Martin V. Butz (University of Tübingen); Bedartha Goswami (University of Tübingen) |
Unsupervised and semi-supervised learning Climate science and climate modeling Extreme weather Interpretable ML |
ICLR 2023 |
Mining Effective Strategies for Climate Change Communication
(Papers Track)
Abstract and authors: (click to expand)Abstract: With the goal of understanding effective strategies to communicate about climate change, we build interpretable models to rank tweets related to climate change with respect to the engagement they generate. Our models are based on the Bradley-Terry model of pairwise comparison outcomes and use a combination of the tweets’ topic and metadata features to do the ranking. To remove confounding factors related to author popularity and minimise noise, they are trained on pairs of tweets that are from the same author and around the same time period and have a sufficiently large difference in engagement. The models achieve good accuracy on a held-out set of pairs. We show that we can interpret the parameters of the trained model to identify the topic and metadata features that contribute to high engagement. Among other observations, we see that topics related to climate projections, human cost and deaths tend to have low engagement while those related to mitigation and adaptation strategies have high engagement. We hope the insights gained from this study will help craft effective climate communication to promote engagement, thereby lending strength to efforts to tackle climate change. Authors: Aswin Suresh (EPFL); Lazar Milikic (EPFL); Francis Murray (EPFL); Yurui Zhu (EPFL); Matthias Grossglauser (École Polytechnique Fédérale de Lausanne (EPFL)) |
Natural language processing Behavioral and social science Public policy Societal adaptation and resilience Classification, regression, and supervised learning Data mining Interpretable ML Unsupervised and semi-supervised learning |
ICLR 2023 |
Graph-Based Deep Learning for Sea Surface Temperature Forecasts
(Papers Track)
Abstract and authors: (click to expand)Abstract: Sea surface temperature (SST) forecasts help with managing the marine ecosystem and the aquaculture impacted by anthropogenic climate change. Numerical dynamical models are resource intensive for SST forecasts; machine learning (ML) models could reduce high computational requirements and have been in the focus of the research community recently. ML models normally require a large amount of data for training. Environmental data are collected on regularly-spaced grids, so early work mainly used grid-based deep learning (DL) for prediction. However, both grid data and the corresponding DL approaches have inherent problems. As geometric DL has emerged, graphs as a more generalized data structure and graph neural networks (GNNs) have been introduced to the spatiotemporal domains. In this work, we preliminarily explored graph re-sampling and GNNs for global SST forecasts, and GNNs show better one month ahead SST prediction than the persistence model in most oceans in terms of root mean square errors. Authors: Ding Ning (University of Canterbury); Varvara Vetrova (University of Canterbury); Karin Bryan (University of Waikato) |
Data mining Climate science and climate modeling Oceans and marine systems Classification, regression, and supervised learning Other Time-series analysis |
ICLR 2023 |
Activity-Based Recommendations for the Reduction of CO2 Emissions in Private Households
(Papers Track)
Abstract and authors: (click to expand)Abstract: This paper proposes an activity prediction framework for a multi-agent recommendation system to tackle the energy-efficiency problem in residential buildings. Our system generates an activity-shifting schedule based on the social practices from the users’ domestic life. We further provide a utility option for the recommender system to focus on saving CO2 emissions or energy costs, or both. The empirical results show that while focusing on the reduction of CO2 emissions, the system provides an average of 12% of emission savings and 7% of electricity cost savings. When concentrating on energy costs, 6% of emission savings and 20% of electricity cost savings are possible for the studied households. Authors: Alona Zharova (Humboldt University of Berlin); Laura Löschmann (Humboldt University of Berlin) |
Recommender systems Buildings Power and energy systems Classification, regression, and supervised learning |
ICLR 2023 |
Data-driven mean-variability optimization of PV portfolios with automatic differentiation
(Papers Track)
Abstract and authors: (click to expand)Abstract: Increasing PV capacities has a crucial role to reach carbon-neutral energy systems. To promote PV expansion, policy designs have been developed which rely on energy yield maximization to increase the total PV energy supply in energy systems. Focusing on yield maximization, however, ignores negative side-effects such as an increased variability due to similar-orientated PV systems at clustered regions. This can lead to costly ancillary services and thereby reduces the acceptance of renewable energy. This paper suggests to rethink PV portfolio designs by deriving mean-variability hedged PV portfolios with smartly orientated tilt and azimuth angles. Based on a data-driven method inspired from modern portfolio theory, we formulate the problem as a biobjective, non-convex optimization problem which is solved based on automatically differentiating the physical PV conversion model subject to individual tilt and azimuth angles. To illustrate the performance of the proposed method, a case study is designed to derive efficient frontiers in the mean-variability spectrum of Germany's PV portfolio based on representative grid points. The proposed method allows decision-makers to hedge between variability and yield in PV portfolio design decisions. This is the first study highlighting the problem of ignoring variability in PV portfolio expansion schemes and introduces a way to tackle this issue using modern methods inspired by Machine Learning. Authors: Matthias Zech (German Aerospace Center (DLR), Institute of Networked Energy Systems); Lueder von Bremen (German Aerospace Center (DLR), Institute of Networked Energy Systems) |
Power and energy systems Climate finance and economics Classification, regression, and supervised learning Interpretable ML |
ICLR 2023 |
DiffESM: Conditional Emulation of Earth System Models with Diffusion Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Earth System Models (ESMs) are essential tools for understanding the impact of human actions on Earth's climate. One key application of these models is studying extreme weather events, such as heat waves or dry spells, which have significant socioeconomic and environmental consequences. However, the computational demands of running a sufficient number of simulations to analyze the risks are often prohibitive. In this paper we demonstrate that diffusion models -- a class of generative deep learning models -- can effectively emulate the spatio-temporal trends of ESMs under previously unseen climate scenarios, while only requiring a small fraction of the computational resources. We present a diffusion model that is conditioned on monthly averages of temperature or precipitation on a 96x96 global grid, and produces daily values that are both realistic and consistent with those averages. Our results show that the output from our diffusion model closely matches the spatio-temporal behavior of the ESM it emulates in terms of the frequency of phenomena such as heat waves, dry spells, or rainfall intensity. Authors: Seth Bassetti (Western Washington University); Brian Hutchinson (Western Washington University); Claudia Tebaldi (Joint Global Change Research Institute); Ben Kravitz (Indiana University) |
Climate science and climate modeling Generative modeling |
ICLR 2023 |
Emission-Constrained Optimization of Gas Systems with Input-Convex Neural Networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: Planning optimization of gas networks under emission constraints prioritizes gas supply with the smallest emission footprint. As this problem includes complex gas flow physical laws, standard optimization solvers cannot guarantee convergence to a feasible solution, especially under strict emission constraints. To address this issue, we develop an input-convex neural network (ICNN) aided optimization routine which incorporates a set of trained ICNNs approximating the gas flow equations with high precision. Numerical tests on the Belgium gas network demonstrate that the ICNN-aided optimization dominates non-convex and relaxation-based solvers, with larger optimality gains pertaining to stricter emission targets. Authors: Vladimir Dvorkin (Massachusetts Institute of Technology); Samuel C Chevalier (Technical University of Denmark); Spyros Chatzivasileiadis (Technical University of Denmark) |
Power and energy systems Supply chains |
ICLR 2023 |
Deep ensembles to improve uncertainty quantification of statistical downscaling models under climate change conditions
(Papers Track)
Abstract and authors: (click to expand)Abstract: Recently, deep learning has emerged as a promising tool for statistical downscaling, the set of methods for generating high-resolution climate fields from coarse low-resolution variables. Nevertheless, their ability to generalize to climate change conditions remains questionable, mainly due to the stationarity assumption. We propose deep ensembles as a simple method to improve the uncertainty quantification of statistical downscaling models. By better capturing uncertainty, statistical downscaling models allow for superior planning against extreme weather events, a source of various negative social and economic impacts. Since no observational future data exists, we rely on a pseudo reality experiment to assess the suitability of deep ensembles for quantifying the uncertainty of climate change projections. Deep ensembles allow for a better risk assessment, highly demanded by sectoral applications to tackle climate change. Authors: Jose González-Abad (Instituto de Fı́sica de Cantabria (IFCA), CSIC-Universidad de Cantabria); Jorge Baño-Medina (Institute of Physics of Cantabria) |
Uncertainty quantification and robustness Climate science and climate modeling |
ICLR 2023 |
Bayesian Inference of Severe Hail in Australia
(Papers Track)
Abstract and authors: (click to expand)Abstract: Severe hailstorms are responsible for some of the most costly insured weather events in Australia and can cause significant damage to homes, businesses, and agriculture. However their response to climate change remains uncertain, in large part due to the challenges of observing severe hailstorms. We propose a novel Bayesian approach which explicitly models known biases and uncertainties of current hail observations to produce more realistic estimates of severe hail risk from existing observations. Training this model on data from south-east Queensland, Australia, suggests that previous analyses of severe hail that did not account for this uncertainty may produce poorly calibrated risk estimates. Preliminary evaluation on withheld data confirms that our model produces well-calibrated probabilities and is applicable out of sample. Whilst developed for hail, we highlight also the generality of our model and its potential applications to other severe weather phenomena and areas of climate change adaptation and mitigation. Authors: Isabelle C Greco (University of New South Wales); Steven Sherwood (University of New South Wales); Timothy Raupach (University of New South Wales); Gab Abramowitz (University of New South Wales) |
Causal and Bayesian methods Climate science and climate modeling Disaster management and relief Earth observations and monitoring Extreme weather Uncertainty quantification and robustness |
ICLR 2023 |
Exploring the potential of neural networks for Species Distribution Modeling
(Papers Track)
Abstract and authors: (click to expand)Abstract: Species distribution models (SDMs) relate species occurrence data with environmental variables and are used to understand and predict species distributions across landscapes. While some machine learning models have been adopted by the SDM community, recent advances in neural networks may have untapped potential in this field. In this work, we compare the performance of multi-layer perceptron (MLP) neural networks to well-established SDM methods on a benchmark dataset spanning 225 species in six geographical regions. We also compare the performance of MLPs trained separately for each species to an equivalent model trained on a set of species and performing multi-label classification. Our results show that MLP models achieve comparable results to state-of-the-art SDM methods, such as MaxEnt. We also find that multi-species MLPs perform slightly better than single-species MLPs. This study indicates that neural networks, along with all their convenient and valuable characteristics, are worth considering for SDMs. Authors: Robin Zbinden (EPFL); Nina van Tiel (EPFL); Benjamin Kellenberger (Yale University); Lloyd H Hughes (EPFL); Devis Tuia (EPFL) |
Ecosystems and biodiversity Classification, regression, and supervised learning |
ICLR 2023 |
Modelling Atmospheric Dynamics with Spherical Fourier Neural Operators
(Papers Track)
Abstract and authors: (click to expand)Abstract: Fourier Neural Operators (FNOs) have established themselves as an efficient method for learning resolution-independent operators in a wide range of scientific machine learning applications. This can be attributed to their ability to effectively model long-range dependencies in spatio-temporal data through computationally ef- ficient global convolutions. However, the use of discrete Fourier transforms (DFTs) in FNOs leads to spurious artifacts and pronounced dissipation when applied to spherical coordinates, due to the incorrect assumption of flat geometry. To ad- dress the issue, we introduce Spherical FNOs (SFNOs), which use the generalized Fourier transform for learning operators on spherical geometries. We demonstrate the effectiveness of the method for forecasting atmospheric dynamics, producing stable auto-regressive results for a simulated time of one year (1,460 steps) while retaining physically plausible dynamics. This development has significant implica- tions for machine learning-based climate dynamics emulation, which could play a crucial role in accelerating our response to climate change. Authors: Boris Bonev (NVIDIA); Thorsten Kurth (Nvidia); Christian Hundt (NVIDIA AI Technology Center); Jaideep Pathak (NVIDIA Corporation); Maximilian Baust (NVIDIA); Karthik Kashinath (NVIDIA); Anima Anandkumar (NVIDIA/Caltech) |
Hybrid physical models Climate science and climate modeling Earth science Extreme weather |
ICLR 2023 |
Distributed Reinforcement Learning for DC Open Energy Systems
(Papers Track)
Abstract and authors: (click to expand)Abstract: The direct current open energy system (DCOES) enables the production, storage, and exchange of renewable energy within local communities, which is helpful, especially in isolated villages and islands where centralized power supply is unavailable or unstable. As solar and wind energy production varies in time and space depending on the weather and the energy usage patterns differ for different households, how to store and exchange energy is an important research issue. In this work, we explore the use of deep reinforcement learning (DRL) for adaptive control of energy storage in local batteries and energy sharing through DC grids. We extend the Autonomous Power Interchange System (APIS) emulator from SonyCSL to combine it with reinforcement learning algorithms in each house. We implemented deep Q-network (DQN) and prioritized DQN to dynamically set the parameters of the real-time energy exchange protocol of APIS and tested it using the actual data collected from the DCOES in the faculty houses of Okinawa Institute of Science and Technology (OIST). The simulation results showed that RL agents outperformed the hand-tuned control strategy. Sharing average energy production, storage, and usage within the local community further improved efficiency. The implementation of DRL methods for adaptive energy storage and exchange can help reducing carbon emission and positively impact the climate. Authors: Qiong Huang (Okinawa Institute of Science and Technology Graduate University); Kenji Doya (Okinawa Institute of Science and Technology) |
Reinforcement learning and control Power and energy systems |
ICLR 2023 |
Uncovering the Spatial and Temporal Variability of Wind Resources in Europe: A Web-Based Data-Mining Tool
(Papers Track)
Abstract and authors: (click to expand)Abstract: We introduce REmap-eu.app, a web-based data-mining visualization tool of the spatial and temporal variability of wind resources. It uses the latest open-access dataset of the daily wind capacity factor in 28 European countries between 1979 and 2019 and proposes several user-configurable visualizations of the temporal and spatial variations of the wind power capacity factor. The platform allows for a deep analysis of the distribution, the cross-country correlation, and the drivers of low wind power events. It offers an easy-to-use interface that makes it suitable for the needs of researchers and stakeholders. The tool is expected to be useful in identifying areas of high wind potential and possible challenges that may impact the large-scale deployment of wind turbines in Europe. Particular importance is given to the visualization of low wind power events and to the potential of cross-border cooperations in mitigating the variability of wind in the context of increasing reliance on weather-sensitive renewable energy sources. Authors: Alban Puech (École Polytechnique); Jesse Read (Ecole Polytechnique) |
Data mining Climate science and climate modeling Power and energy systems Time-series analysis |
ICLR 2023 |
Understanding forest resilience to drought with Shapley values
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Increases in drought frequency, intensity, and duration due to climate change are threatening forests around the world. Climate-driven tree mortality is associated with devastating ecological and societal consequences, including the loss of carbon sequestration, habitat provisioning, and water filtration services. A spatially fine-grained understanding of the site characteristics making forests more resilient to drought is still lacking. Furthermore, the complexity of drought effects on forests, which can be cumulative and delayed, demands investigation of the most appropriate drought indices. In this study, we aim to gain a better understanding of the temporal and spatial drivers of drought-induced changes in forest vitality using Shapley values, which allow for the relevance of predictors to be quantified locally. A better understanding of the contribution of meteorological and environmental factors to trees’ response to drought can support forest managers aiming to make forests more climate-resilient. Authors: Stenka Vulova (Technische Universität Berlin); Alby Duarte Rocha (Technische Universität Berlin); Akpona Okujeni (Humboldt-Universität zu Berlin); Johannes Vogel (Freie Universität Berlin); Michael Förster (Technische Universität Berlin); Patrick Hostert (Humboldt-Universität zu Berlin); Birgit Kleinschmit (Technische Universität Berlin) |
Forestry and other land use Earth observations and monitoring Earth science Ecosystems and biodiversity Extreme weather Classification, regression, and supervised learning Interpretable ML |
ICLR 2023 |
Improving the spatial accuracy of extreme tropical cyclone rainfall in ERA5 using deep learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: We propose a novel method for the bias adjustment and post-processing of gridded rainfall data products. Our method uses U-Net (a deep convolutional neural network) as a backbone, and a novel loss function given by the combination of a pixelwise bias component (Mean Absolute Error) and a spatial accuracy component (Fractions Skill Score). We evaluate the proposed approach by adjusting extreme rainfall from the popular ERA5 reanalysis dataset, using the multi-source observational dataset MSWEP as a target. We focus on a sample of extreme rainfall events induced by tropical cyclones and show that the proposed method significantly reduces both the MAE (by 16\%) and FSS (by 53\%) of ERA5. Authors: Guido Ascenso (Politecnico di Milano); Andrea Ficchì (Politecnico di Milano); Matteo Giuliani (Politecnico di Milano); Leone Cavicchia (Fondazione Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC)); Enrico Scoccimarro (Fondazione Centro Euro-Mediterraneo sui Cambiamenti Climatici (CMCC)); Andrea Castelletti (Politecnico di Milano) |
Extreme weather Climate science and climate modeling Classification, regression, and supervised learning Computer vision and remote sensing |
ICLR 2023 |
XAI for transparent wind turbine power curve models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Accurate wind turbine power curve models, which translate ambient conditions into turbine power output, are crucial for wind energy to scale and fulfill its proposed role in the global energy transition. While machine learning (ML) methods have shown significant advantages over parametric, physics-informed approaches, they are often criticized for being opaque "black boxes", which hinders their application in practice. We apply Shapley values, a popular explainable artificial intelligence (XAI) method, and the latest findings from XAI for regression models, to uncover the strategies ML models have learned from operational wind turbine data. Our findings reveal that the trend towards ever larger model architectures, driven by a focus on test set performance, can result in physically implausible model strategies. Therefore, we call for a more prominent role of XAI methods in model selection. Moreover, we propose a practical approach to utilize explanations for root cause analysis in the context of wind turbine performance monitoring. This can help to reduce downtime and increase the utilization of turbines in the field. Authors: Simon Letzgus (Technische Universität Berlin) |
Power and energy systems Classification, regression, and supervised learning Interpretable ML |
ICLR 2023 |
Green AutoML for Plastic Litter Detection
(Papers Track)
Abstract and authors: (click to expand)Abstract: The world’s oceans are polluted with plastic waste and the detection of it is an important step toward removing it. Wolf et al. (2020) created a plastic waste dataset to develop a plastic detection system. Our work aims to improve the machine learning model by using Green Automated Machine Learning (AutoML). One aspect of Green-AutoML is to search for a machine learning pipeline, while also minimizing the carbon footprint. In this work, we train five standard neural architectures for image classification on the aforementioned plastic waste dataset. Subsequently, their performance and carbon footprints are compared to an Efficient Neural Architecture Search as a well-known AutoML approach. We show the potential of Green-AutoML by outperforming the original plastic detection system by 1.1% in accuracy and using 33 times fewer floating point operations at inference, and only 29% of the carbon emissions of the best-known baseline. This shows the large potential of AutoML on climate-change relevant applications and at the same time contributes to more efficient modern Deep Learning systems, saving substantial resources and reducing the carbon footprint. Authors: Daphne Theodorakopoulos (German Research Centre for Artificial Intelligence (DFKI), Marine Perception Department and Leibniz University Hannover, Institute of Artificial Intelligence); Christoph Manß (German Research Centre for Artificial Intelligence (DFKI), Marine Perception Department); Frederic Stahl (German Research Centre for Artificial Intelligence (DFKI), Marine Perception Department); Marius Lindauer (Leibniz University Hannover) |
Other Oceans and marine systems Classification, regression, and supervised learning Computer vision and remote sensing |
ICLR 2023 |
Robustly modeling the nonlinear impact of climate change on agriculture by combining econometrics and machine learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Climate change is expected to have a dramatic impact on agricultural production; however, due to natural complexity, the exact avenues and relative strengths by which this will happen are still unknown. The development of accurate forecasting models is thus of great importance to enable policy makers to design effective interventions. To date, most machine learning methods aimed at tackling this problem lack a consideration of causal structure, thereby making them unreliable for the types of counterfactual analysis necessary when making policy decisions. Econometrics has developed robust techniques for estimating cause-effect relations in time-series, specifically through the use of cointegration analysis and Granger causality. However, these methods are frequently limited in flexibility, especially in the estimation of nonlinear relationships. In this work, we propose to integrate the non-linear function approximators with the robust causal estimation methods to ultimately develop an accurate agricultural forecasting model capable of robust counterfactual analysis. This method would be a valuable new asset for government and industrial stakeholders to understand how climate change impacts agricultural production. Authors: Benedetta Francesconi (Independent Researcher); Ying-Jung C Deweese (Descartes Labs / Georgia Insititute of Technology) |
Agriculture and food Climate science and climate modeling Public policy Societal adaptation and resilience Time-series analysis |
ICLR 2023 |
Towards Green, Accurate, and Efficient AI Models Through Multi-Objective Optimization
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Machine learning is one of the fastest growing services in modern hyperscale data centers. While AI’s exponential scaling has enabled unprecedented modeling capabilities across computer vision, natural language processing, protein modeling, personalized recommendation, it comes at the expense of significant energy and environmental footprints. This work aims to co-optimize machine learning models in terms of their accuracy, compute efficiency, and environmental sustainability by using multi-objective bayesian optimization. We aim to extend current multi-objective optimization frameworks, such as the openly available Ax (adaptive experimentation) platform to balance accuracy, efficiency, and environmental sustainability of deep neural networks. In order to optimize for environmental sustainability we will consider the impact across AI model life cycles (e.g., training, inference) and hardware life cycles (e.g., manufacturing, operational use). Given this is a research proposal, we expect to demonstrate that designing for sustainable AI models yields distinct optimal neural network architectures than ones designed for accuracy and efficiency given the external impacts of varying renewable energy and tradeoffs between compute and storage for embodied carbon overheads. Authors: Udit Gupta (Harvard University); Daniel R Jiang (Meta); Maximilian Balandat (Facebook); Carole-Jean Wu (Meta AI) |
Other |
ICLR 2023 |
EfficientTempNet: Temporal Super-Resolution of Radar Rainfall
(Papers Track)
Abstract and authors: (click to expand)Abstract: Rainfall data collected by various remote sensing instruments such as radars or satellites has different space-time resolutions. This study aims to improve the temporal resolution of radar rainfall products to help with more accurate climate change modeling and studies. In this direction, we introduce a solution based on EfficientNetV2, namely EfficientTempNet, to increase the temporal resolution of radar-based rainfall products from 10 minutes to 5 minutes. We tested EfficientRainNet over a dataset for the state of Iowa, US, and compared its performance to three different baselines to show that EfficientTempNet presents a viable option for better climate change monitoring. Authors: Bekir Z Demiray (University of Iowa); Muhammed A Sit (The University of Iowa); Ibrahim Demir (University of Iowa) |
Earth observations and monitoring Climate science and climate modeling Earth science Extreme weather Computer vision and remote sensing Time-series analysis |
ICLR 2023 |
Bird Distribution Modelling using Remote Sensing and Citizen Science data
(Papers Track)
Overall Best Paper
Abstract and authors: (click to expand)Abstract: Climate change is a major driver of biodiversity loss, changing the geographic range and abundance of many species. However, there remain significant knowl- edge gaps about the distribution of species, due principally to the amount of effort and expertise required for traditional field monitoring. We propose an approach leveraging computer vision to improve species distribution modelling, combining the wide availability of remote sensing data with sparse on-ground citizen science data from .We introduce a novel task and dataset for mapping US bird species to their habitats by predicting species encounter rates from satellite images, along with baseline models which demonstrate the power of our approach. Our methods open up possibilities for scalably modelling ecosystems properties worldwide. Authors: Mélisande Teng (Mila, Université de Montréal); Amna Elmustafa (African Institute for Mathematical Science); Benjamin Akera (McGill University); Hugo Larochelle (UdeS); David Rolnick (McGill University, Mila) |
Ecosystems and biodiversity Computer vision and remote sensing |
ICLR 2023 |
Efficient HVAC Control with Deep Reinforcement Learning and EnergyPlus
(Papers Track)
Abstract and authors: (click to expand)Abstract: Heating and cooling comprise a significant fraction of the energy consumed by buildings, which in turn account for a significant fraction of society’s energy use. Most building heating, ventilation, and air conditioning (HVAC) systems use standard control schemes that meet basic operating constraints and comfort requirements but with suboptimal efficiency. Deep reinforcement learning (DRL) has shown immense potential for high-performing control in a variety of simulated settings, but has not been widely deployed for real-world control. Here we provide two contributions toward increasing the viability of real-world, DRL-based HVAC control, leveraging the EnergyPlus building simulator. First, we use the new EnergyPlus Python API to implement a first-of-its-kind, purely Python-based EnergyPlus DRL learning framework capable of generalizing to a wide variety of building configurations and weather scenarios. Second, we demonstrate an approach to constrained learning for this setting, removing the requirement to tune reward functions in order to maximize energy efficiency given temperature constraints. We tested our framework on realistic building models of a data center, an office building, and a secondary school. In each case, trained agents maintained temperature control while achieving energy savings relative to standard approaches. Authors: Jared Markowitz (Johns Hopkins University Applied Physics Laboratory); Nathan Drenkow (Johns Hopkins University Applied Physics Laboratory) |
Buildings Reinforcement learning and control |
ICLR 2023 |
Fourier Neural Operators for Arbitrary Resolution Climate Data Downscaling
(Papers Track)
Abstract and authors: (click to expand)Abstract: Running climate simulations informs us of future climate change. However, it is computationally expensive to resolve complex climate processes numerically. As one way to speed up climate simulations, neural networks have been used to downscale climate variables from fast-running low-resolution simulations. So far, all neural network downscaling models can only downscale input samples with a pre-defined upsampling factor. In this work, we propose a Fourier neural operator downscaling model. It trains with data of a small upsampling factor and then can zero-shot downscale its input to arbitrary unseen high-resolutions. Evaluated on Navier-Stokes equation solution data and ERA5 water content data, our downscaling model demonstrates better performance than widely used convolutional and adversarial generative super-resolution models in both learned and zero-shot downscaling. Our model's performance is further boosted when a constraint layer is applied. In the end, we show that by combining our downscaling model with a low-resolution numerical PDE solver, the downscaled solution outperforms the solution of the state-of-the-art high-resolution data-driven solver. Our model can be used to cheaply and accurately generate arbitrarily high-resolution climate simulation data with fast-running low-resolution simulation as input. Authors: Qidong Yang (New York University); Paula Harder (Fraunhofer ITWM); Venkatesh Ramesh (University of Montreal, Mila); Alex Hernandez-Garcia (Mila - Quebec AI Institute); Daniela Szwarcman (IBM Research); Prasanna Sattigeri (IBM Research); Campbell D Watson (IBM Reserch); David Rolnick (McGill University, Mila) |
Climate science and climate modeling Earth observations and monitoring Computer vision and remote sensing Generative modeling |
ICLR 2023 |
Data-driven multiscale modeling of subgrid parameterizations in climate models
(Papers Track)
Best ML Innovation
Abstract and authors: (click to expand)Abstract: Subgrid parameterizations that represent physical processes occurring below the resolution of current climate models are an important component in producing accurate, long-term predictions for the climate. A variety of approaches have been tested to design these components, including deep learning methods. In this work, we evaluate a proof of concept illustrating a multiscale approach to this prediction problem. We train neural networks to predict subgrid forcing values on a testbed model and examine improvements in prediction accuracy which can be obtained by using additional information in both fine-to-coarse and coarse-to-fine directions. Authors: Karl Otness (New York University); Laure Zanna (NYU); Joan Bruna (NYU) |
Climate science and climate modeling Classification, regression, and supervised learning Hybrid physical models |
ICLR 2023 |
Decision-aware uncertainty-calibrated deep learning for robust energy system operation
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Decision-making under uncertainty is an important problem that arises in many domains. Achieving robustness guarantees requires well-calibrated uncertainties, which can be difficult to achieve in high-capacity prediction models such as deep neural networks. This paper proposes an end-to-end approach for learning uncertainty-calibrated deep learning models that directly optimizes a downstream decision-making objective with provable robustness. We also propose two concrete applications in energy system operations, including a grid scheduling task as well as an energy storage arbitrage task. As renewable wind and solar generation increasingly proliferate and their variability penetrates the energy grid, learning uncertainty-aware predictive models becomes increasingly crucial for maintaining efficient and reliable grid operation. Authors: Christopher Yeh (California Institute of Technology); Nicolas Christianson (California Institute of Technology); Steven Low (California Institute of Technology); Adam Wierman (California Institute of Technology); Yisong Yue (Caltech) |
Power and energy systems Reinforcement learning and control |
ICLR 2023 |
Multi-Agent Deep Reinforcement Learning for Solar-Battery System to Mitigate Solar Curtailment in Real-Time Electricity Market
(Papers Track)
Abstract and authors: (click to expand)Abstract: The increased uptake of solar energy in the energy transition towards decarbonization has caused the issue of solar photovoltaic (PV) curtailments, resulting in significant economic losses and hindering the energy transition. To overcome this issue, battery energy storage systems (BESS) can serve as onsite backup sources for solar farms. However, the backup role of the BESS significantly limits its economic value, disincentivizing the BESS deployment due to high investment costs. Hence, it is essential to effectively reduce solar curtailment while ensuring viable operations of the BESS. Authors: Jinhao Li (Monash University); Changlong Wang (Monash University); Hao Wang (Monash University) |
Power and energy systems Reinforcement learning and control |
ICLR 2023 |
Projecting the climate penalty on pm2.5 pollution with spatial deep learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: The climate penalty measures the effects of a changing climate on air quality due to the interaction of pollution with climate factors, independently of future changes in emissions. This work introduces a statistical framework for estimating the climate penalty on soot pollution (PM 2.5), which has been linked to respiratory and cardiovascular diseases and premature mortality. The framework evaluates the disparities in future PM 2.5 exposure across racial/ethnic and income groups---an important step towards informing mitigation public health policy and promoting environmental equity in addressing the effects of climate change. The proposed methodology aims to improve existing statistical-based methods for estimating the climate penalty using an expressive and scalable predictive model based on spatial deep learning with spatiotemporal trend estimation. The proposed approach will (1) use higher-resolution climate inputs, which current statistical methods to estimate the climate penalty approaches cannot accommodate; (2) integrate additional predictive data sources such as demographics, geology, and land use; (3) consider regional dependencies and synoptic weather patterns influencing PM 2.5, deconvolving the effects of climate change from increasing air quality regulations and other sources of unmeasured spatial heterogeneity. Authors: Mauricio Tec (Harvard University); Riccardo Cadei (Harvard University); Francesca Dominici (Harvard University); Corwin Zigler (University of Texas at Austin) |
Climate justice Health Computer vision and remote sensing Time-series analysis |
ICLR 2023 |
On the impact of small-data diversity on forecasts: evidence from meteorologically-driven electricity demand in Mediterranean zones.
(Papers Track)
Abstract and authors: (click to expand)Abstract: In this paper, we compare the improvement of probabilistic electricity demand forecasts for three specific coastal and island regions using raw and pre-computed meteorological features based on empirically-tested formulations drawn from climate science literature. Typically for the general task of time-series forecasting with strong weather/climate drivers, go-to models like the Autoregressive Integrated Moving Average (ARIMA) model are built with assumptions of how independent variables will affect a dependent one and are at best encoded with a handful of exogenous features with known impact. Depending on the geographical region and/or cultural practices of a population, such a selection process may yield a non-optimal feature set which would ultimately drive a weak impact on underline demand forecasts. The aim of this work is to assess the impact of a documented set of meteorological features on electricity demand using deep learning models in comparative studies. Leveraging the defining computational architecture of the Temporal Fusion Transformer (TFT), we discover the unimportance of weather features for improving probabilistic forecasts for the targeted regions. However, through experimentation, we discover that the more stable electricity demand of the coastal Mediterranean regions, the Ceuta and Melilla autonomous cities in Morocco, improved the forecast accuracy of the strongly tourist-driven electricity demand for the Balearic islands located in Spain during the time of travel restrictions (i.e., during COVID19 (2020))--a root mean squared error (RMSE) from ~0.090 to ~0.012 with a substantially improved 10th/90th quantile bounding. Authors: Reginald Bryant (IBM Research - Africa); Julian Kuehnert (IBM Research) |
Time-series analysis Cities and urban planning Climate science and climate modeling Extreme weather Societal adaptation and resilience Uncertainty quantification and robustness |
ICLR 2023 |
Artificial Intelligence in Tropical Cyclone Forecasting
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Tropical cyclones (TC) in Bangladesh and other developing nations harm property and human lives. Theoretically, artificial intelligence (AI) can anticipate TC using tracking, intensity, and cyclone aftereffect phenomena. Although AI has a significant impact on predicting, poorer nations have struggled to adjust to its real-world applications. The interpretability of such a solution from an AI-based solution is the main factor in that situation, together with the infrastructure. Explainable AI has been extensively employed in the medical field because the outcome is so important. We believe that using explainable AI in TC forecasting is equally important as one large forecast can cause the thought of life loss. Additionally, it will improve the long-term viability of the AI-based weather forecasting system. To be more specific, we want to employ explainable AI in every way feasible in this study to address the problems of TC forecasting, intensity estimate, and tracking. We'll do this by using the graph neural network. The adoption of AI-based solutions in underdeveloped nations will be aided by this solution, which will boost their acceptance. With this effort, we also hope to tackle the challenge of sustainable AI in order to tackle the issue of climate change on a larger scale. However, Cyclone forecasting might be transformed by sustainable AI, guaranteeing precise and early predictions to lessen the effects of these deadly storms. The examination of vast volumes of meteorological data to increase forecasting accuracy is made possible by the combination of AI algorithms and cutting-edge technologies like machine learning and big data analytics. Improved accuracy is one of the main advantages of sustainable AI for cyclone prediction. To provide more precise forecasts, AI systems can evaluate a lot of meteorological data, including satellite imagery and ocean temperature readings. Authors: Dr. Nusrat Sharmin (Military Institute of Science and Technology); Professor Dr. Md. Mahbubur Rahman Rahman (Military Institute of Science and Technology (MIST)); Sabbir Rahman (Military Institute of Science and Technology); Mokhlesur Rahman (Military Institute of Science and Technology) |
Climate science and climate modeling Disaster management and relief Interpretable ML |
NeurIPS 2022 |
Function Approximations for Reinforcement Learning Controller for Wave Energy Converters
(Papers Track)
Abstract and authors: (click to expand)Abstract: Waves are a more consistent form of clean energy than wind and solar and the latest Wave Energy Converters (WEC) platforms like CETO 6 have evolved into complex multi-generator designs with a high energy capture potential for financial viability. Multi-Agent Reinforcement Learning (MARL) controller can handle these complexities and control the WEC optimally unlike the default engineering controllers like Spring Damper which suffer from lower energy capture and mechanical stress from the spinning yaw motion. In this paper, we look beyond the normal hyper-parameter and MARL agent tuning and explored the most suitable architecture for the neural network function approximators for the policy and critic networks of MARL which act as its brain. We found that unlike the commonly used fully connected network (FCN) for MARL, the sequential models like transformers and LSTMs can model the WEC system dynamics better. Our novel transformer architecture, Skip Transformer-XL (STrXL), with several gated residual connections in and around the transformer block performed better than the state-of-the-art with faster training convergence. STrXL boosts energy efficiency by an average of 25% to 28% over the existing spring damper (SD) controller for waves at different angles and almost eliminated the mechanical stress from the rotational yaw motion, saving costly maintenance on open seas, and thus reducing the Levelized Cost of wave energy (LCOE). Demo: https://tinyurl.com/4s4mmb9v Authors: Soumyendu Sarkar (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Alexander Shmakov (UC Irvine); Sahand Ghorbanpour (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Alexandre Pichard (Carnegie Clean Energy); Mathieu Cocho (Carnegie Clean Energy) |
Power and energy systems Reinforcement learning and control |
NeurIPS 2022 |
Image-Based Soil Organic Carbon Estimation from Multispectral Satellite Images with Fourier Neural Operator and Structural Similarity
(Papers Track)
Abstract and authors: (click to expand)Abstract: Soil organic carbon (SOC) sequestration is the transfer and storage of atmospheric carbon dioxide in soils, which plays an important role in climate change mitigation. SOC concentration can be improved by proper land use, thus it is beneficial if SOC can be estimated at a regional or global scale. As multispectral satellite data can provide SOC-related information such as vegetation and soil properties at a global scale, estimation of SOC through satellite data has been explored as an alternative to manual soil sampling. Although existing works show promising results, most studies are based on pixel-based approaches with traditional machine learning methods, and convolutional neural networks (CNNs) are seldom used. To study the advantages of using CNNs on SOC remote sensing, in this paper, we propose the FNO-DenseNet based on the state-of-the-art Fourier neural operator (FNO). By combining the advantages of the FNO and DenseNet, the FNO-DenseNet outperformed the FNO in our experiments with hundreds of times fewer parameters. The FNO-DenseNet also outperformed a pixel-based random forest by 18% in the mean absolute percentage error. To the best of our knowledge, this is the first work of applying the FNO on SOC remote sensing. Authors: Ken C. L. Wong (IBM Research – Almaden Research Center); Levente Klein (IBM Research); Ademir Ferreira da Silva (IBM Research); Hongzhi Wang (IBM Almaden Research Center); Jitendra Singh (IBM Research - India); Tanveer Syeda-Mahmood (IBM Research) |
Carbon capture and sequestration Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2022 |
SolarDK: A high-resolution urban solar panel image classification and localization dataset
(Papers Track)
Abstract and authors: (click to expand)Abstract: The body of research on classification of solar panel arrays from aerial imagery is increasing, yet there are still not many public benchmark datasets. This paper introduces two novel benchmark datasets for classifying and localizing solar panel arrays in Denmark: A human annotated dataset for classification and segmentation, as well as a classification dataset acquired using self-reported data from the Danish national building registry. We explore the performance of prior works on the new benchmark dataset, and present results after fine-tuning models using a similar approach as recent works. Furthermore, we train models of newer architectures and provide benchmark baselines to our datasets in several scenarios. We believe the release of these datasets may improve future research in both local and global geospatial domains for identifying and mapping of solar panel arrays from aerial imagery. The data is accessible at https://osf.io/aj539/. Authors: Maxim MK Khomiakov (DTU); Julius Radzikowski (DTU); Carl Schmidt (DTU); Mathias Sørensen (DTU); Mads Andersen (DTU); Michael Andersen (Technical University of Denmark); Jes Frellsen (Technical University of Denmark) |
Buildings Power and energy systems Computer vision and remote sensing |
NeurIPS 2022 |
Bayesian inference for aerosol vertical profiles
(Papers Track)
Abstract and authors: (click to expand)Abstract: Aerosol-cloud interactions constitute the largest source of uncertainty in assessments of the anthropogenic climate change. This uncertainty arises in part from the difficulty in measuring the vertical distributions of aerosols. We often have to settle for less informative vertically aggregated proxies such as aerosol optical depth (AOD). In this work, we develop a framework to infer vertical aerosol profiles using AOD and readily available vertically resolved meteorological predictors such as temperature or relative humidity. We devise a simple Gaussian process prior over aerosol vertical profiles and update it with AOD observations. We validate our approach using ECHAM-HAM aerosol-climate model data. Our results show that, while simple, our model is able to reconstruct realistic extinction profiles with well-calibrated uncertainty. In particular, the model demonstrates a faithful reconstruction of extinction patterns arising from aerosol water uptake in the boundary layer. Authors: Shahine Bouabid (University of Oxford); Duncan Watson-Parris (University of Oxford); Dino Sejdinovic (University of Adelaide) |
Climate science and climate modeling Causal and Bayesian methods |
NeurIPS 2022 |
Optimizing toward efficiency for SAR image ship detection
(Papers Track)
Abstract and authors: (click to expand)Abstract: The detection and prevention of illegal fishing is critical to maintaining a healthy and functional ecosystem. Recent research on ship detection in satellite imagery has focused exclusively on performance improvements, disregarding detection efficiency. However, the speed and compute cost of vessel detection are essential for a timely intervention to prevent illegal fishing. Therefore, we investigated optimization methods that lower detection time and cost with minimal performance loss. We trained an object detection model based on a convolutional neural network (CNN) using a dataset of satellite images. Then, we designed two efficiency optimizations that can be applied to the base CNN or any other base model. The optimizations consist of a fast, cheap classification model and a statistical algorithm. The integration of the optimizations with the object detection model leads to a trade-off between speed and performance. We studied the trade-off using metrics that give different weight to execution time and performance. We show that by using a classification model the average precision of the detection model can be approximated to 99.5% in 44% of the time or to 92.7% in 25% of the time. Authors: Arthur Van Meerbeeck (KULeuven); Ruben Cartuyvels (KULeuven); Jordy Van Landeghem (KULeuven); Sien Moens (KU Leuven) |
Computer vision and remote sensing Ecosystems and biodiversity Oceans and marine systems |
NeurIPS 2022 |
AutoML-based Almond Yield Prediction and Projection in California
(Papers Track)
Abstract and authors: (click to expand)Abstract: Almonds are one of the most lucrative products of California, but are also among the most sensitive to climate change. In order to better understand the relationship between climatic factors and almond yield, an automated machine learning framework is used to build a collection of machine learning models. The prediction skill is assessed using historical records. Future projections are derived using 17 downscaled climate outputs. The ensemble mean projection displays almond yield changes under two different climate scenarios, along with two technology development scenarios, where the role of technology development is highlighted. The mean projections and distributions provide insightful results to stakeholders and can be utilized by policymakers for climate adaptation. Authors: Shiheng Duan (Lawrence Livermore National Laboratory); Shuaiqi Wu (University of California, Davis); Erwan Monier (University of California, Davis); Paul Ullrich (University of California, Davis) |
Agriculture and food Classification, regression, and supervised learning |
NeurIPS 2022 |
Attention-Based Scattering Network for Satellite Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Multi-channel satellite imagery, from stacked spectral bands or spatiotemporal data, have meaningful representations for various atmospheric properties. Combining these features in an effective manner to create a performant and trustworthy model is of utmost importance to forecasters. Neural networks show promise, yet suffer from unintuitive computations, fusion of high-level features, and may be limited by the quantity of available data. In this work, we leverage the scattering transform to extract high-level features without additional trainable parameters and introduce a separation scheme to bring attention to independent input channels. Experiments show promising results on estimating tropical cyclone intensity and predicting the occurrence of lightning from satellite imagery. Authors: Jason Stock (Colorado State University); Charles Anderson (Colorado State University) |
Earth science Earth observations and monitoring Classification, regression, and supervised learning Interpretable ML |
NeurIPS 2022 |
Discovering Interpretable Structural Model Errors in Climate Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Inaccuracies in the models of the Earth system, i.e., structural and parametric model errors, lead to inaccurate climate change projections. Errors in the model can originate from unresolved phenomena due to a low numerical resolution, as well as misrepresentations of physical phenomena or boundaries (e.g., orography). Therefore, such models lead to inaccurate short--term forecasts of weather and extreme events, and more importantly, long term climate projections. While calibration methods have been introduced to address for parametric uncertainties, e.g., by better estimation of system parameters from observations, addressing structural uncertainties, especially in an interpretable manner, remains a major challenge. Therefore, with increases in both the amount and frequency of observations of the Earth system, algorithmic innovations are required to identify interpretable representations of the model errors from observations. We introduce a flexible, general-purpose framework to discover interpretable model errors, and show its performance on a canonical prototype of geophysical turbulence, the two--level quasi--geostrophic system. Accordingly, a Bayesian sparsity--promoting regression framework is proposed, that uses a library of kernels for discovery of model errors. As calculating the library from noisy and sparse data (e.g., from observations) using convectional techniques leads to interpolation errors, here we use a coordinate-based multi--layer embedding to impute the sparse observations. We demonstrate the importance of alleviating spectral bias, and propose a random Fourier feature layer to reduce it in the proposed embeddings, and subsequently enable an accurate discovery. Our framework is demonstrated to successfully identify structural model errors due to linear and nonlinear processes (e.g., radiation, surface friction, advection), as well as misrepresented orography. Authors: Rambod Mojgani (Rice University); Ashesh K Chattopadhyay (Rice University); Pedram Hassanzadeh (Rice University) |
Climate science and climate modeling Earth observations and monitoring Interpretable ML |
NeurIPS 2022 |
Aboveground carbon biomass estimate with Physics-informed deep network
(Papers Track)
Abstract and authors: (click to expand)Abstract: The global carbon cycle is a key process to understand how our climate is changing. However, monitoring the dynamics is difficult because a high-resolution robust measurement of key state parameters including the aboveground carbon biomass (AGB) is required. We use deep neural network to generate a wall-to-wall map of AGB within the Continental USA (CONUS) with 30-meter spatial resolution for the year 2021. We combine radar and optical hyperspectral imagery, with a physical climate parameter of solar-induced chlorophyll fluorescence (SIF)-based gross primary productivity (GPP). Validation results show that a masked variation of UNet has the lowest validation RMSE of 37.93 ± 1.36 Mg C/ha, as compared to 52.30 ± 0.03 Mg C/ha for random forest algorithm. Furthermore, models that learn from SIF-based GPP in addition to radar and optical imagery reduce validation RMSE by almost 10% and the standard deviation by 40%. Finally, we apply our model to measure losses in AGB from the recent 2021 Caldor wildfire in California, and validate our analysis with Sentinel-based burn index. Authors: Juan Nathaniel (Columbia University); Levente Klein (IBM Research); Campbell D Watson (IBM Reserch); Gabrielle Nyirjesy (Columbia University); Conrad M Albrecht (IBM Research) |
Computer vision and remote sensing Forestry and other land use Hybrid physical models |
NeurIPS 2022 |
Improving the predictions of ML-corrected climate models with novelty detection
(Papers Track)
Abstract and authors: (click to expand)Abstract: While previous works have shown that machine learning (ML) can improve the prediction accuracy of coarse-grid climate models, these ML-augmented methods are more vulnerable to irregular inputs than the traditional physics-based models they rely on. Because ML-predicted corrections feed back into the climate model’s base physics, the ML-corrected model regularly produces out of sample data, which can cause model instability and frequent crashes. This work shows that adding semi-supervised novelty detection to identify out-of-sample data and disable the ML-correction accordingly stabilizes simulations and sharply improves the quality of predictions. We design an augmented climate model with a one-class support vector machine (OCSVM) novelty detector that provides better temperature and precipitation forecasts in a year-long simulation than either a baseline (no-ML) or a standard ML-corrected run. By improving the accuracy of coarse-grid climate models, this work helps make accurate climate models accessible to researchers without massive computational resources. Authors: Clayton H Sanford (Columbia); Anna Kwa (Allen Institute for Artificial Intelligence); Oliver Watt-Meyer (Allen Institute for AI); Spencer Clark (Allen Institute for AI); Noah Brenowitz (Allen Institute for AI); Jeremy McGibbon (Allen Institute for AI); Christopher Bretherton (Allen Institute for AI) |
Climate science and climate modeling Hybrid physical models Uncertainty quantification and robustness Unsupervised and semi-supervised learning |
NeurIPS 2022 |
Levee protected area detection for improved flood risk assessment in global hydrology models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Precise flood risk assessment is needed to reduce human societies vulnerability as climate change increases hazard risk and exposure related to floods. Levees are built to protect people and goods from flood, which alters river hydrology, but are still not accounted for by global hydrological model. Detecting and integrating levee structures to global hydrological simulations is thus expected to enable more precise flood simulation and risk assessment, with important consequences for flood risk mitigation. In this work, we propose a new formulation to the problem of identifying levee structures: instead of detecting levees themselves, we focus on segmenting the region of the floodplain they protect. This formulation allows to better identify protected areas, to leverage the structure of hydrological data, and to simplify the integration of levee information to global hydrological models. Authors: Masato Ikegawa (Kobe University); Tristan E.M Hascoet (Kobe University); Victor Pellet (Observatoire de Paris); Xudong Zhou (The University of Tokyo); Tetsuya Takiguchi (Kobe University); Dai Yamazaki (The University of Tokyo) |
Disaster management and relief Classification, regression, and supervised learning |
NeurIPS 2022 |
Scene-to-Patch Earth Observation: Multiple Instance Learning for Land Cover Classification
(Papers Track)
Abstract and authors: (click to expand)Abstract: Land cover classification (LCC), and monitoring how land use changes over time, is an important process in climate change mitigation and adaptation. Existing approaches that use machine learning with Earth observation data for LCC rely on fully-annotated and segmented datasets. Creating these datasets requires a large amount of effort, and a lack of suitable datasets has become an obstacle in scaling the use of LCC. In this study, we propose Scene-to-Patch models: an alternative LCC approach utilising Multiple Instance Learning (MIL) that requires only high-level scene labels. This enables much faster development of new datasets whilst still providing segmentation through patch-level predictions, ultimately increasing the accessibility of using LCC for different scenarios. On the DeepGlobe-LCC dataset, our approach outperforms non-MIL baselines on both scene- and patch-level prediction. This work provides the foundation for expanding the use of LCC in climate change mitigation methods for technology, government, and academia. Authors: Joseph Early (University of Southampton); Ying-Jung C Deweese (Georgia Insititute of Technology); Christine Evers (University of Southampton); Sarvapali Ramchurn (University of Southampton) |
Earth observations and monitoring Agriculture and food Cities and urban planning Forestry and other land use Classification, regression, and supervised learning Interpretable ML Unsupervised and semi-supervised learning |
NeurIPS 2022 |
Deep learning for downscaling tropical cyclone rainfall
(Papers Track)
Abstract and authors: (click to expand)Abstract: Flooding is often the leading cause of mortality and damages from tropical cyclones. With rainfall from tropical cyclones set to rise under global warming, better estimates of extreme rainfall are required to better support resilience efforts. While high resolution climate models capture tropical cyclone statistics well, they are computationally expensive leading to a trade-off between accuracy and generating enough ensemble members to generate sufficient high impact, low probability events. Often, downscaling models are used as a computationally cheaper alternative. Here, we develop and evaluate a set of deep learning models for downscaling tropical cyclone rainfall for more robust risk analysis. Authors: Emily Vosper (University of Bristol); Lucy Harris (University of Oxford); Andrew McRae (University of Oxford); Laurence Aitchison (University of Bristol); Peter Watson (Bristol); Raul Santos Rodriguez (University of Bristol); Dann Mitchell (University of Bristol) |
Extreme weather Climate science and climate modeling Interpretable ML |
NeurIPS 2022 |
Short-term Prediction and Filtering of Solar Power Using State-Space Gaussian Processes
(Papers Track)
Abstract and authors: (click to expand)Abstract: Short-term forecasting of solar photovoltaic energy (PV) production is important for powerplant management. Ideally these forecasts are equipped with error bars, so that downstream decisions can account for uncertainty. To produce predictions with error bars in this setting, we consider Gaussian processes (GPs) for modelling and predicting solar photovoltaic energy production in the UK. A standard application of GP regression on the PV timeseries data is infeasible due to the large data size and non-Gaussianity of PV readings. However, this is made possible by leveraging recent advances in scalable GP inference, in particular, by using the state-space form of GPs, combined with modern variational inference techniques. The resulting model is not only scalable to large datasets but can also handle continuous data streams via Kalman filtering. Authors: So Takao (UCL); Sean Nassimiha (UCL); Peter Dudfield (Open Climate Fix); Jack Kelly (Open Climate Fix); Marc Deisenroth (University College London) |
Time-series analysis Climate science and climate modeling Power and energy systems Causal and Bayesian methods Uncertainty quantification and robustness |
NeurIPS 2022 |
Identifying latent climate signals using sparse hierarchical Gaussian processes
(Papers Track)
Abstract and authors: (click to expand)Abstract: Extracting latent climate signals from multiple climate model simulations is important to estimate future climate change. To tackle this we develop a sparse hierarchical Gaussian process (SHGP), which probabilistically learns a latent distribution from a set of vectors. We use this to predict the latent surface temperature change globally and for central England from an ensemble of climate models, in a scalable manner and with robust uncertainty propagation. Authors: Matt Amos (Lancaster University); Thomas Pinder (Lancaster University); Paul Young (Lancaster University) |
Climate science and climate modeling Earth science Causal and Bayesian methods Uncertainty quantification and robustness |
NeurIPS 2022 |
Towards dynamical stability analysis of sustainable power grids using Graph Neural Networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: To mitigate climate change, the share of renewable needs to be increased. Renewable energies introduce new challenges to power grids due to decentralization, reduced inertia and volatility in production. The operation of sustainable power grids with a high penetration of renewable energies requires new methods to analyze the dynamical stability. We provide new datasets of dynamical stability of synthetic power grids, and find that graph neural networks (GNNs) are surprisingly effective at predicting the highly non-linear target from topological information only. To illustrate the potential to scale to real-sized power grids, we demonstrate the successful prediction on a Texan power grid model. Authors: Christian Nauck (PIK); Michael Lindner (PIK); Konstantin Schürholt (University of St. Gallen); Frank Hellmann (PIK) |
Power and energy systems Classification, regression, and supervised learning |
NeurIPS 2022 |
Detecting Methane Plumes using PRISMA: Deep Learning Model and Data Augmentation
(Papers Track)
Abstract and authors: (click to expand)Abstract: The new generation of hyperspectral imagers, such as PRISMA, has improved significantly our detection capability of methane (CH4) plumes from space at high spatial resolution (∼30m). We present here a complete framework to identify CH4 plumes using images from the PRISMA satellite mission and a deep learning technique able to automatically detect plumes over large areas. To compensate for the sparse database of PRISMA images, we trained our model by transposing high resolution plumes from Sentinel-2 to PRISMA. Our methodology avoids computationally expensive synthetic plume from Large Eddy Simulations while generating a broad and realistic training database, and paves the way for large-scale detection of methane plumes using future hyperspectral sensors (EnMAP, EMIT, CarbonMapper). Authors: Alexis Groshenry (Kayrros); Clément Giron (Kayrros); Alexandre d'Aspremont (CNRS, DI, Ecole Normale Supérieure; Kayrros); Thomas Lauvaux (University of Reims Champagne Ardenne, GSMA, UMR 7331); Thibaud Ehret (Centre Borelli) |
Computer vision and remote sensing Earth observations and monitoring Earth science Classification, regression, and supervised learning Meta- and transfer learning |
NeurIPS 2022 |
Probabilistic forecasting of regional photovoltaic power production based on satellite-derived cloud motion
(Papers Track)
Abstract and authors: (click to expand)Abstract: Solar energy generation drastically increased in the last years, and it is expected to grow even more in the next decades. So, accurate intra-day forecasts are needed to improve the predictability of the photovoltaic power production and associated balancing measures to increase the shares of renewable energy in the power grid. Most forecasting methods require numerical weather predictions, which are slow to compute, or long-term datasets to run the forecast. These issues make the models difficult to implement in an operational setting. To overcome these problems, we propose a novel regional intraday probabilistic PV power forecasting model able to exploit only 2 hours of satellite-derived cloudiness maps to produce the ensemble forecast. The model is easy to implement in an operational setting as it is based on Pysteps, an already-operational Python library for precipitation nowcasting. With few adaptations of the Steps algorithm, we reached state-of-the-art performance, reaching a 71% lower RMSE than the Persistence model and a 50% lower CRPS than the Persistence Ensemble model for forecast lead times up to 4 hours. Authors: Alberto Carpentieri (Bern University of Applied Science); Doris Folini (Institute for Atmospheric and Climate Science, ETH Zurich); Martin Wild (Institute for Atmospheric and Climate Science, ETH Zurich); Angela Meyer (Bern University of Applied Science) |
Power and energy systems Computer vision and remote sensing Uncertainty quantification and robustness |
NeurIPS 2022 |
Robustifying machine-learned algorithms for efficient grid operation
(Papers Track)
Abstract and authors: (click to expand)Abstract: We propose a learning-augmented algorithm, RobustML, for operation of dispatchable generation that exploits the good performance of a machine-learned algorithm while providing worst-case guarantees on cost. We evaluate the algorithm on a realistic two-generator system, where it exhibits robustness to distribution shift while enabling improved efficiency as renewable penetration increases. Authors: Nicolas Christianson (California Institute of Technology); Christopher Yeh (California Institute of Technology); Tongxin Li (The Chinese University of Hong Kong (Shenzhen)); Mahdi Torabi Rad (Beyond Limits); Azarang Golmohammadi (Beyond Limits, Inc.); Adam Wierman (California Institute of Technology) |
Uncertainty quantification and robustness Power and energy systems Reinforcement learning and control |
NeurIPS 2022 |
Deep Hydrology: Hourly, Gap-Free Flood Maps Through Joint Satellite and Hydrologic Modelling
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change-driven weather disasters are rapidly increasing in both frequency and magnitude. Floods are the most damaging of these disasters, with approximately 1.46 billion people exposed to inundation depths of over 0.15m, a significant life and livelihood risk. Accurate knowledge of flood-extent for ongoing and historical events facilitates climate adaptation in flood-prone communities by enabling near real-time disaster monitoring to support planning, response, and relief during these extreme events. Satellite observations can be used to derive flood-extent maps directly; however, these observations are impeded by cloud and canopy cover, and can be very infrequent and hence miss the flood completely. In contrast, physically-based inundation models can produce spatially complete event maps but suffer from high uncertainty if not frequently calibrated with expensive land and infrastructure surveys. In this study, we propose a deep learning approach to reproduce satellite-observed fractional flood-extent maps given dynamic state variables from hydrologic models, fusing information contained within the states with direct observations from satellites. Our model has an hourly temporal resolution, contains no cloud-gaps, and generalizes to watersheds across the continental United States with a 6% error on held-out areas that never flooded before. We further demonstrate through a case study in Houston, Texas that our model can distinguish tropical cyclones that caused flooding from those that did not within two days of landfall, thereby providing a reliable source for flood-extent maps that can be used by disaster monitoring services. Authors: Tanya Nair (Cloud To Street); Veda Sunkara (Cloud to Street); Jonathan Frame (Cloud to Street); Philip Popien (Cloud to Street); Subit Chakrabarti (Cloud To Street) |
Computer vision and remote sensing Climate justice Climate science and climate modeling Disaster management and relief Earth science Extreme weather Hybrid physical models |
NeurIPS 2022 |
Machine learning emulation of a local-scale UK climate model
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is causing the intensification of rainfall extremes. Precipitation projections with high spatial resolution are important for society to prepare for these changes, e.g. to model flooding impacts. Physics-based simulations for creating such projections are very computationally expensive. This work demonstrates the effectiveness of diffusion models, a form of deep generative models, for generating much more cheaply realistic high resolution rainfall samples for the UK conditioned on data from a low resolution simulation. We show for the first time a machine learning model that is able to produce realistic high-resolution rainfall predictions based on a physical model that resolves atmospheric convection, a key process behind extreme rainfall. By adding self-learnt, location-specific information to low resolution relative vorticity, quantiles and time-mean of the samples match well their counterparts from the high-resolution simulation. Authors: Henry Addison (University of Bristol); Elizabeth Kendon (Met Office Hadley Centre); Suman Ravuri (DeepMind); Laurence Aitchison (University of Bristol); Peter Watson (Bristol) |
Climate science and climate modeling Extreme weather Generative modeling |
NeurIPS 2022 |
Bridging the Microwave Data Gap; Using Bayesian Deep Learning to “See” the Unseen
(Papers Track)
Abstract and authors: (click to expand)Abstract: Having microwave data with the spatial and temporal resolution of infrared data would provide a large positive impact on many climate and weather applications. We demonstrate that Bayesian deep learning is a promising technique for both creating and improving synthetic microwave data from infrared data. We report 0.7% mean absolute percentage error for 183+/-3 GHz microwave brightness temperature and uncertainty metrics and find that more training data is needed to achieve improved performance at 166 GHz, 37 GHz, and 23 GHz. Analysis of the spatial distribution of uncertainty reveals that additional cloud data will provide the greatest increase in skill, which will potentially allow for generation of many secondary products derived from microwave data in the future. Authors: Pedro Ortiz (Naval Postgraduate School); Eleanor Casas (Naval Postgraduate School); Marko Orescanin (Naval Postgraduate School); Scott Powell (Naval Postgraduate School) |
Earth observations and monitoring Climate science and climate modeling Earth science Extreme weather Causal and Bayesian methods Classification, regression, and supervised learning Uncertainty quantification and robustness |
NeurIPS 2022 |
Learning evapotranspiration dataset corrections from water cycle closure supervision
(Papers Track)
Abstract and authors: (click to expand)Abstract: Evapotranspiration (ET) is one of the most uncertain components of the global water cycle. Improving global ET estimates is needed to better our understanding of the global water cycle so as to forecast the consequences of climate change on the future of global water resource distribution. This work presents a methodology to derive monthly corrections of global ET datasets at 0.25 degree resolution. We use ML to generalize sparse catchment-level water cycle closure residual information to global and dense pixel-level residuals. Our model takes a probabilistic view on ET datasets and their correction that we use to regress catchment-level residuals using a sum-aggregated supervision. Using four global ET datasets, we show that our learned model has learned ET corrections that accurately generalize its water cycle-closure results to unseen catchments. Authors: Tristan E.M Hascoet (Kobe University); Victor Pellet (LERMA); Filipe Aires (LERMA) |
Earth observations and monitoring Climate science and climate modeling Causal and Bayesian methods Classification, regression, and supervised learning |
NeurIPS 2022 |
Convolutional Neural Processes for Inpainting Satellite Images: Application to Water Body Segmentation
(Papers Track)
Abstract and authors: (click to expand)Abstract: The widespread availability of satellite images has allowed researchers to monitor the impact of climate on socio-economic and environmental issues through examples like crop and water body classification to measure food scarcity and risk of flooding. However, a common issue of satellite images is missing values due to measurement defects, which render them unusable by existing methods without data imputation. To repair the data, inpainting methods can be employed, which are based on classical PDEs or interpolation methods. Recently, deep learning approaches have shown promise in this realm, however many of these methods do not explicitly take into account the inherent spatio-temporal structure of satellite images. In this work, we cast satellite image inpainting as a meta-learning problem, and implement Convolutional Neural Processes (ConvNPs) in which we frame each satellite image as its own task or 2D regression problem. We show that ConvNPs outperform classical methods and state-of-the-art deep learning inpainting models on a scanline problem for LANDSAT 7 satellite images, assessed on a variety of in- and out-of-distribution images. Our results successfully match the performance of clean images on a downstream water body segmentation task in Canada. Authors: Alexander Pondaven (Imperial College London); Mart Bakler (Imperial College London); Donghu Guo (Imperial College London); Hamzah Hashim (Imperial College London); Martin G Ignatov (Imperial college London); Samir Bhatt (Imperial College London); Seth Flaxman (Oxford); Swapnil Mishra (Imperial College London); Elie Alhajjar (USMA); Harrison Zhu (Imperial College London) |
Computer vision and remote sensing Earth observations and monitoring Generative modeling Meta- and transfer learning |
NeurIPS 2022 |
A POMDP Model for Safe Geological Carbon Sequestration
(Papers Track)
Abstract and authors: (click to expand)Abstract: Geological carbon capture and sequestration (CCS), where CO2 is stored in subsurface formations, is a promising and scalable approach for reducing global emissions.However, if done incorrectly, it may lead to earthquakes and leakage of CO2 back to the surface, harming both humans and the environment. These risks are exacerbated by the large amount of uncertainty in the structure of the storage formation. For these reasons, we propose that CCS operations be modeled as a partially observable Markov decision process (POMDP) and decisions be informed using automated planning algorithms. To this end, we develop a simplified model of CCS operations based on a 2D spillpoint analysis that retains many of the challenges and safety considerations of the real-world problem. We show how off-the-shelf POMDP solvers outperform expert baselines for safe CCS planning. This POMDP model can be used as a test bed to drive the development of novel decision-making algorithms for CCS operations. Authors: Anthony Corso (Stanford University); Yizheng Wang (Stanford Univerity); Markus Zechner (Stanford University); Jef Caers (Stanford University); Mykel J Kochenderfer (Stanford University) |
Carbon capture and sequestration Reinforcement learning and control |
NeurIPS 2022 |
Optimizing Japanese dam reservoir inflow forecast for efficient operation
(Papers Track)
Abstract and authors: (click to expand)Abstract: Despite a climate and topology favorable to hydropower (HP) generation, HP only accounts for 4% of today’s Japanese primary energy consumption mix. In recent years, calls for improving the efficiency of Japanese HP towards achieving a more sustainable energy mix have emerged from prominent voices in the Ministry of Land, Infrastructure, Transport and Tourism (MILT). Among potential optimizations, data-driven dam operation policies using accurate river discharge forecasts have been advocated for. In the meantime, Machine Learning (ML) has recently made important strides in hydrological modeling, with forecast accuracy improvements demonstrated on both precipitation nowcasting and river discharge prediction. We are motivated by the convergence of these societal and technological contexts: our final goal is to provide scientific evidence and actionable insights for dam infrastructure managers and policy makers to implement more energy-efficient and flood-resistant dam operation policies on a national scale. Towards this goal this work presents a preliminary study of ML-based dam inflow forecasts on a dataset of 127 Japanese public dams we assembled. We discuss our preliminary results and lay out a path for future studies. Authors: Keisuke Yoshimi (Kobe University); Tristan E.M Hascoet (Kobe University); Rousslan F. Julien Dossa (Kobe University); Ryoichi Takashima (Kobe University); Tetsuya Takiguchi (Kobe University); Satoru Oishi (Kobe University) |
Earth science Time-series analysis |
NeurIPS 2022 |
Deep Climate Change: A Dataset and Adaptive domain pre-trained Language Models for Climate Change Related Tasks
(Papers Track)
Abstract and authors: (click to expand)Abstract: The quantity and quality of literature around climate change (CC) and its impacts are increasing yearly. Yet, this field has received limited attention in the Natural Language Processing (NLP) community. With the help of large Language Models (LMs) and transfer learning, NLP can support policymakers, researchers, and climate activists in making sense of large-scale and complex CC-related texts. CC-related texts include specific language that general language models cannot represent accurately. Therefore we collected a climate change corpus consisting of over 360 thousand abstracts of top climate scientists' articles from trustable sources covering large temporal and spatial scales. Comparison of the performance of GPT2 LM and our 'climateGPT2 models', fine-tuned on the CC-related corpus, on claim generation (text generation) and fact-checking, downstream tasks show the better performance of the climateGPT2 models compared to the GPT2. The climateGPT2 models decrease the validation loss to 1.08 for claim generation from 43.4 obtained by GPT2. We found that climateGPT2 models improved the masked language model objective for the fact-checking task by increasing the F1 score from 0.67 to 0.72. Authors: Saeid Vaghefi (University of Zürich); Veruska Muccione (University of Zürich); Christian Huggel (University of Zürich); Hamed Khashehchi (2w2e GmbH); Markus Leippold (University of Zurich) |
Natural language processing Meta- and transfer learning |
NeurIPS 2022 |
Data-Driven Optimal Solver for Coordinating a Sustainable and Stable Power Grid
(Papers Track)
Abstract and authors: (click to expand)Abstract: With today's pressing climate change concerns, the widespread integration of low-carbon technologies such as sustainable generation systems (e.g. photovoltaics, wind turbines, etc.) and flexible consumer devices (e.g. storage, electric vehicles, smart appliances, etc.) into the electric grid is vital. Although these power entities can be deployed at large, these are highly variable in nature and must interact with the existing grid infrastructure without violating electrical limits so that the system continues to operate in a stable manner at all times. In order to ensure the integrity of grid operations while also being economical, system operators will need to rapidly solve the optimal power flow (OPF) problem in order to adapt to these fluctuations. Inherent non-convexities in the OPF problem do not allow traditional model-based optimization techniques to offer guarantees on optimality, feasibility and convergence. In this paper, we propose a data-driven OPF solver built on information-theoretic and semi-supervised machine learning constructs. We show that this solver is able to rapidly compute solutions (i.e. in sub-second range) that are within 3\% of optimality with guarantees on feasibility on a benchmark IEEE 118-bus system. Authors: Junfei Wang (York University); Pirathayini Srikantha (York University) |
Power and energy systems Climate finance and economics Unsupervised and semi-supervised learning |
NeurIPS 2022 |
Don't Waste Data: Transfer Learning to Leverage All Data for Machine-Learnt Climate Model Emulation
(Papers Track)
Abstract and authors: (click to expand)Abstract: How can we learn from all available data when training machine-learnt climate models, without incurring any extra cost at simulation time? Typically, the training data comprises coarse-grained high-resolution data. But only keeping this coarse-grained data means the rest of the high-resolution data is thrown out. We use a transfer learning approach, which can be applied to a range of machine learning models, to leverage all the high-resolution data. We use three chaotic systems to show it stabilises training, gives improved generalisation performance and results in better forecasting skill. Our code is at https://github.com/raghul-parthipan/dont_waste_data Authors: Raghul Parthipan (University of Cambridge); Damon Wischik (Univeristy of Cambridge) |
Climate science and climate modeling Generative modeling Meta- and transfer learning Time-series analysis |
NeurIPS 2022 |
Explainable Multi-Agent Recommendation System for Energy-Efficient Decision Support in Smart Homes
(Papers Track)
Abstract and authors: (click to expand)Abstract: Transparent, understandable, and persuasive recommendations support the electricity consumers’ behavioral change to tackle the energy efficiency problem. This paper proposes an explainable multi-agent recommendation system for load shifting for household appliances. First, we extend a novel multi-agent approach by designing an Explainability Agent that provides explainable recommendations for optimal appliance scheduling in a textual and visual manner. Second, we enhance the predictive capacity of other agents by including weather data and applying state-of-the-art models (i.e., k-nearest-neighbours, extreme gradient boosting, adaptive boosting, random forest, logistic regression, and explainable boosting machines). Since we want to help the user understand a single recommendation, we focus on local explainability approaches. In particular, we apply post-model approaches LIME (local, interpretable, model-agnostic explanation) and SHAP (Shapley additive explanations) as model-agnostic tools that can explain the predictions of the chosen classifiers. We further provide an overview of the predictive and explainability performance. Our results show a substantial improvement in the performance of the multi-agent system while at the same time opening up the “black box” of recommendations. To show the pathway to positive impact regarding climate change, we provide a discussion on the potential impact of the suggested approach. Authors: Alona Zharova (Humboldt University of Berlin); Annika Boer (Humboldt University of Berlin); Julia Knoblauch (Humboldt University of Berlin); Kai Ingo Schewina (Humboldt University of Berlin); Jana Vihs (Humboldt University of Berlin) |
Recommender systems Buildings Power and energy systems Interpretable ML |
NeurIPS 2022 |
FIRO: A Deep-neural Network for Wildfire Forecast with Interpretable Hidden States
(Papers Track)
Abstract and authors: (click to expand)Abstract: Several wildfire danger systems have emerged from decades of research. One such system is the National Fire-Danger Rating System (NFDRS), which is used widely across the United States and is a key predictor in the Global ECMWF Fire Forecasting (GEFF) model. The NFDRS is composed of over 100 equations relating wildfire risk to weather conditions, climate and land cover characteristics, and fuel. These equations and the corresponding 130+ parameters were developed via field and lab experiments. These parameters, which are fixed in the standard NFDRS and GEFF implementations, may not be the most appropriate for a climate-changing world. In order to adjust the NFDRS parameters to current climate conditions and specific geographical locations, we recast NFDRS in PyTorch to create a new deep learning-based Fire Index Risk Optimizer (FIRO). FIRO predicts the ignition component, or the probability a wildfire would require suppression in the presence of a firebrand, and calibrates the uncertain parameters for a specific region and climate conditions by training on observed fires. Given the rare occurrence of wildfires, we employed the extremal dependency index (EDI) as the loss function. Using ERA5 reanalysis and MODIS burned area data, we trained FIRO models for California, Texas, Italy, and Madagascar. Across these four geographies, the average EDI improvement was 175% above the standard NFDRS implementation Authors: Eduardo R Rodrigues (MSR); Campbell D Watson (IBM Reserch); Bianca Zadrozny (IBM Research); Gabrielle Nyirjesy (Columbia University) |
Disaster management and relief Forestry and other land use Interpretable ML |
NeurIPS 2022 |
Towards a spatially transferable super resolution model for downscaling Antarctic surface melt
(Papers Track)
Abstract and authors: (click to expand)Abstract: Surface melt on the Antarctic Ice Sheet is an important climate indicator, yet the spatial scale of modeling and observing surface melt is insufficient to capture crucial details and understand local processes. High-resolution climate models could provide a solution, but they are computationally expensive and require finetuning for some model parameters. An alternative method, pioneering in geophysics, is single-image super resolution (SR) applied on lower-resolution model output. However, often input and output of such SR models are available on the same, fixed spatial domain. High-resolution model simulations over Antarctica are available only in some regions. To be able to apply an SR model elsewhere, we propose to make the single-image SR model physics-aware, using surface albedo and elevation as additional input. Our results show a great improvement in the spatial transferability of the conventional SR model. Although issues with the input satellite-derived albedo remain, adding physics awareness paves a way toward a spatially transferable SR model for downscaling Antarctic surface melt. Authors: Zhongyang Hu (IMAU); Yao Sun (TUM); Peter Kuipers Munneke (IMAU); Stef Lhermitte (TU Delft); Xiaoxiang Zhu (Technical University of Munich,Germany) |
Climate science and climate modeling Earth science Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2022 |
Forecasting European Ozone Air Pollution With Transformers
(Papers Track)
Abstract and authors: (click to expand)Abstract: Surface ozone is an air pollutant that contributes to hundreds of thousands of premature deaths annually. Accurate short-term ozone forecasts may allow improved policy to reduce the risk to health, such as air quality warnings. However, forecasting ozone is a difficult problem, as surface ozone concentrations are controlled by a number of physical and chemical processes which act on varying timescales. Accounting for these temporal dependencies appropriately is likely to provide more accurate ozone forecasts. We therefore deploy a state-of-the-art transformer-based model, the Temporal Fusion Transformer, trained on observational station data from three European countries. In four-day test forecasts of daily maximum 8-hour ozone, the novel approach is highly skilful (MAE = 4.6 ppb, R2 = 0.82), and generalises well to two European countries unseen during training (MAE = 4.9 ppb, R2 = 0.79). The model outperforms standard machine learning models on our data, and compares favourably to the published performance of other deep learning architectures tested on different data. We illustrate that the model pays attention to physical variables known to control ozone concentrations, and that the attention mechanism allows the model to use relevant days of past ozone concentrations to make accurate forecasts. Authors: Seb Hickman (University of Cambridge); Paul Griffiths (University of Cambridge); Alex Archibald (University of Cambridge); Peer Nowack (Imperial College London); Elie Alhajjar (USMA) |
Climate science and climate modeling Earth science Extreme weather Health Public policy Societal adaptation and resilience Classification, regression, and supervised learning Interpretable ML Time-series analysis |
NeurIPS 2022 |
Stability Constrained Reinforcement Learning for Real-Time Voltage Control
(Papers Track)
Abstract and authors: (click to expand)Abstract: This paper is a summary of a recently submitted work. Deep Reinforcement Learning (DRL) has been recognized as a promising tool to address the challenges in real-time control of power systems. However, its deployment in real-world power systems has been hindered by a lack of explicit stability and safety guarantees. In this paper, we propose a stability constrained reinforcement learning method for real-time voltage control in both single-phase and three-phase distribution grids. The key idea underlying our approach is an explicitly constructed Lyapunov function that certifies stability. We demonstrate the effectiveness of our approach with IEEE test feeders, where the proposed method achieves the best overall performance, while always achieving voltage stability. In contrast, standard RL methods often fail to achieve voltage stability. Authors: Jie Feng (UCSD); Yuanyuan Shi (University of California San Diego); Guannan Qu (Carnegie Mellon University); Steven Low (California Institute of Technology); Animashree Anandkumar (Caltech); Adam Wierman (California Institute of Technology) |
Power and energy systems Reinforcement learning and control |
NeurIPS 2022 |
Land Use Prediction using Electro-Optical to SAR Few-Shot Transfer Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Satellite image analysis has important implications for land use, urbanization, and ecosystem monitoring. Deep learning methods can facilitate the analysis of different satellite modalities, such as electro-optical (EO) and synthetic aperture radar (SAR) imagery, by supporting knowledge transfer between the modalities to compensate for individual shortcomings. Recent progress has shown how distributional alignment of neural network embeddings can produce powerful transfer learning models by employing a sliced Wasserstein distance (SWD) loss. We analyze how this method can be applied to Sentinel-1 and -2 satellite imagery and develop several extensions toward making it effective in practice. In an application to few-shot Local Climate Zone (LCZ) prediction, we show that these networks outperform multiple common baselines on datasets with a large number of classes. Further, we provide evidence that instance normalization can significantly stabilize the training process and that explicitly shaping the embedding space using supervised contrastive learning can lead to improved performance. Authors: Marcel Hussing (University of Pennsylvania); Karen Li (University of Pennsylvania); Eric Eaton (University of Pennsylvania) |
Meta- and transfer learning Cities and urban planning Earth observations and monitoring |
NeurIPS 2022 |
Exploring Randomly Wired Neural Networks for Climate Model Emulation
(Papers Track)
Abstract and authors: (click to expand)Abstract: Exploring the climate impacts of various anthropogenic emissions scenarios is key to making informed decisions for climate change mitigation and adaptation. State-of-the-art Earth system models can provide detailed insight into these impacts, but have a large associated computational cost on a per-scenario basis. This large computational burden has driven recent interest in developing cheap machine learning models for the task of climate model emulation. In this manuscript, we explore the efficacy of randomly wired neural networks for this task. We describe how they can be constructed and compare them to their standard feedforward counterparts using the ClimateBench dataset. Specifically, we replace the dense layers in multilayer perceptrons, convolutional neural networks, and convolutional long short-term memory networks with randomly wired ones and assess the impact on model performance for models with 1 million and 10 million parameters. We find average performance improvements of 4.2% across model complexities and prediction tasks, with substantial performance improvements of up to 16.4% in some cases. Furthermore, we find no significant difference in prediction speed between networks with standard feedforward dense layers and those with randomly wired layers. These findings indicate that randomly wired neural networks may be suitable direct replacements for traditional dense layers in many standard models. Authors: William J Yik (Harvey Mudd College); Sam J Silva (The University of Southern California); Andrew Geiss (Pacific Northwest National Laboratory); Duncan Watson-Parris (University of Oxford) |
Climate science and climate modeling Earth observations and monitoring Earth science Extreme weather Classification, regression, and supervised learning Computer vision and remote sensing Time-series analysis |
NeurIPS 2022 |
Closing the Domain Gap -- Blended Synthetic Imagery for Climate Object Detection
(Papers Track)
Abstract and authors: (click to expand)Abstract: Object detection models have great potential to increase both the frequency and cost-efficiency of assessing climate-relevant infrastructure in satellite imagery. However, model performance can suffer when models are applied to stylistically different geographies. We propose a technique to generate synthetic imagery using minimal labeled examples of the target object at a low computational cost. Our technique blends example objects onto unlabeled images of the target domain. We show that including these synthetic images improves the average precision of a YOLOv3 object detection model when compared to a baseline and other popular domain adaptation techniques. Authors: Caleb Kornfein (Duke University); Frank Willard (Duke University); Caroline Tang (Duke University); Yuxi Long (Duke University); Saksham Jain (Duke University); Jordan Malof (Duke University); Simiao Ren (Duke University); Kyle Bradbury (Duke University) |
Computer vision and remote sensing Meta- and transfer learning |
NeurIPS 2022 |
SustainGym: A Benchmark Suite of Reinforcement Learning for Sustainability Applications
(Papers Track)
Abstract and authors: (click to expand)Abstract: The lack of standardized benchmarks for reinforcement learning (RL) in sustainability applications has made it difficult to both track progress on specific domains and identify bottlenecks for researchers to focus their efforts on. In this paper, we present SustainGym, a suite of two environments designed to test the performance of RL algorithms on realistic sustainability tasks. The first environment simulates the problem of scheduling decisions for a fleet of electric vehicle (EV) charging stations, and the second environment simulates decisions for a battery storage system bidding in an electricity market. We describe the structure and features of the environments and show that standard RL algorithms have significant room for improving performance. We discuss current challenges in introducing RL to real-world sustainability tasks, including physical constraints and distribution shift. Authors: Christopher Yeh (California Institute of Technology); Victor Li (California Institute of Technology); Rajeev Datta (California Institute of Technology); Yisong Yue (Caltech); Adam Wierman (California Institute of Technology) |
Reinforcement learning and control Power and energy systems |
NeurIPS 2022 |
Remote estimation of geologic composition using interferometric synthetic-aperture radar in California’s Central Valley
(Papers Track)
Abstract and authors: (click to expand)Abstract: California's Central Valley is the national agricultural center, producing 1/4 of the nation’s food. However, land in the Central Valley is sinking at a rapid rate (as much as 20 cm per year) due to continued groundwater pumping. Land subsidence has a significant impact on infrastructure resilience and groundwater sustainability. In this study, we aim to identify specific regions with different temporal dynamics of land displacement and find relationships with underlying geological composition. Then, we aim to remotely estimate geologic composition using interferometric synthetic aperture radar (InSAR)-based land deformation temporal changes using machine learning techniques. We identified regions with different temporal characteristics of land displacement in that some areas (e.g., Helm) with coarser grain geologic compositions exhibited potentially reversible land deformation (elastic land compaction). We found a significant correlation between InSAR-based land deformation and geologic composition using random forest and deep neural network regression models. We also achieved significant accuracy with 1/4 sparse sampling to reduce any spatial correlations among data, suggesting that the model has the potential to be generalized to other regions for indirect estimation of geologic composition. Our results indicate that geologic composition can be estimated using InSAR-based land deformation data. In-situ measurements of geologic composition can be expensive and time consuming and may be impractical in some areas. The generalizability of the model sheds light on high spatial resolution geologic composition estimation utilizing existing measurements. Authors: Kyongsik Yun (California Institute of Technology); Kyra Adams (California Institute of Technology); John Reager (California Institute of Technology); Zhen Liu (California Institute of Technology); Caitlyn Chavez (California Institute of Technology); Michael Turmon (California Institute of Technology); Thomas Lu (California Institute of Technology) |
Earth observations and monitoring Classification, regression, and supervised learning Interpretable ML |
NeurIPS 2022 |
AutoML for Climate Change: A Call to Action
(Papers Track)
Abstract and authors: (click to expand)Abstract: The challenge that climate change poses to humanity has spurred a rapidly developing field of artificial intelligence research focused on climate change applications. The climate change ML (CCML) community works on a diverse, challenging set of problems which often involve physics-constrained ML or heterogeneous spatiotemporal data. It would be desirable to use automated machine learning (AutoML) techniques to automatically find high-performing architectures and hyperparameters for a given dataset. In this work, we benchmark popular Auto ML libraries on three high-leverage CCML applications: climate modeling, wind power forecasting, and catalyst discovery. We find that out-of-the-box AutoML libraries currently fail to meaningfully surpass the performance of human-designed CCML models. However, we also identify a few key weaknesses, which stem from the fact that most AutoML techniques are tailored to computer vision and NLP applications. For example, while dozens of search spaces have been designed for image and language data, none have been designed for spatiotemporal data. Addressing these key weaknesses can lead to the discovery of novel architectures that yield substantial performance gains across numerous CCML applications. Therefore, we present a call to action to the AutoML community, since there are a number of concrete, promising directions for future work in the space of AutoML for CCML. We release our code and a list of resources at https://github.com/climate-change-automl/climate-change-automl. Authors: Renbo Tu (University of Toronto); Nicholas Roberts (University of Wisconsin-Madison); Vishak Prasad C (Indian Institute Of Technology, Bombay); Sibasis Nayak (Indian Institute of Technology, Bombay); Paarth Jain (Indian Institute of Technology Bombay); Frederic Sala (University of Wisconsin-Madison); Ganesh Ramakrishnan (IIT Bombay); Ameet Talwalkar (CMU); Willie Neiswanger (Stanford University); Colin White (Abacus.AI) |
Other Climate science and climate modeling Materials science and discovery Power and energy systems |
NeurIPS 2022 |
Temperature impacts on hate speech online: evidence from four billion tweets
(Papers Track)
Abstract and authors: (click to expand)Abstract: Human aggression is no longer limited to the physical space but exists in the form of hate speech on social media. Here, we examine the effect of temperature on the occurrence of hate speech on Twitter and interpret the results in the context of climate change, human behavior and mental health. Employing supervised machine learning models, we identify hate speech in a data set of four billion geolocated tweets from over 750 US cities (2014 – 2020). We statistically evaluate the changes in daily hate tweets against changes in local temperature, isolating the temperature influence from confounding factors using binned panel-regression models. We find a low prevalence of hate tweets in moderate temperatures and observe sharp increases of up to 12% for colder and up to 22% for hotter temperatures, indicating that not only hot but also cold temperatures increase aggressive tendencies. Further, we observe that for extreme temperatures hate speech also increases as a percentage of total tweeting activity, crowding out non-hate speech. The quasi-quadratic shape of the temperature-hate tweet curve is robust across varying climate zones, income groups, religious and political beliefs. The prevalence of the results across climatic and socioeconomic splits points to limits in adaptation. Our results illuminate hate speech online as an impact channel through which temperature alters societal aggression. Authors: Annika Stechemesser (Potsdam Insitute for Climate Impact Research); Anders Levermann (Potsdam Institute for Climate Impact Research); Leonie Wenz (Potsdam Institute for Climate Impact Research) |
Classification, regression, and supervised learning Health Societal adaptation and resilience Data mining Natural language processing |
NeurIPS 2022 |
Cross Modal Distillation for Flood Extent Mapping
(Papers Track)
Abstract and authors: (click to expand)Abstract: The increasing intensity and frequency of floods is one of the many consequences of our changing climate. In this work, we explore ML techniques that improve the flood detection module of an operational early flood warning system. Our method exploits an unlabelled dataset of paired multi-spectral and Synthetic Aperture Radar (SAR) imagery to reduce the labeling requirements of a purely supervised learning method. Past attempts have used such unlabelled data by creating weak labels out of them, but end up learning the label mistakes in those weak labels. Motivated by knowledge distillation and semi supervised learning, we explore the use of a teacher to train a student with the help of a small hand labeled dataset and a large unlabelled dataset. Unlike the conventional self distillation setup, we propose a cross modal distillation framework that transfers supervision from a teacher trained on richer modality (multi-spectral images) to a student model trained on SAR imagery. The trained models are then tested on the Sen1Floods11 dataset. Our model outperforms the Sen1Floods11 SAR baselines by an absolute margin of 4.15% pixel wise Intersection-over-Union (IoU) on the test split. Authors: Shubhika Garg (Google); Ben Feinstein (Google); Shahar Timnat (Google); Vishal V Batchu (Google); gideon dror (The Academic College of Tel-Aviv-Yaffo); Adi Gerzi Rosenthal (Google); Varun Gulshan (Google Research) |
Computer vision and remote sensing Disaster management and relief Earth observations and monitoring Unsupervised and semi-supervised learning |
NeurIPS 2022 |
Transformer Neural Networks for Building Load Forecasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: Accurate electrical load forecasts of buildings are needed to optimize local energy storage and to make use of demand-side flexibility. We study the usage of Transformer neural networks for short-term electrical load forecasting of 296 buildings from a public dataset. Transformer neural networks trained on many buildings give the best forecasts on 115 buildings, and multi-layer perceptrons trained on a single building are better on 161 buildings. In addition, we evaluate the models on buildings that were not used for training, and find that Transformer neural networks generalize better than multi-layer perceptrons and our statistical baselines. This shows that the usage of Transformer neural networks for building load forecasting could reduce training resources due to the good generalization to unseen buildings, and they could be useful for cold-start scenarios. Authors: Matthias Hertel (KIT); Simon Ott (KIT); Oliver Neumann (KIT); Benjamin Schäfer (KIT); Ralf Mikut (Karlsruhe Institute of Technology); Veit Hagenmeyer (Karlsruhe Institute of Technology (KIT)) |
Time-series analysis Buildings Classification, regression, and supervised learning |
NeurIPS 2022 |
Estimating Chicago’s tree cover and canopy height using multi-spectral satellite imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Information on urban tree canopies is fundamental to mitigating climate change as well as improving quality of life. Urban tree planting initiatives face a lack of up-to-date data about the horizontal and vertical dimensions of the tree canopy in cities. We present a pipeline that utilizes LiDAR data as ground-truth and then trains a multi-task machine learning model to generate reliable estimates of tree cover and canopy height in urban areas using multi-source multi-spectral satellite imagery for the case study of Chicago. Authors: John Francis (University College London) |
Cities and urban planning Ecosystems and biodiversity Public policy Computer vision and remote sensing |
NeurIPS 2022 |
Reconstruction of Grid Measurements in the Presence of Adversarial Attacks
(Papers Track)
Abstract and authors: (click to expand)Abstract: In efforts to mitigate the adverse effects of climate change, policymakers have set ambitious goals to reduce the carbon footprint of all sectors - including the electric grid. To facilitate this, sustainable energy systems like renewable generation must { be} deployed at high numbers throughout the grid. As these are highly variable in nature, the grid must be closely monitored so that system operators will have elevated situational awareness and can execute timely actions to maintain stable grid operations. With the widespread deployment of sensors like phasor measurement units (PMUs), an abundance of data is available for conducting accurate state estimation. However, due to the cyber-physical nature of the power grid, measurement data can be perturbed in an adversarial manner to enforce incorrect decision-making. In this paper, we propose a novel reconstruction method that leverages on machine learning constructs like CGAN and gradient search to recover the original states when subjected to adversarial perturbations. Experimental studies conducted on the practical IEEE 118-bus benchmark power system show that the proposed method can reduce errors due to perturbation by large margins (i.e. up to 100%). Authors: Amirmohammad Naeini (York University); Samer El Kababji (Western University); Pirathayini Srikantha (York University) |
Generative modeling Other Unsupervised and semi-supervised learning |
NeurIPS 2022 |
Heat Demand Forecasting with Multi-Resolutional Representation of Heterogeneous Temporal Ensemble
(Papers Track)
Abstract and authors: (click to expand)Abstract: One of the primal challenges faced by utility companies is ensuring efficient supply with minimal greenhouse gas emissions. The advent of smart meters and smart grids provide an unprecedented advantage in realizing an optimised supply of thermal energies through proactive techniques such as load forecasting. In this paper, we propose a forecasting framework for heat demand based on neural networks where the time series are encoded as scalograms equipped with the capacity of embedding exogenous variables such as weather, and holiday/non-holiday. Subsequently, CNNs are utilized to predict the heat load multi-step ahead. Finally, the proposed framework is compared with other state-of-the-art methods, such as SARIMAX and LSTM. The quantitative results from retrospective experiments show that the proposed framework consistently outperforms the state-of-the-art baseline method with real-world data acquired from Denmark. A minimal mean error of 7.54% for MAPE and 417kW for RMSE is achieved with the proposed framework in comparison to all other methods. Authors: Satyaki Chatterjee (Pattern Recognition Lab, FAU Erlangen-Nuremberg); Adithya Ramachandran (Pattern Recognition Lab, Friedrich Alexander University, Erlangen); Thorkil Flensmark Neergaard (Brønderslev Forsyning A/S); Andreas K Maier (Pattern Recognition Lab, FAU Erlangen-Nuremberg); Siming Bayer (Pattern Recognition Lab, Friedrich-Alexander University) |
Time-series analysis Cities and urban planning Power and energy systems Classification, regression, and supervised learning Interpretable ML |
NeurIPS 2022 |
Generalized Ice Detection on Wind Turbine Rotor Blades with Neural Style Transfer
(Papers Track)
Abstract and authors: (click to expand)Abstract: Wind energy’s ability to liberate the world of conventional sources of energy relies on lowering the significant costs associated with the maintenance of wind turbines. Since icing events on turbine rotor blades are a leading cause of operational failures, identifying icing in advance is critical. Some recent studies focus on specific wind parks and fail to generalize to unseen scenarios (e.g. new rotor blade designs). We propose the utilisation of synthetic data augmentation via neural style transfer to improve the generalization of existing ice prediction models. We show that training models with augmented data that captures domain-invariant icing characteristics can help improve predictive performance across multiple wind parks. Through efficient identification of icing, this study can support preventive maintenance of wind energy sources by making them more reliable towards tackling climate change. Authors: Joyjit Chatterjee (University of Hull); Maria Teresa Alvela Nieto (University of Bremen); Hannes Gelbhardt (University of Bremen); Nina Dethlefs (University of Hull); Jan Ohlendorf (University of Bremen); Klaus-Dieter Thoben (University of Bremen) |
Power and energy systems Classification, regression, and supervised learning Computer vision and remote sensing Meta- and transfer learning |
NeurIPS 2022 |
Identifying Compound Climate Drivers of Forest Mortality with β-VAE
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is expected to lead to higher rates of forest mortality. Forest mortality is a complex phenomenon driven by the interaction of multiple climatic variables at multiple temporal scales, further modulated by the current state of the forest (e.g. age, stem diameter, and leaf area index). Identifying the compound climate drivers of forest mortality would greatly improve understanding and projections of future forest mortality risk. Observation data are, however, limited in accuracy and sample size, particularly in regard to forest state variables and mortality events. In contrast, simulations with state-of-the-art forest models enable the exploration of novel machine learning techniques for associating forest mortality with driving climate conditions. Here we simulate 160,000 years of beech, pine and spruce forest dynamics with the forest model FORMIND. We then apply β-VAE to learn disentangled latent representations of weather conditions and identify those that are most likely to cause high forest mortality. The learned model successfully identifies three characteristic climate representations that can be interpreted as different compound drivers of forest mortality. Authors: Mohit Anand (Helmholtz Centre for Environmental Research - UFZ); Lily-belle Sweet (Helmholtz Centre for Environmental Research - UFZ); Gustau Camps-Valls (Universitat de València); Jakob Zscheischler (Helmholtz Centre for Environmental Research - UFZ) |
Generative modeling Climate science and climate modeling Earth observations and monitoring Earth science Ecosystems and biodiversity Extreme weather Forestry and other land use Classification, regression, and supervised learning Interpretable ML Time-series analysis |
NeurIPS 2022 |
TCFD-NLP: Assessing alignment of climate disclosures using NLP for the financial markets
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate-related disclosure is increasing in importance as companies and stakeholders alike aim to reduce their environmental impact and exposure to climate-induced risk. Companies primarily disclose this information in annual or other lengthy documents where climate information is not the sole focus. To assess the quality of a company's climate-related disclosure, these documents, often hundreds of pages long, must be reviewed manually by climate experts. We propose a more efficient approach to assessing climate-related financial information. We construct a model leveraging TF-IDF, sentence transformers and multi-label k nearest neighbors (kNN). The developed model is capable of assessing alignment of climate disclosures at scale, with a level of granularity and transparency that will support decision-making in the financial markets with relevant climate information. In this paper, we discuss the data that enabled this project, the methodology, and how the resulting model can drive climate impact. Authors: Rylen Sampson (Manifest Climate); Aysha Cotterill (Manifest Climate); Quoc Tien Au (Manifest Climate) |
Climate finance and economics Classification, regression, and supervised learning Natural language processing |
NeurIPS 2022 |
Deep Learning for Rapid Landslide Detection using Synthetic Aperture Radar (SAR) Datacubes
(Papers Track)
Abstract and authors: (click to expand)Abstract: With climate change predicted to increase the likelihood of landslide events, there is a growing need for rapid landslide detection technologies that help inform emergency responses. Synthetic Aperture Radar (SAR) is a remote sensing technique that can provide measurements of affected areas independent from weather or lighting conditions. Usage of SAR, however, is hindered by domain knowledge that is necessary for the pre-processing steps and its interpretation requires expert knowledge. We provide simplified, pre-processed, machine-learning ready SAR datacubes for four globally located landslide events obtained from several Sentinel-1 satellite passes before and after a landslide triggering event together with segmentation maps of the landslides. From this dataset, using the Hokkaido, Japan datacube, we study the feasibility of SAR-based landslide detection with supervised deep learning (DL). Our results demonstrate that DL models can be used to detect landslides from SAR data, achieving an Area under the Precision-Recall curve exceeding 0.7. We find that additional satellite visits enhance detection performance, but that early detection is possible when SAR data is combined with terrain information from a digital elevation model. This can be especially useful for time-critical emergency interventions. Authors: Vanessa Boehm (UC Berkeley); Wei Ji Leong (The Ohio State University); Ragini Bal Mahesh (German Aerospace Center DLR); Ioannis Prapas (National Observatory of Athens); Siddha Ganju (Nvidia); Freddie Kalaitzis (University of Oxford); Edoardo Nemni (United Nations Satellite Centre (UNOSAT)); Raul Ramos-Pollan (Universidad de Antioquia) |
Disaster management and relief Earth observations and monitoring Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2022 |
Hybrid Recurrent Neural Network for Drought Monitoring
(Papers Track)
Abstract and authors: (click to expand)Abstract: Droughts are pervasive hydrometeorological phenomena and global hazards, whose frequency and intensity are expected to increase in the context of climate change. Drought monitoring is of paramount relevance. Here we propose a hybrid model for drought detection that integrates both climatic indices and data-driven models in a hybrid deep learning approach. We exploit time-series of multi-scale Standardized Precipitation Evapotranspiration Index together with precipitation and temperature as inputs. We introduce a dual-branch recurrent neural network with convolutional lateral connections for blending the data. Experimental and ablative results show that the proposed system outperforms both the considered drought index and purely data-driven deep learning models. Our results suggest the potential of hybrid models for drought monitoring and open the door to synergistic systems that learn from data and domain knowledge altogether. Authors: Mengxue Zhang (Universitat de València); Miguel-Ángel Fernández-Torres (Universitat de València); Gustau Camps-Valls (Universitat de València) |
Climate science and climate modeling Extreme weather Computer vision and remote sensing Hybrid physical models Time-series analysis |
NeurIPS 2022 |
Deep Learning for Global Wildfire Forecasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is expected to aggravate wildfire activity through the exacerbation of fire weather. Improving our capabilities to anticipate wildfires on a global scale is of uttermost importance for mitigating their negative effects. In this work, we create a global fire dataset and demonstrate a prototype for predicting the presence of global burned areas on a sub-seasonal scale with the use of segmentation deep learning models. Particularly, we present an open-access global analysis-ready datacube, which contains a variety of variables related to the seasonal and sub-seasonal fire drivers (climate, vegetation, oceanic indices, human-related variables), as well as the historical burned areas and wildfire emissions for 2001-2021. We train a deep learning model, which treats global wildfire forecasting as an image segmentation task and skillfully predicts the presence of burned areas 8, 16, 32 and 64 days ahead of time. Our work motivates the use of deep learning for global burned area forecasting and paves the way towards improved anticipation of global wildfire patterns. Authors: Ioannis Prapas (National Observatory of Athens); Akanksha Ahuja (NOA); Spyros Kondylatos (National Observatory of Athens); Ilektra Karasante (National Observatory of Athens); Lazaro Alonso (Max Planck Institute for Biogeochemistry); Eleanna Panagiotou (Harokopio University of Athens); Charalampos Davalas (Harokopio University of Athens); Dimitrios Michail (Harokopio University of Athens); Nuno Carvalhais (Max Planck Institute for Biogeochemistry); Ioannis Papoutsis (National Observatory of Athens) |
Earth observations and monitoring Climate science and climate modeling Disaster management and relief Earth science Extreme weather Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2022 |
Causal Modeling of Soil Processes for Improved Generalization
(Papers Track)
Abstract and authors: (click to expand)Abstract: Measuring and monitoring soil organic carbon is critical for agricultural productivity and for addressing critical environmental problems. Soil organic carbon not only enriches nutrition in soil, but also has a gamut of co-benefits such as improving water storage and limiting physical erosion. Despite a litany of work in soil organic carbon estimation, current approaches do not generalize well across soil conditions and management practices. We empirically show that explicit modeling of cause-and-effect relationships among the soil processes improves the out-of-distribution generalizability of prediction models. We provide a comparative analysis of soil organic carbon estimation models where the skeleton is estimated using causal discovery methods. Our framework provide an average improvement of 81% in test mean squared error and 52% in test mean absolute error. Authors: Somya Sharma (U. Minnesota); Swati Sharma (Microsoft Research); Emre Kiciman (Microsoft Research); Andy Neal (Rothamstead); Ranveer Chandra (Microsoft Research); John Crawford (University of Glasgow); Sara Malvar (Microsoft); Eduardo R Rodrigues (MSR) |
Agriculture and food Causal and Bayesian methods |
NeurIPS 2022 |
Machine Learning for Activity-Based Road Transportation Emissions Estimation
(Papers Track)
Abstract and authors: (click to expand)Abstract: Measuring and attributing greenhouse gas (GHG) emissions remains a challenging problem as the world strives towards meeting emissions reductions targets. As a significant portion of total global emissions, the road transportation sector represents an enormous challenge for estimating and tracking emissions at a global scale. To meet this challenge, we have developed a hybrid approach for estimating road transportation emissions that combines the strengths of machine learning and satellite imagery with localized emissions factors data to create an accurate, globally scalable, and easily configurable GHG monitoring framework. Authors: Derek Rollend (JHU); Kevin Foster (JHU); Tomek Kott (JHU); Rohita Mocharla (JHU); Rodrigo Rene Rai Munoz Abujder (Johns Hopkins Applied Physics Laboratory); Neil Fendley (JHU/APL); Chace Ashcraft (JHU/APL); Frank Willard (JHU); Marisa Hughes (JHU) |
Transportation Cities and urban planning Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2022 |
Bayesian State-Space SCM for Deforestation Baseline Estimation for Forest Carbon Credit
(Papers Track)
Best Paper: Pathway to Impact
Abstract and authors: (click to expand)Abstract: In forest carbon credit, the concept of dynamic (or ex-post) baseline has been discussed to overcome the criticism of junk carbon credit, while an ex-ante baseline is still necessary in terms of project finance and risk assessment. We propose a Bayesian state-space SCM, which integrates both ex-ante and ex-post baseline estimation in a time-series causal inference framework. We apply the proposed model to a REDD+ project in Brazil, and show that it might have had a small, positive effect but had been over-credited and that the 90% predictive interval of the ex-ante baseline included the ex-post baseline, implying our ex-ante estimation can work effectively. Authors: Keisuke Takahata (sustainacraft, Inc.); Hiroshi Suetsugu (sustainacraft, Inc.); Keiichi Fukaya (National Institute for Environmental Studies); Shinichiro Shirota (Hitotsubashi University) |
Forestry and other land use Causal and Bayesian methods |
NeurIPS 2022 |
Estimating Corporate Scope 1 Emissions Using Tree-Based Machine Learning Methods
(Papers Track)
Abstract and authors: (click to expand)Abstract: Companies worldwide contribute to climate change, emitting significant amounts of greenhouse gases (GHGs). Yet, most do not report their direct or Scope 1 emissions, resulting in a large data gap in corporate emissions. This study aims to fill this gap by training several decision-tree machine learning models to predict company-level Scope 1 emissions. Our results demonstrate that the Extreme Gradient Boosting and LightGBM models perform best, where the former shows a 19% improvement in prediction error over a benchmark model. Our model is also of reduced complexity and greater computational efficiency; it does not require meta-learners and is trained on a smaller number of features, for which data is more common and accessible compared to prior works. Our features are uniquely chosen based on concepts of environmental pollution in economic theory. Predicting corporate emissions with machine learning can be used as a gap-filling approach, which would allow for better GHG accounting and tracking, thus facilitating corporate decarbonization efforts in the long term. It can also impact representations of a company’s carbon performance and carbon risks, thereby helping to funnel investments towards companies with lower emissions and those making true efforts to decarbonize. Authors: Elham Kheradmand (University of Montreal); Maida Hadziosmanovic (Concordia University); Nazim Benguettat (Concordia); H. Damon Matthews (Concordia University); Shannon M. Lloyd (Concordia University) |
Classification, regression, and supervised learning Climate finance and economics Interpretable ML |
NeurIPS 2022 |
Analyzing Micro-Level Rebound Effects of Energy Efficient Technologies
(Papers Track)
Abstract and authors: (click to expand)Abstract: Energy preservation is central to prevent resource depletion, climate change and environment degradation. Investment in raising efficiency of appliances is among the most significant attempts to save energy. Ironically, introduction of many such energy saving appliances increased the total energy consumption instead of reducing it. This effect in literature is attributed to the inherent Jevons paradox (JP) and optimism bias (OB) in consumer behavior. However, the magnitude of these instincts vary among different people. Identification of this magnitude for each household can enable the development of appropriate policies that induce desired energy saving behaviour. Using the RECS 2015 dataset, the paper uses machine learning for each electrical appliance to determine the dependence of their total energy consumption on their energy star rating. This shows that only substitutable appliances register increase in energy demand upon boosted efficiency. Lastly, an index is noted to indicate the varying influence of JP and OB on different households. Authors: Mayank Jain (University College Dublin); Mukta Jain (Delhi School of Economics); Tarek T. Alskaif (Wageningen University); Soumyabrata Dev (University College Dublin) |
Behavioral and social science Cities and urban planning Power and energy systems Classification, regression, and supervised learning |
NeurIPS 2022 |
Comparing the carbon costs and benefits of low-resource solar nowcasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: Mitigating emissions in line with climate goals requires the rapid integration of low carbon energy sources, such as solar photovoltaics (PV) into the electricity grid. However, the energy produced from solar PV fluctuates due to clouds obscuring the sun's energy. Solar PV yield nowcasting is used to help anticipate peaks and troughs in demand to support grid integration. This paper compares multiple low-resource approaches to nowcasting solar PV yield. To do so, we use a dataset of UK satellite imagery and solar PV energy readings over a 1 to 4-hour time range. Our work investigates the performance of multiple nowcasting models. The paper also estimates the carbon emissions generated and averted by deploying models such as these, and finds that short-term PV forecasting may have a benefit several orders of magnitude greater than its carbon cost and that this benefit holds for small models that could be deployable in low-resource settings around the globe. Authors: Ben Dixon (UCL); Jacob Bieker (Open Climate Fix); Maria Perez-Ortiz (University College London) |
Computer vision and remote sensing Climate science and climate modeling Time-series analysis |
NeurIPS 2022 |
Climate Policy Tracker: Pipeline for automated analysis of public climate policies
(Papers Track)
Abstract and authors: (click to expand)Abstract: The number of standardized policy documents regarding climate policy and their publication frequency is significantly increasing. The documents are long and tedious for manual analysis, especially for policy experts, lawmakers, and citizens who lack access or domain expertise to utilize data analytics tools. Potential consequences of such a situation include reduced citizen governance and involvement in climate policies and an overall surge in analytics costs, rendering less accessibility for the public. In this work, we use a Latent Dirichlet Allocation-based pipeline for the automatic summarization and analysis of 10-years of national energy and climate plans (NECPs) for the period from 2021 to 2030, established by 27 Member States of the European Union. We focus on analyzing policy framing, the language used to describe specific issues, to detect essential nuances in the way governments frame their climate policies and achieve climate goals. The methods leverage topic modeling and clustering for the comparative analysis of policy documents across different countries. It allows for easier integration in potential user-friendly applications for the development of theories and processes of climate policy. This would further lead to better citizen governance and engagement over climate policies and public policy research. Authors: Artur Żółkowski (Warsaw University of Technology); Mateusz Krzyziński (Warsaw University of Technology); Piotr Wilczyński (Warsaw University of Technology); Stanisław Giziński (University of Warsaw); Emilia Wiśnios (University of Warsaw); Bartosz Pieliński (University of Warsaw); Julian Sienkiewicz (Warsaw University of Technology); Przemysław Biecek (Warsaw University of Technology) |
Public policy Natural language processing |
NeurIPS 2022 |
Topic correlation networks inferred from open-ended survey responses reveal signatures of ideology behind carbon tax opinion
(Papers Track)
Abstract and authors: (click to expand)Abstract: Ideology can often render policy design ineffective by overriding what, at face value, are rational incentives. A timely example is carbon pricing, whose public support is strongly influenced by ideology. As a system of ideas, ideology expresses itself in the way people explain themselves and the world. As an object of study, ideology is then amenable to a generative modelling approach within the text-as-data paradigm. Here, we analyze the structure of ideology underlying carbon tax opinion using topic models. An idea, termed a topic, is operationalized as the fixed set of proportions with which words are used when talking about it. We characterize ideology through the relational structure between topics. To access this latent structure, we use the highly expressive Structural Topic Model to infer topics and the weights with which individual opinions mix topics. We fit the model to a large dataset of open-ended survey responses of Canadians elaborating on their support of or opposition to the tax. We propose and evaluate statistical measures of ideology in our data, such as dimensionality and heterogeneity. Finally, we discuss the implications of the results for transition policy in particular, and of our approach to analyzing ideology for computational social science in general. Authors: Maximilian Puelma Touzel (Mila) |
Behavioral and social science Climate finance and economics Public policy Generative modeling Natural language processing |
NeurIPS 2022 |
Controllable Generation for Climate Modeling
(Papers Track)
Abstract and authors: (click to expand)Abstract: Recent years have seen increased interest in modeling future climate trends, especially from the point of view of accurately predicting, understanding and mitigating downstream impacts. For instance, current state-of-the-art process-based agriculture models rely on high-resolution climate data during the growing season for accurate estimation of crop yields. However, high-resolution climate data for future climates is unavailable and needs to be simulated, and that too for multiple possible climate scenarios, which becomes prohibitively expensive via traditional methods. Meanwhile, deep generative models leveraging the expressivity of neural networks have shown immense promise in modeling distributions in high dimensions. Here, we cast the problem of simulation of climate scenarios in a generative modeling framework. Specifically, we leverage the GAN (Generative Adversarial Network) framework for simulating synthetic climate scenarios. We condition the model by quantifying the degree of ``extremeness" of the observed sample, which allows us to sample from different parts of the distribution. We demonstrate the efficacy of the proposed method on the CHIRPS precipitation dataset. Authors: Moulik Choraria (University of Illinois at Urbana-Champaign); Daniela Szwarcman (IBM Research); Bianca Zadrozny (IBM Research); Campbell D Watson (IBM Reserch); Lav Varshney (UIUC: ECE) |
Climate science and climate modeling Extreme weather Generative modeling |
NeurIPS 2022 |
Learn to Bid: Deep Reinforcement Learning with Transformer for Energy Storage Bidding in Energy and Contingency Reserve Markets
(Papers Track)
Abstract and authors: (click to expand)Abstract: As part of efforts to tackle climate change, grid-scale battery energy storage systems (BESS) play an essential role in facilitating reliable and secure power system operation with variable renewable energy (VRE). BESS can balance time-varying electricity demand and supply in the spot market through energy arbitrage and in the frequency control ancillary services (FCAS) market through service enablement or delivery. Effective algorithms are needed for the optimal participation of BESS in multiple markets. Using deep reinforcement learning (DRL), we present a BESS bidding strategy in the joint spot and contingency FCAS markets, leveraging a transformer-based temporal feature extractor to exploit the temporal trends of volatile energy prices. We validate our strategy on real-world historical energy prices in the Australian National Electricity Market (NEM). We demonstrate that the novel DRL-based bidding strategy significantly outperforms benchmarks. The simulation also reveals that the joint bidding in both the spot and contingency FCAS markets can yield a much higher profit than in individual markets. Our work provides a viable use case for the BESS, contributing to the power system operation with high penetration of renewables. Authors: Jinhao Li (Monash University); Changlong Wang (Monash University); Yanru Zhang (University of Electronic Science and Technology of China); Hao Wang (Monash University) |
Power and energy systems Reinforcement learning and control |
NeurIPS 2022 |
Curriculum Based Reinforcement Learning to Avert Cascading Failures in the Electric Grid
(Papers Track)
Abstract and authors: (click to expand)Abstract: We present an approach to integrate the domain knowledge of the electric power grid operations into reinforcement learning (RL) frameworks for effectively learning RL agents to prevent cascading failures. A curriculum-based approach with reward tuning is incorporated into the training procedure by modifying the environment using the network physics. Our procedure is tested on an actor-critic-based agent on the IEEE 14-bus test system using the RL environment developed by RTE, the French transmission system operator (TSO). We observed that naively training the RL agent without the curriculum approach failed to prevent cascading for most test scenarios, while the curriculum based RL agents succeeded in most test scenarios, illustrating the importance of properly integrating domain knowledge of physical systems for real-world RL applications. Authors: Amarsagar Reddy Ramapuram Matavalam (Arizona State University); Kishan Guddanti (Pacific Northwest National Lab); Yang Weng (Arizona State University) |
Power and energy systems Reinforcement learning and control |
NeurIPS 2022 |
Short-range forecasts of global precipitation using deep learning-augmented numerical weather prediction
(Papers Track)
Abstract and authors: (click to expand)Abstract: Precipitation drives the hydroclimate of Earth and its spatiotemporal changes on a day to day basis have one of the most notable socioeconomic impacts. The success of numerical weather prediction (NWP) is measured by the improvement of forecasts for various physical fields such as temperature and pressure. Large biases however exist in the precipitation predictions. Pure deep learning based approaches lack the advancements acheived by NWP in the past two to three decades. Hybrid methodology using NWP outputs as inputs to the deep learning based refinement tool offer an attractive means taking advantage of both NWP and state of the art deep learning algorithms. Augmenting the output from a well-known NWP model: Coupled Forecast System ver.2 (CFSv2) with deep learning for the first time, we demonstrate a hybrid model capability (DeepNWP) which shows substantial skill improvements for short-range global precipitation at 1-, 2- and 3-days lead time. To achieve this hybridization, we address the sphericity of the global data by using modified DLWP-CS architecture which transforms all the fields to cubed-sphere projection. The dynamical model outputs corresponding to precipitation and surface temperature are ingested to a UNET for predicting the target ground truth precipitation. While the dynamical model CFSv2 shows a bias in the range of +5 to +7 mm/day over land, the multivariate deep learning model reduces it to -1 to +1 mm/day over global land areas. We validate the results by taking examples from Hurricane Katrina in 2005, Hurricane Ivan in 2004, Central European floods in 2010, China floods in 2010, India floods in 2005 and the Myanmar cyclone Nargis in 2008. Authors: Manmeet Singh (The University of Texas at Austin); Vaisakh SB (Indian Institute of Tropical Meteorology); Nachiketa Acharya (Department of Meteorology and Atmospheric Science,Pennsylvania State University); Aditya Grover (UCLA); Suryachandra A. Rao (Indian Institute of Tropical Meteorology); Bipin Kumar (Indian Institute of Tropical Meteorology); Zong-Liang Yang (The University of Texas at Austin); Dev Niyogi (The University of Texas at Austin) |
Climate science and climate modeling Hybrid physical models |
NeurIPS 2022 |
A Multi-Scale Deep Learning Framework for Projecting Weather Extremes
(Papers Track)
Best Paper: ML Innovation
Abstract and authors: (click to expand)Abstract: Weather extremes are a major societal and economic hazard, claiming thousands of lives and causing billions of dollars in damage every year. Under climate change, their impact and intensity are expected to worsen significantly. Unfortunately, general circulation models (GCMs), which are currently the primary tool for climate projections, cannot characterize weather extremes accurately. To address this, we present a multi-resolution deep-learning framework that, firstly, corrects a GCM's biases by matching low-order and tail statistics of its output with observations at coarse scales; and secondly, increases the level of detail of the debiased GCM output by reconstructing the finer scales as a function of the coarse scales. We use the proposed framework to generate statistically realistic realizations of the climate over Western Europe from a simple GCM corrected using observational atmospheric reanalysis. We also discuss implications for probabilistic risk assessment of natural disasters in a changing climate. Authors: Antoine Blanchard (MIT); Nishant Parashar (Verisk Analytics); Boyko Dodov (Verisk Analytics); Christian Lessig (Otto-von-Guericke-Universitat Magdeburg); Themis Sapsis (MIT) |
Extreme weather Hybrid physical models |
NeurIPS 2022 |
A Global Classification Model for Cities using ML
(Papers Track)
Abstract and authors: (click to expand)Abstract: This paper develops a novel data set for three key resources use; namely, food, water, and energy, for 9000 cities globally. The data set is then utilized to develop a clustering approach as a starting point towards a global classification model. This novel clustering approach aims to contribute to developing an inclusive view of resource efficiency for all urban centers globally. The proposed clustering algorithm is comprised of three steps: first, outlier detection to address specific city characteristics, then a Variational Autoencoder (VAE), and finally, Agglomerative Clustering (AC) to improve the classification results. Our results show that this approach is more robust and yields better results in creating delimited clusters with high Calinski-Harabasz Index scores and Silhouette Coefficient than other baseline clustering methods. Authors: Doron Hazan (MIT); Mohamed Habashy (Massachusetts Institute of Technology); Mohanned ElKholy (Massachusetts Institute of Technology); Omer Mousa (American University in Cairo); Norhan M Bayomi (MIT Environmental Solutions Initiative); Matias Williams (Massachusetts Institute of Technology); John Fernandez (Massachusetts Institute of Technology) |
Cities and urban planning Classification, regression, and supervised learning Data mining |
NeurIPS 2022 |
EnhancedSD: Downscaling Solar Irradiance from Climate Model Projections
(Papers Track)
Abstract and authors: (click to expand)Abstract: Renewable energy-based electricity systems are seen as a keystone of future decarbonization efforts. However, power system planning does not currently consider the impacts of climate change on renewable energy resources such as solar energy, chiefly due to a paucity of climate-impacted high-resolution solar power data. Existing statistical downscaling (SD) methods that learn to map coarse-resolution versions of historical reanalysis data to generate finer resolution outputs are of limited use when applied to future climate model projections due to the domain gap between climate models and reanalysis data. In contrast, we present EnhancedSD, a deep learning-based framework for downscaling coarse-scale climate model outputs to high-resolution observational (reanalysis) data. Our proposed ML based downscaling allows for future reanalysis projections, which can be pivotal for mitigating climate change’s impacts on power systems planning. Authors: Nidhin Harilal (University of Colorado, Boulder); Bri-Mathias S Hodge (University of Colorado Boulder); Claire Monteleoni (University of Colorado Boulder); Aneesh Subramanian (University of California, San Diego) |
Computer vision and remote sensing Climate science and climate modeling Power and energy systems |
NeurIPS 2022 |
Positional Encoder Graph Neural Networks for Geographic Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Modeling spatial dependencies in geographic data is of crucial importance for the modeling of our planet. Graph neural networks (GNNs) provide a powerful and scalable solution for modeling continuous spatial data. However, in the absence of further context on the geometric structure of the data, they often rely on Euclidean distances to construct the input graphs. This assumption can be improbable in many real-world settings, where the spatial structure is more complex and explicitly non-Euclidean (e.g., road networks). In this paper, we propose PE-GNN, a new framework that incorporates spatial context and correlation explicitly into the models. Building on recent advances in geospatial auxiliary task learning and semantic spatial embeddings, our proposed method (1) learns a context-aware vector encoding of the geographic coordinates and (2) predicts spatial autocorrelation in the data in parallel with the main task. We show the effectiveness of our approach on two climate-relevant regression tasks: 3d spatial interpolation and air temperature prediction. The code for this study can be accessed via: https://bit.ly/3xDpfyV. Authors: Konstantin Klemmer (Microsoft Research); Nathan S Safir (University of Georgia); Daniel B Neill (New York University) |
Classification, regression, and supervised learning Climate science and climate modeling Earth observations and monitoring Extreme weather |
NeurIPS 2022 |
Image-based Early Detection System for Wildfires
(Papers Track)
Abstract and authors: (click to expand)Abstract: Wildfires are a disastrous phenomenon which cause damage to land, loss of property, air pollution, and even loss of human life. Due to the warmer and drier conditions created by climate change, more severe and uncontrollable wildfires are expected to occur in the coming years. This could lead to a global wildfire crisis and have dire consequences on our planet. Hence, it has become imperative to use technology to help prevent the spread of wildfires. One way to prevent the spread of wildfires before they become too large is to perform early detection i.e, detecting the smoke before the actual fire starts. In this paper, we present our Wildfire Detection and Alert System which use machine learning to detect wildfire smoke with a high degree of accuracy and can send immediate alerts to users. Our technology is currently being used in the USA to monitor data coming in from hundreds of cameras daily. We show that our system has a high true detection rate and a low false detection rate. Our performance evaluation study also shows that on an average our system detects wildfire smoke faster than an actual person. Authors: Omkar Ranadive (Alchera X); Jisu Kim (Alchera); Serin Lee (Alchera X); Youngseo Cha (Alchera); Heechan Park (Alchera); Minkook Cho (Alchera); Young Hwang (Alchera) |
Computer vision and remote sensing Disaster management and relief Other Classification, regression, and supervised learning |
NeurIPS 2022 |
Towards Global Crop Maps with Transfer Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: The continuous increase in global population and the impact of climate change on crop production are expected to affect the food sector significantly. In this context, there is need for timely, large-scale and precise mapping of crops for evidence-based decision making. A key enabler towards this direction are new satellite missions that freely offer big remote sensing data of high spatio-temporal resolution and global coverage. During the previous decade and because of this surge of big Earth observations, deep learning methods have dominated the remote sensing and crop mapping literature. Nevertheless, deep learning models require large amounts of annotated data that are scarce and hard-to-acquire. To address this problem, transfer learning methods can be used to exploit available annotations and enable crop mapping for other regions, crop types and years of inspection. In this work, we have developed and trained a deep learning model for paddy rice detection in South Korea using Sentinel-1 VH time-series. We then fine-tune the model for i) paddy rice detection in France and Spain and ii) barley detection in the Netherlands. Additionally, we propose a modification in the pre-trained weights in order to incorporate extra input features (Sentinel-1 VV). Our approach shows excellent performance when transferring in different areas for the same crop type and rather promising results when transferring in a different area and crop type. Authors: Hyun-Woo Jo (Korea University); Alkiviadis Marios Koukos (National Observatory of Athens); Vasileios Sitokonstantinou (National Observatory of Athens); Woo-Kyun Lee (Korea University); Charalampos Kontoes (National Observatory of Athens) |
Meta- and transfer learning Agriculture and food Earth observations and monitoring Forestry and other land use Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2022 |
Pyrocast: a Machine Learning Pipeline to Forecast Pyrocumulonimbus (PyroCb) Clouds
(Papers Track)
Abstract and authors: (click to expand)Abstract: Pyrocumulonimbus (pyroCb) clouds are storm clouds generated by extreme wildfires. PyroCbs are associated with unpredictable, and therefore dangerous, wildfire spread. They can also inject smoke particles and trace gases into the upper troposphere and lower stratosphere, affecting the Earth's climate. As global temperatures increase, these previously rare events are becoming more common. Being able to predict which fires are likely to generate pyroCb is therefore key to climate adaptation in wildfire-prone areas. This paper introduces Pyrocast, a pipeline for pyroCb analysis and forecasting. The pipeline's first two components, a pyroCb database and a pyroCb forecast model, are presented. The database brings together geostationary imagery and environmental data for over 148 pyroCb events across North America, Australia, and Russia between 2018 and 2022. Random Forests, Convolutional Neural Networks (CNNs), and CNNs pretrained with Auto-Encoders were tested to predict the generation of pyroCb for a given fire six hours in advance. The best model predicted pyroCb with an AUC of 0.90±0.04. Authors: Kenza Tazi (University of Cambridge); Emiliano Díaz Salas-Porras (University of Valencia); Ashwin Braude (Institut Pierre-Simon Laplace); Daniel Okoh (National Space Research and Development Agency); Kara D. Lamb (Columbia University); Duncan Watson-Parris (University of Oxford); Paula Harder (Fraunhofer ITWM); Nis Meinert (Pasteur Labs) |
Classification, regression, and supervised learning Earth observations and monitoring Computer vision and remote sensing |
NeurIPS 2022 |
Evaluating Digital Tools for Sustainable Agriculture using Causal Inference
(Papers Track)
Abstract and authors: (click to expand)Abstract: In contrast to the rapid digitalization of several industries, agriculture suffers from low adoption of climate-smart farming tools. Even though AI-driven digital agriculture can offer high-performing predictive functionalities, they lack tangible quantitative evidence on their benefits to the farmers. Field experiments can derive such evidence, but are often costly and time consuming. To this end, we propose an observational causal inference framework for the empirical evaluation of the impact of digital tools on target farm performance indicators. This way, we can increase farmers' trust via enhancing the transparency of the digital agriculture market, and in turn accelerate the adoption of technologies that aim to increase productivity and secure a sustainable and resilient agriculture against a changing climate. As a case study, we perform an empirical evaluation of a recommendation system for optimal cotton sowing, which was used by a farmers' cooperative during the growing season of 2021. We leverage agricultural knowledge to develop the causal graph of the farm system, we use the back-door criterion to identify the impact of recommendations on the yield and subsequently we estimate it using several methods on observational data. The results showed that a field sown according to our recommendations enjoyed a significant increase in yield 12% to 17%. Authors: Ilias Tsoumas (National Observatory of Athens); Georgios Giannarakis (National Observatory of Athens); Vasileios Sitokonstantinou (National Observatory of Athens); Alkiviadis Marios Koukos (National Observatory of Athens); Dimitra A Loka (Hellenic Agricultural Organization ELGO DIMITRA); Nikolaos S Bartsotas (National Observatory of Athens); Charalampos Kontoes (National Observatory of Athens); Ioannis N Athanasiadis (Wageningen University and Research) |
Causal and Bayesian methods Agriculture and food Earth observations and monitoring Extreme weather Societal adaptation and resilience |
NeurIPS 2022 |
Generating physically-consistent high-resolution climate data with hard-constrained neural networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: The availability of reliable, high-resolution climate and weather data is important to inform long-term decisions on climate adaptation and mitigation and to guide rapid responses to extreme events. Forecasting models are limited by computational costs and therefore often can only make coarse resolution predictions. Statistical downscaling can provide an efficient method of upsampling low-resolution data. In this field, deep learning has been applied successfully, often using image super-resolution methods from computer vision. Despite achieving visually compelling results in some cases, such models often violate conservation laws when predicting physical variables. In order to conserve important physical quantities, we developed a deep downscaling method that guarantees physical constraints are satisfied, by adding a renormalization layer at the end of the neural network. Furthermore, the constrained model also improves the performance according to standard metrics. We show the applicability of our methods across different popular architectures and upsampling factors using ERA5 reanalysis data. Authors: Paula Harder (Mila); Qidong Yang (New York University); Venkatesh Ramesh (Mila); Prasanna Sattigeri (IBM Research); Alex Hernandez-Garcia (Mila - Quebec AI Institute); Campbell D Watson (IBM Reserch); Daniela Szwarcman (IBM Research); David Rolnick (McGill University, Mila) |
Climate science and climate modeling Classification, regression, and supervised learning |
NeurIPS 2022 |
Transformers for Fast Emulation of Atmospheric Chemistry Box Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: When modeling atmospheric chemistry, concentrations are determined by numerically solving large systems of ordinary differential equations that represent a set of chemical reactions. These solvers can be very computationally intensive, particularly those with the thousands or tens of thousands of chemical species and reactions that make up the most accurate models. We demonstrate the application of a deep learning transformer architecture to emulate an atmospheric chemistry box model, and show that this attention-based model outperforms LSTM and autoencoder baselines while providing interpretable predictions that are more than 2 orders of magnitude faster than a numerical solver. This work is part of a larger study to replace the numerical solver in a 3D global chemical model with a machine learned emulator and achieve significant speedups for global climate simulations. Authors: Herbie Bradley (University of Cambridge); Nathan Luke Abraham (National Centre for Atmospheric Science, UK); Peer Nowack (Imperial College London); Doug McNeall (Met Office Hadley Centre, UK) |
Climate science and climate modeling Earth science Hybrid physical models Time-series analysis |
NeurIPS 2022 |
Flood Prediction with Graph Neural Networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is increasing the frequency of flooding around the world. As a consequence, there is a growing demand for effective flood prediction. Machine learning is a promising alternative to hydrodynamic models for flood prediction. However, existing approaches focus on capturing either the spatial or temporal flood patterns using CNNs or RNNs, respectively. In this work, we propose FloodGNN, which is a graph neural network (GNN) for flood prediction. Compared to existing approaches, FloodGNN (i) employs a graph-based model (GNN); (ii) operates on both spatial and temporal dimensions; and (iii) processes the water flow velocities as vector features, instead of scalar features. Experiments show that FloodGNN achieves promising results, outperforming an RNN-based baseline. Authors: Arnold N Kazadi (Rice University); James Doss-Gollin (Rice University); Antonia Sebastian (UNC Chapel Hill); Arlei Silva (Rice University) |
Extreme weather Classification, regression, and supervised learning |
NeurIPS 2022 |
Neural Representation of the Stratospheric Ozone Layer
(Papers Track)
Abstract and authors: (click to expand)Abstract: In climate modeling, the stratospheric ozone layer is typically only considered in a highly simplified form due to computational constraints. For climate projections, it would be of advantage to include the mutual interactions between stratospheric ozone, temperature, and atmospheric dynamics to accurately represent radiative forcing. The overarching goal of our research is to replace the ozone chemistry in climate models with a machine-learned neural representation of the stratospheric ozone chemistry that allows for a particularly fast, but accurate and stable simulation. We created a benchmark data set from pairs of input and output variables that we stored from simulations of a chemistry and transport model. We analyzed several variants of multilayer perceptrons suitable for physical problems to learn a neural representation of a function that predicts 24-hour ozone tendencies based on input variables. We performed a comprehensive hyperparameter optimization of the multilayer perceptron using Bayesian search and Hyperband early stopping. We validated our model by implementing it in a chemistry and transport model and comparing computation time, accuracy, and stability with the full chemistry module. We found that our model had a computation time that was a factor of 700 faster than the full chemistry module. The accuracy of our model compares favorably to the full chemistry module within a two-year simulation run, also outperforms a previous polynomial approach for fast ozone chemistry, and reproduces seasonality well in both hemispheres. In conclusion, the neural representation of stratospheric ozone chemistry in simulation resulted in an ozone layer that showed a high accuracy, significant speed-up, and stability in a long-term simulation. Authors: Helge Mohn (Alfred Wegener Institute for Polar and Marine Research); Daniel Kreyling (Alfred Wegener Institute for Polar and Marine Research); Ingo Wohltmann (Alfred Wegener Institute for Polar and Marine Research); Ralph Lehmann (Alfred Wegener Institute for Polar and Marine Research); Peter Maaß (University of Bremen); Markus Rex (Alfred Wegener Institute for Polar and Marine Research) |
Climate science and climate modeling Classification, regression, and supervised learning |
NeurIPS 2022 |
DL-Corrector-Remapper: A grid-free bias-correction deep learning methodology for data-driven high-resolution global weather forecasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: Data-driven models, such as FourCastNet (FCN), have shown exemplary performance in high-resolution global weather forecasting. This performance, however, is based on supervision on mesh-gridded weather data without the utilization of raw climate observational data, the gold standard ground truth. In this work we develop a methodology to correct, remap, and fine-tune gridded uniform forecasts of FCN so it can be directly compared against observational ground truth, which is sparse and non-uniform in space and time. This is akin to bias-correction and post-processing of numerical weather prediction (NWP), a routine operation at meteorological and weather forecasting centers across the globe. The Adaptive Fourier Neural Operator (AFNO) architecture is used as the backbone to learn continuous representations of the atmosphere. The spatially and temporally non-uniform output is evaluated by the non-uniform discrete inverse Fourier transform (NUIDFT) given the output query locations. We call this network the Deep-Learning-Corrector-Remapper (DLCR). The improvement in DLCR’s performance against the gold standard ground truth over the baseline’s performance shows its potential to correct, remap, and fine-tune the mesh-gridded forecasts under the supervision of observations. Authors: Tao Ge (Washington University in St. Louis); Jaideep Pathak (NVIDIA Corporation); Akshay Subramaniam (NVIDIA); Karthik Kashinath (NVIDIA) |
Classification, regression, and supervised learning Climate science and climate modeling Extreme weather |
NeurIPS 2022 |
Industry-scale CO2 Flow Simulations with Model-Parallel Fourier Neural Operators
(Papers Track)
Abstract and authors: (click to expand)Abstract: Carbon capture and storage (CCS) is one of the most promising technologies for reducing greenhouse gas emissions and relies on numerical reservoir simulations for identifying and monitoring CO2 storage sites. In many commercial settings however, numerical reservoir simulations are too computationally expensive for important downstream application such as optimization or uncertainty quantification. Deep learning-based surrogate models offer the possibility to solve PDEs many orders of magnitudes faster than conventional simulators, but they are difficult to scale to industrial-scale problem settings. Using model-parallel deep learning, we train the largest CO2 surrogate model to date on a 3D simulation grid with two million grid points. To train the 3D simulator, we generate a new training dataset based on a real-world CCS simulation benchmark. Once trained, each simulation with the network is five orders of magnitude faster than a numerical reservoir simulator and 4,500 times cheaper. This paves the way to applications that require thousands of (sequential) simulations, such as optimizing the location of CO2 injection wells to maximize storage capacity and minimize risk of leakage. Authors: Philipp A Witte (Microsoft); Russell Hewett (Microsoft); Ranveer Chandra (Microsoft Research) |
Carbon capture and sequestration Classification, regression, and supervised learning |
NeurIPS 2022 |
Adaptive Bias Correction for Improved Subseasonal Forecasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: Subseasonal forecasting — predicting temperature and precipitation 2 to 6 weeks ahead — is critical for effective water allocation, wildfire management, and drought and flood mitigation. Recent international research efforts have advanced the subseasonal capabilities of operational dynamical models, yet temperature and precipitation prediction skills remains poor, partly due to stubborn errors in representing atmospheric dynamics and physics inside dynamical models. To counter these errors, we introduce an adaptive bias correction (ABC) method that combines state-of-the-art dynamical forecasts with observations using machine learning. When applied to the leading subseasonal model from the European Centre for Medium-Range Weather Forecasts (ECMWF), ABC improves temperature forecasting skill by 60-90% and precipitation forecasting skill by 40-69% in the contiguous U.S. We couple these performance improvements with a practical workflow, based on Cohort Shapley, for explaining ABC skill gains and identifying higher-skill windows of opportunity based on specific climate conditions. Authors: Soukayna Mouatadid (University of Toronto); Paulo Orenstein (IMPA); Genevieve E Flaspohler (MIT); Judah Cohen (AER); Miruna Oprescu (Cornell University); Ernest Fraenkel (MIT); Lester Mackey (Microsoft Research New England) |
Climate science and climate modeling Extreme weather |
NeurIPS 2022 |
Nowformer : A Locally Enhanced Temporal Learner for Precipitation Nowcasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: The precipitation video datasets have distinctive meteorological patterns where a mass of fluid moves in a particular direction across the entire frames, and each local area of the fluid has an individual life cycle from initiation to maturation to decay. This paper proposes a novel transformer-based model for precipitation nowcasting that can extract global and local dynamics within meteorological characteristics. The experimental results show our model achieves state-of-the-art performances on the precipitation nowcasting benchmark. Authors: Jinyoung Park (KAIST); Inyoung Lee (KAIST); Minseok Son (KAIST); Seungju Cho (KAIST); Changick Kim (KAIST) |
Computer vision and remote sensing Earth observations and monitoring |
NeurIPS 2022 |
An Unsupervised Learning Perspective on the Dynamic Contribution to Extreme Precipitation Changes
(Papers Track)
Abstract and authors: (click to expand)Abstract: Despite the importance of quantifying how the spatial patterns of extreme precipitation will change with warming, we lack tools to objectively analyze the storm-scale outputs of modern climate models. To address this gap, we develop an unsupervised machine learning framework to quantify how storm dynamics affect precipitation extremes and their changes without sacrificing spatial information. Over a wide range of precipitation quantiles, we find that the spatial patterns of extreme precipitation changes are dominated by spatial shifts in storm regimes rather than intrinsic changes in how these storm regimes produce precipitation. Authors: Griffin S Mooers (UC Irvine); Tom Beucler (University of Lausanne); Michael Pritchard (UCI); Stephan Mandt (University of California, Irivine) |
Climate science and climate modeling Unsupervised and semi-supervised learning |
NeurIPS 2022 |
An Interpretable Model of Climate Change Using Correlative Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Determining changes in global temperature and precipitation that may indicate climate change is complicated by annual variations. One approach for finding potential climate change indicators is to train a model that predicts the year from annual means of global temperatures and precipitations. Such data is available from the CMIP6 ensemble of simulations. Here a two-hidden-layer neural network trained on this data successfully predicts the year. Differences among temperature and precipitation patterns for which the model predicts specific years reveal changes through time. To find these optimal patterns, a new way of interpreting what the neural network has learned is explored. Alopex, a stochastic correlative learning algorithm, is used to find optimal temperature and precipitation maps that best predict a given year. These maps are compared over multiple years to show how temperature and precipitations patterns indicative of each year change over time. Authors: Charles Anderson (Colorado State University); Jason Stock (Colorado State University) |
Climate science and climate modeling Interpretable ML |
NeurIPS 2022 |
Multimodal Wildland Fire Smoke Detection
(Papers Track)
Abstract and authors: (click to expand)Abstract: Research has shown that climate change creates warmer temperatures and drier conditions, leading to longer wildfire seasons and increased wildfire risks in the United States. These factors have in turn led to increases in the frequency, extent, and severity of wildfires in recent years. Given the danger posed by wildland fires to people, property, wildlife, and the environment, there is an urgency to provide tools for effective wildfire management. Early detection of wildfires is essential to minimizing potentially catastrophic destruction. In this paper, we present our work on integrating multiple data sources in SmokeyNet, a deep learning model using spatio-temporal information to detect smoke from wildland fires. Camera image data is integrated with weather sensor measurements and processed by SmokeyNet to create a multimodal wildland fire smoke detection system. Our results show that incorporating multimodal data in SmokeyNet improves performance in terms of both F1 and time-to-detection over the baseline with a single data source. With a time-to-detection of only a few minutes, SmokeyNet can serve as an automated early notification system, providing a useful tool in the fight against destructive wildfires. Authors: Mai Nguyen (University of California San Diego); Shreyas Anantha Ramaprasad (University of California San Diego); Jaspreet Kaur Bhamra (University of California San Diego); Siddhant Baldota (University of California San Diego); Garrison Cottrell (UC San Diego) |
Disaster management and relief Computer vision and remote sensing |
NeurIPS 2022 |
Using uncertainty-aware machine learning models to study aerosol-cloud interactions
(Papers Track)
Abstract and authors: (click to expand)Abstract: Aerosol-cloud interactions (ACI) include various effects that result from aerosols entering a cloud, and affecting cloud properties. In general, an increase in aerosol concentration results in smaller droplet sizes which leads to larger, brighter, longer-lasting clouds that reflect more sunlight and cool the Earth. The strength of the effect is however heterogeneous, meaning it depends on the surrounding environment, making ACI one of the most uncertain effects in our current climate models. In our work, we use causal machine learning to estimate ACI from satellite observations by reframing the problem as a treatment (aerosol) and outcome (change in droplet radius). We predict the causal effect of aerosol on clouds with uncertainty bounds depending on the unknown factors that may be influencing the impact of aerosol. Of the three climate models evaluated, we find that only one plausibly recreates the trend, lending more credence to its estimate cooling due to ACI. Authors: Maëlys Solal (University of Oxford); Andrew Jesson (University of Oxford); Yarin Gal (University of Oxford); Alyson Douglas (University of Oxford) |
Causal and Bayesian methods Climate science and climate modeling Earth observations and monitoring Uncertainty quantification and robustness |
NeurIPS 2022 |
Accessible Large-Scale Plant Pathology Recognition
(Papers Track)
Abstract and authors: (click to expand)Abstract: Plant diseases are costly and threaten agricultural production and food security worldwide. Climate change is increasing the frequency and severity of plant diseases and pests. Therefore, detection and early remediation can have a significant impact, especially in developing countries. However, AI solutions are yet far from being in production. The current process for plant disease diagnostic consists of manual identification and scoring by humans, which is time-consuming, low-supply, and expensive. Although computer vision models have shown promise for efficient and automated plant disease identification, there are limitations for real-world applications: a notable variation in visual symptoms of a single disease, different light and weather conditions, and the complexity of the models. In this work, we study the performance of efficient classification models and training "tricks" to solve this problem. Our analysis represents a plausible solution for these ecological disasters and might help to assist producers worldwide. More information available at: https://github.com/mv-lab/mlplants Authors: Marcos V. Conde (University of Würzburg); Dmitry Gordeev (H2O.ai) |
Computer vision and remote sensing Disaster management and relief Forestry and other land use Health Active learning Classification, regression, and supervised learning |
NeurIPS 2022 |
Dynamic weights enabled Physics-Informed Neural Network for simulating the mobility of Engineered Nano Particles in a contaminated aquifer
(Papers Track)
Abstract and authors: (click to expand)Abstract: Numerous polluted groundwater sites across the globe require an active remediation strategy for the restoration of natural environmental conditions and local ecosystem. The Engineered Nanoparticles (ENPs) has emerged as an efficient reactive agent for the in-situ degradation of groundwater contaminants. While the performance of these ENPs has been highly promising on the laboratory scale, their application in a real field case conditions is still limited. The optimized injection of the ENPs in the contaminated aquifer and its subsequent monitoring are hindered by the complex transport and retention mechanisms of ENPs. Therefore, a predictive tool for understanding the transport and retention behavior of ENPs becomes highly important. The existing tools in the literature are dominated with numerical simulators, which have limited flexibility and accuracy in the presence of sparse dataset. In this work, a dynamic weights enabled Physics-Informed Neural network (dw-PINN) framework is applied to model the nano-particle´s behavior within an aquifer. The result from the forward model demonstrates the effective capability of dw-PINN in accurately predicting the ENPs mobility. The model verification step shows that the mean squared error of the predicted ENPs concentration using dw-PINN converges to a minimum value of 1.3e-5. In the subsequent step, the result from the inverse model estimates the governing parameters of ENPs mobility with reasonable accuracy. The research work demonstrates the tool´s capability in providing predictive insights for the development of an efficient groundwater remediation strategy. Authors: Shikhar Nilabh (Amphos21) |
Earth science Hybrid physical models |
NeurIPS 2022 |
Calibration of Large Neural Weather Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Uncertainty quantification of weather forecasts is a necessity for reliably planning for and responding to extreme weather events in a warming world. This motivates the need for well-calibrated ensembles in probabilistic weather forecasting. We present initial results for the calibration of large-scale deep neural weather models for data-driven probabilistic weather forecasting. By explicitly accounting for uncertainties about the forecast's initial condition and model parameters, we generate ensemble forecasts that show promising results on standard diagnostics for probabilistic forecasts. Specifically, we are approaching the Integrated Forecasting System (IFS), the gold standard on probabilistic weather forecasting, on: (i) the spread-error agreement; and (ii) the Continuous Ranked Probability Score (CRPS). Our approach scales to state-of-the-art data-driven weather models, enabling cheap post-hoc calibration of pretrained models with tens of millions of parameters and paving the way towards the next generation of well-calibrated data-driven weather models. Authors: Andre Graubner (Nvidia); Kamyar Kamyar Azizzadenesheli (Nvidia); Jaideep Pathak (NVIDIA Corporation); Morteza Mardani (Nvidia); Mike Pritchard (Nvidia); Karthik Kashinath (Nvidia); Anima Anandkumar (NVIDIA/Caltech) |
Uncertainty quantification and robustness Climate science and climate modeling |
NeurIPS 2022 |
Learning to forecast vegetation greenness at fine resolution over Africa with ConvLSTMs
(Papers Track)
Abstract and authors: (click to expand)Abstract: Forecasting the state of vegetation in response to climate and weather events is a major challenge. Its implementation will prove crucial in predicting crop yield, forest damage, or more generally the impact on ecosystems services relevant for socio-economic functioning, which if absent can lead to humanitarian disasters. Vegetation status depends on weather and environmental conditions that modulate complex ecological processes taking place at several timescales. Interactions between vegetation and different environmental drivers express responses at instantaneous but also time-lagged effects, often showing an emerging spatial context at landscape and regional scales. We formulate the land surface forecasting task as a strongly guided video prediction task where the objective is to forecast the vegetation developing at very fine resolution using topography and weather variables to guide the prediction. We use a Convolutional LSTM (ConvLSTM) architecture to address this task and predict changes in the vegetation state in Africa using Sentinel-2 satellite NDVI, having ERA5 weather reanalysis, SMAP satellite measurements, and topography (DEM of SRTMv4.1) as variables to guide the prediction. Ours results highlight how ConvLSTM models can not only forecast the seasonal evolution of NDVI at high resolution, but also the differential impacts of weather anomalies over the baselines. The model is able to predict different vegetation types, even those with very high NDVI variability during target length. Authors: Claire Robin (Biogeochemical Integration, Max-Planck-Institute for Biogeochemistry, Jena, Germany); Christian Requena-Mesa (Computer Vision Group, Friedrich Schiller University Jena; DLR Institute of Data Science, Jena; Max Planck Institute for Biogeochemistry, Jena); Vitus Benson (Max-Planck-Institute for Biogeochemistry); Jeran Poehls (Max-Planck-Institute for Biogeochemistry); Lazaro Alonzo (Max-Planck-Institute for Biogeochemistry Max-Planck-Institute for Biogeochemistry); Nuno Carvalhais (Max-Planck-Institute for Biogeochemistry); Markus Reichstein (Max Planck Institute for Biogeochemistry, Jena; Michael Stifel Center Jena for Data-Driven and Simulation Science, Jena) |
Computer vision and remote sensing Agriculture and food Ecosystems and biodiversity Extreme weather Time-series analysis |
NeurIPS 2022 |
Generative Modeling of High-resolution Global Precipitation Forecasts
(Papers Track)
Abstract and authors: (click to expand)Abstract: Forecasting global precipitation patterns and, in particular, extreme precipitation events is of critical importance to preparing for and adapting to climate change. Making accurate high-resolution precipitation forecasts using traditional physical models remains a major challenge in operational weather forecasting as they incur substantial computational costs and struggle to achieve sufficient forecast skill. Recently, deep-learning-based models have shown great promise in closing the gap with numerical weather prediction (NWP) models in terms of precipitation forecast skill, opening up exciting new avenues for precipitation modeling. However, it is challenging for these deep learning models to fully resolve the fine-scale structures of precipitation phenomena and adequately characterize the extremes of the long-tailed precipitation distribution. In this work, we present several improvements to the architecture and training process of a current state-of-the art deep learning precipitation model (FourCastNet) using a novel generative adversarial network (GAN) to better capture fine scales and extremes. Our improvements achieve superior performance in capturing the extreme percentiles of global precipitation, while comparable to state-of-the-art NWP models in terms of forecast skill at 1--2 day lead times. Together, these improvements set a new state-of-the-art in global precipitation forecasting. Authors: James Duncan (University of California, Berkeley); Peter Harrington (Lawrence Berkeley National Laboratory (Berkeley Lab)); Shashank Subramanian (Lawrence Berkeley National Laboratory) |
Climate science and climate modeling Extreme weather Computer vision and remote sensing Generative modeling |
NeurIPS 2022 |
Learning Surrogates for Diverse Emission Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Transportation plays a major role in global CO2 emission levels, a factor that directly connects with climate change. Roadway interventions that reduce CO2 emission levels have thus become a timely requirement. An integral need in assessing the impact of such roadway interventions is access to industry-standard programmatic and instantaneous emission models with various emission conditions such as fuel types, vehicle types, cities of interest, etc. However, currently, there is a lack of well-calibrated emission models with all these properties. Addressing these limitations, this paper presents 1100 programmatic and instantaneous vehicular CO2 emission models with varying fuel types, vehicle types, road grades, vehicle ages, and cities of interest. We hope the presented emission models will facilitate future research in tackling transportation-related climate impact. The released version of the emission models can be found here. Authors: Edgar Ramirez Sanchez (MIT); Catherine H Tang (Massachusetts Institute of Technology); Vindula Jayawardana (MIT); Cathy Wu (MIT) |
Transportation Cities and urban planning Classification, regression, and supervised learning Data mining |
NeurIPS 2022 |
Continual VQA for Disaster Response Systems
(Papers Track)
Abstract and authors: (click to expand)Abstract: Visual Question Answering (VQA) is a multi-modal task that involves answering questions from an input image, semantically understanding the contents of the image and answering it in natural language. Using VQA for disaster management is an important line of research due to the scope of problems that are answered by the VQA system. However, the main challenge is the delay caused by the generation of labels in the assessment of the affected areas. To tackle this, we deployed pre-trained CLIP model, which is trained on visual-image pairs. however, we empirically see that the model has poor zero-shot performance. Thus, we instead use pre-trained embeddings of text and image from this model for our supervised training and surpass previous state-of-the-art results on the FloodNet dataset. We expand this to a continual setting, which is a more real-life scenario. We tackle the problem of catastrophic forgetting using various experience replay methods. Authors: Aditya Kane (Pune Institute of Computer Technology); V Manushree (Manipal Institute Of Technology); Sahil S Khose (Georgia Institute of Technology) |
Disaster management and relief Active learning |
NeurIPS 2022 |
Performance evaluation of deep segmentation models on Landsat-8 imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Contrails, short for condensation trails, are line-shaped ice clouds produced by aircraft engine exhaust when they fly through the cold and humid air. They generate a greenhouse effect by absorbing or directing back to Earth approximately 33% of emitted outgoing longwave radiation. They account for over half of the climate change resulting from aviation activities. Avoiding contrails and adjusting flight routes could be an inexpensive and effective way to reduce their impact. An accurate, automated, and reliable detection algorithm is required to develop and evaluate contrail avoidance strategies. Advancement in contrail detection has been severely limited due to several factors, primarily due to a lack of quality-labelled data. Recently, McCloskey et al. proposed a large human-labelled Landsat-8. Each contrail is carefully labelled with various inputs in various scenes of Landsat-8 satellite imagery. In this work, we benchmark several popular segmentation models with combinations of different loss functions and encoder backbones. This work is the first to apply state-of-the-art segmentation techniques to detect contrails in low-orbit satellite imagery. Our work can also be used as an open benchmark for contrail segmentation. Authors: Akshat Bhandari (Manipal Institute of Technology, Manipal); Pratinav Seth (Manipal Institute of Technology); Sriya Rallabandi (Manipal Institute of Technology); Aditya Kasliwal (Manipal Institute of Technology); Sanchit Singhal (Manipal Institute of Technology) |
Earth observations and monitoring Computer vision and remote sensing |
NeurIPS 2022 |
Synthesis of Realistic Load Data: Adversarial Networks for Learning and Generating Residential Load Patterns
(Papers Track)
Abstract and authors: (click to expand)Abstract: Responsible energy consumption plays a key role in reducing carbon footprint and CO2 emissions to tackle climate change. A better understanding of the residential consumption behavior using smart meter data is at the heart of the mission, which can inform residential demand flexibility, appliance scheduling, and home energy management. However, access to high-quality residential load data is still limited due to the cost-intensive data collection process and privacy concerns of data shar- ing. In this paper, we develop a Generative Adversarial Network (GAN)-based method to model the complex and diverse residential load patterns and generate synthetic yet realistic load data. We adopt a generation-focused weight selection method to select model weights to address the mode collapse problem and generate diverse load patterns. We evaluate our method using real-world data and demon- strate that it outperforms three representative state-of-the-art benchmark models in better preserving the sequence level temporal dependencies and aggregated level distributions of load patterns. Authors: Xinyu Liang (Monash University); Hao Wang (Monash University) |
Power and energy systems Generative modeling |
NeurIPS 2022 |
Guided Transformer Network for Detecting Methane Emissions in Sentinel-2 Satellite Imagery
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Methane (CH4) is the chief contributor to global climate change and its mitigation is targeted by the EU, US and jurisdictions worldwide [2]. Recent studies have shown that imagery from the multi-spectral instrument on Sentinel-2 satellites is capable of detecting and estimating large methane emissions. However, most of the current methods rely on temporal relations between a ratio of shortwave-infrared spectra and assume relatively constant ground conditions, and availability of ground information on when there was no methane emission on site. To address such limitations we propose a guided query-based transformer neural network architecture, that will detect and quantify methane emissions without dependence on temporal information. The guided query aspect of our architecture is driven by a Sentinel Enhanced Matched Filter (SEMF) approach, also discussed in this paper. Our network uses all 12 spectral channels of Sentinel-2 imagery to estimate ground terrain and detect methane emissions. No dependence on temporal data makes it more robust to changing ground and terrain conditions and more computationally efficient as it reduces the need to process historical time-series imagery to compute a single date emissions analysis. Authors: Satish Kumar (University of California, Santa Barbara); William Kingwill (Orbio Earth); Rozanne Mouton (Orbio Earth); Wojciech Adamczyk (ETH, Zurich); Robert Huppertz (Orbio Earth); Evan D Sherwin (Stanford University, Energy and Resources Engineering) |
Carbon capture and sequestration Earth observations and monitoring Computer vision and remote sensing |
NeurIPS 2022 |
Identification of medical devices using machine learning on distribution feeder data for informing power outage response
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Power outages caused by extreme weather events due to climate change have doubled in the United States in the last two decades. Outages pose severe health risks to over 4.4 million individuals dependent on in-home medical devices. Data on the number of such individuals residing in a given area is limited. This study proposes a load disaggregation model to predict the number of medical devices behind an electric distribution feeder. This data can be used to inform planning and response to power outages and other emergencies. The proposed solution serves as a measure for climate change adaptation. Authors: Paraskevi Kourtza (University of Edinburgh); Maitreyee Marathe (University of Wisconsin-Madison); Anuj Shetty (Stanford University); Diego Kiedanski (Yale University) |
Power and energy systems Disaster management and relief Health Classification, regression, and supervised learning Time-series analysis |
NeurIPS 2022 |
Analyzing the global energy discourse with machine learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: To transform our economy towards net-zero emissions, industrial development of clean energy technologies (CETs) to replace fossil energy technologies (FETs) is crucial. Although the media has great power in influencing consumer behavior and decision making in business and politics, its role in the energy transformation is still underexplored. In this paper, we analyze the global energy discourse via machine learning. For this, we collect a large-scale dataset with ~5 million news articles from seven of the world’s major CO2 emitting countries, covering eight CETs and four FETs. Using machine learning, we then analyze the content of news articles on a highly granular level and along several dimensions, namely relevance (for the energy discourse), context (e.g., costs, regulation, investment), and connotations (e.g., high/increasing vs. low/decreasing costs). By linking empirical discourse patterns to investment and deployment data of CETs and FETs, this study advances the current understanding about the role of the media in the energy transformation. Thereby, it enables businesses, investors, and policy makers to respond more effectively to sensitive topics in the media discourse and leverage windows of opportunity for scaling CETs. Authors: Malte Toetzke (ETH Zurich); Benedict Probst (ETH Zurich); Yasin Tatar (ETH Zurich); Stefan Feuerriegel (LMU Munich); Volker Hoffmann (ETH Zurich) |
Public policy Behavioral and social science Natural language processing |
NeurIPS 2022 |
Deep-S2SWind: A data-driven approach for improving Sub-seasonal wind predictions
(Proposals Track)
Abstract and authors: (click to expand)Abstract: A major transformation to mitigate climate change implies a rapid decarbonisation of the energy system and thus, increasing the use of renewable energy sources, such as wind power. However, renewable resources are strongly dependent on local and large-scale weather conditions, which might be influenced by climate change. Weather-related risk assessments are essential for the energy sector, in particular, for power system management decisions, for which forecasts of climatic conditions from several weeks to months (i.e. sub-seasonal scales) are of key importance. Here, we propose a data-driven approach to predict wind speed at longer lead-times that can benefit the energy sector. The main goal of this study is to assess the potential of machine learning algorithms to predict periods of low wind speed conditions that have a strong impact on the energy sector. Authors: Noelia Otero Felipe (University of Bern); Pascal Horton (University of Bern) |
Climate science and climate modeling Earth observations and monitoring Interpretable ML Time-series analysis |
NeurIPS 2022 |
Towards Low Cost Automated Monitoring of Life Below Water to De-risk Ocean-Based Carbon Dioxide Removal and Clean Power
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Oceans will play a crucial role in our efforts to combat the growing climate emergency. Researchers have proposed several strategies to harness greener energy through oceans and use oceans as carbon sinks. However, the risks these strategies might pose to the ocean and marine ecosystem are not well understood. It is imperative that we quickly develop a range of tools to monitor ocean processes and marine ecosystems alongside the technology to deploy these solutions on a large scale into the oceans. Large arrays of inexpensive cameras placed deep underwater coupled with machine learning pipelines to automatically detect, classify, count and estimate fish populations have the potential to continuously monitor marine ecosystems and help study the impacts of these solutions on the ocean. In this proposal, we discuss the challenges presented by a dark artificially lit underwater video dataset captured 500m below the surface, propose potential solutions to address these challenges, and present preliminary results from detecting and classifying 6 species of fish in deep underwater camera data. Authors: Kameswari Devi Ayyagari (Dalhousie University); Christopher Whidden (Dalhousie University); Corey Morris (Department of Fisheries and Oceans); Joshua Barnes (National Research Council Canada) |
Oceans and marine systems Climate science and climate modeling Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2022 |
Towards the Automatic Analysis of Ceilometer Backscattering Profiles using Unsupervised Learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Ceilometers use a laser beam to capture certain phenomena in the atmosphere like clouds, precipitation, or aerosol layers. These measurements can be visualized in so-called quick looks that at the moment are mostly analyzed manually by meteorology experts. In this work, we illustrate the path towards the automatic analysis of quick looks by using a hybrid approach combining an image segmentation algorithm with unsupervised representation learning and clustering. We present a first proof of concept and give an outlook on possible future work. Authors: Michael Dammann (HAW Hamburg); Ina Mattis (Deutscher Wetterdienst); Michael Neitzke (HAW Hamburg); Ralf Möller (University of Lübeck) |
Climate science and climate modeling Extreme weather Computer vision and remote sensing Unsupervised and semi-supervised learning |
NeurIPS 2022 |
Modelling the performance of delivery vehicles across urban micro-regions to accelerate the transition to cargo-bike logistics
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Light goods vehicles (LGV) used extensively in the last mile of delivery are one of the leading polluters in cities. Cargo-bike logistics has been put forward as a high impact candidate for replacing LGVs, with experts estimating over half of urban van deliveries being replaceable by cargo bikes, due to their faster speeds, shorter parking times and more efficient routes across cities. By modelling the relative delivery performance of different vehicle types across urban micro-regions, machine learning can help operators evaluate the business and environmental impact of adding cargo-bikes to their fleets. In this paper, we introduce two datasets, and present initial progress in modelling urban delivery service time (e.g. cruising for parking, unloading, walking). Using Uber’s H3 index to divide the cities into hexagonal cells, and aggregating OpenStreetMap tags for each cell, we show that urban context is a critical predictor of delivery performance. Authors: Max C Schrader (University of Alabama); Navish Kumar (IIT Kharagpur); Nicolas Collignon (University of Edinburgh); Maria S Astefanoaei (IT University of Copenhagen); Esben Sørig (Kale Collective); Soonmyeong Yoon (Kale Collective); Kai Xu (University of Edinburgh); Akash Srivastava (MIT-IBM) |
Cities and urban planning Transportation Classification, regression, and supervised learning Data mining Unsupervised and semi-supervised learning |
NeurIPS 2022 |
An Inversion Algorithm of Ice Thickness and InSAR Data for the State of Friction at the Base of the Greenland Ice Sheet
(Proposals Track)
Abstract and authors: (click to expand)Abstract: With the advent of climate change and global warming, the Greenland Ice Sheet (GrIS) has been melting at an alarming rate, losing over 215 Gt per yr, and accounting for 10% of mean global sea level rise since the 1990s. It is imperative to understand what dynamics are causing ice loss and influencing ice flow in order to successfully project mass changes of ice sheets and associated sea level rise. This work applies machine learning, ice thickness data, and horizontal ice velocity measurements from satellite radar data to quantify the magnitudes and distributions of the basal traction forces that are holding the GrIS back from flowing into the ocean. Our approach uses a hybrid model: InSAR velocity data trains a linear regression model, and these model coefficients are fed into a geophysical algorithm to estimate basal tractions that capture relationships between the ice motion and physical variables. Results indicate promising model performance and reveal significant presence of large basal traction forces around the coastline of the GrIS. Authors: Aryan Jain (Amador Valley High School); Jeonghyeop Kim (Stony Brook University); William Holt (Stony Brook University) |
Earth science Earth observations and monitoring Classification, regression, and supervised learning Hybrid physical models |
NeurIPS 2022 |
Deep learning-based bias adjustment of decadal climate predictions
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Decadal climate predictions are key to inform adaptation strategies in a warming climate. Coupled climate models used for decadal predictions are, however, imperfect representations of the climate system leading to forecast biases. Biases can also result from a poor model initialization that, when combined with forecast drift, can produce errors depending non-linearly on lead time. We propose a deep learning-based bias correction approach for the post-processing of gridded forecasts to enhance the accuracy of decadal predictions. Authors: Reinel Sospedra-Alfonso (Environment and Climate Change Canada); Johannes Exenberger (Graz University of Technology); Marie C McGraw (Cooperative Institute for Research in the Atmosphere | CIRA); Trung Kien Dang (National University of Singapore) |
Climate science and climate modeling Classification, regression, and supervised learning |
NeurIPS 2022 |
Surrogate Modeling for Methane Dispersion Simulations Using Fourier Neural Operator
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Methane leak detection and remediation are critical for tackling climate change, where methane dispersion simulations play an important role in emission source attribution. As 3D modeling of methane dispersion is often costly and time-consuming, we train a deep-learning-based surrogate model using the Fourier Neural Operator to learn the PDE solver in our study. Our preliminary result shows that our surrogate modeling provides a fast, accurate and cost-effective solution to methane dispersion simulations, thus reducing the cycle time of methane leak detection. Authors: Qie Zhang (Microsoft); Mirco Milletari (Microsoft); Yagna Deepika Oruganti (Microsoft); Philipp A Witte (Microsoft) |
Climate science and climate modeling Earth observations and monitoring Classification, regression, and supervised learning |
NeurIPS 2022 |
Detecting Floods from Cloudy Scenes: A Fusion Approach Using Sentinel-1 and Sentinel-2 Imagery
(Proposals Track)
Abstract and authors: (click to expand)Abstract: As the result of climate change, extreme flood events are becoming more frequent. To better respond to such disasters, and to test and calibrate flood models, we need accurate real-world data on flooding extent. Detection of floods from remote sensed imagery suffers from a widespread problem: clouds block flood scenes in images, leading to degraded and fragmented flood datasets. To address this challenge, we propose a workflow based on U-Net, and a dataset that detects flood in cloud-prone areas by fusing information from the Sentinel-1 and Sentinel-2 satellites. The expected result will be a reliable and detailed catalogue of flood extents and how they change through time, allowing us to better understand flooding in different morphological settings and climates. Authors: Qiuyang Chen (University of Edinburgh); Xenofon Karagiannis (Earth-i Ltd.); Simon M. Mudd (University of Edinburgh) |
Computer vision and remote sensing Disaster management and relief Extreme weather Classification, regression, and supervised learning Other Time-series analysis |
NeurIPS 2022 |
Urban Heat Island Detection and Causal Inference Using Convolutional Neural Networks
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Compared to rural areas, urban areas experience higher temperatures for longer periods of time because of the urban heat island (UHI) effect. This increased heat stress leads to greater mortality, increased energy demand, regional changes to precipitation patterns, and increased air pollution. Urban developers can minimize the UHI effect by incorporating features that promote air flow and heat dispersion (e.g., increasing green space). However, understanding which urban features to implement is complex, as local meteorology strongly dictates how the environment responds to changes in urban form. In this proposal we describe a methodology for estimating the causal relationship between changes in urban form and changes in the UHI effect. Changes in urban form and temperature changes are measured using convolutional neural networks, and a causal inference matching approach is proposed to estimate causal relationships. The success of this methodology will enable urban developers to implement city-specific interventions to mitigate the warming planet's impact on cities. Authors: Zachary D Calhoun (Duke University); Ziyang Jiang (Duke University); Mike Bergin (Duke University); David Carlson (Duke University) |
Cities and urban planning Buildings Climate justice Climate science and climate modeling Disaster management and relief Extreme weather Health Causal and Bayesian methods Classification, regression, and supervised learning Computer vision and remote sensing Time-series analysis |
NeurIPS 2022 |
Forecasting Global Drought Severity and Duration Using Deep Learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Drought detection and prediction is challenging due to the slow onset of the event and varying degrees of dependence on numerous physical and socio-economic factors that differentiate droughts from other natural disasters. In this work, we propose DeepXD (Deep learning for Droughts), a deep learning model with 26 physics-informed input features for SPI (Standardised Precipitation Index) forecasting to identify and classify droughts using monthly oceanic indices, global meteorological and vegetation data, location (latitude, longitude) and land cover for the years 1982 to 2018. In our work, we propose extracting features by considering the atmosphere and land moisture and energy budgets and forecasting global droughts on a seasonal and an annual scale at 1, 3, 6, 9, 12 and 24 months lead times. SPI helps us to identify the severity and the duration of the drought to classify them as meteorological, agricultural and hydrological. Authors: Akanksha Ahuja (NOA); Xin Rong Chua (Centre for Climate Research Singapore) |
Classification, regression, and supervised learning Agriculture and food Climate science and climate modeling Disaster management and relief Earth observations and monitoring Earth science Extreme weather Societal adaptation and resilience Data mining |
NeurIPS 2022 |
ForestBench: Equitable Benchmarks for Monitoring, Reporting, and Verification of Nature-Based Solutions with Machine Learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Restoring ecosystems and reducing deforestation are necessary tools to mitigate the anthropogenic climate crisis. Current measurements of forest carbon stock can be inaccurate, in particular for underrepresented and small-scale forests in the Global South, hindering transparency and accountability in the Monitoring, Reporting, and Verification (MRV) of these ecosystems. There is thus need for high quality datasets to properly validate ML-based solutions. To this end, we present ForestBench, which aims to collect and curate geographically-balanced gold-standard datasets of small-scale forest plots in the Global South, by collecting ground-level measurements and visual drone imagery of individual trees. These equitable validation datasets for ML-based MRV of nature-based solutions shall enable assessing the progress of ML models for estimating above-ground biomass, ground cover, and tree species diversity. Authors: Lucas Czech (Carnegie Institution for Science); Björn Lütjens (MIT); David Dao (ETH Zurich) |
Earth observations and monitoring Carbon capture and sequestration Climate finance and economics Climate justice Ecosystems and biodiversity Forestry and other land use Local and indigenous knowledge systems Public policy Classification, regression, and supervised learning Computer vision and remote sensing Other |
NeurIPS 2022 |
Estimating Heating Loads in Alaska using Remote Sensing and Machine Learning Methods
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Alaska and the larger Arctic region are in much greater need of decarbonization than the rest of the globe as a result of the accelerated consequences of climate change over the past ten years. Heating for homes and businesses accounts for over 75% of the energy used in the Arctic region. However, the lack of thorough and precise heating load estimations in these regions poses a significant obstacle to the transition to renewable energy. In order to accurately measure the massive heating demands in Alaska, this research pioneers a geospatial-first methodology that integrates remote sensing and machine learning techniques. Building characteristics such as height, size, year of construction, thawing degree days, and freezing degree days are extracted using open-source geospatial information in Google Earth Engine (GEE). These variables coupled with heating load forecasts from the AK Warm simulation program are used to train models that forecast heating loads on Alaska’s Railbelt utility grid. Our research greatly advances geospatial capability in this area and considerably informs the decarbonization activities currently in progress in Alaska. Authors: Madelyn Gaumer (University of Washington); Nick Bolten (Paul G. Allen School of Computer Science and Engineering, University of Washington); Vidisha Chowdhury (Heinz College of Information Systems and Public Policy, Carnegie Mellon University); Philippe Schicker (Heinz College of Information Systems and Public Policy, Carnegie Mellon University); Shamsi Soltani (Department of Epidemiology and Population Health, Stanford University School of Medicine); Erin D Trochim (University of Alaska Fairbanks) |
Classification, regression, and supervised learning Buildings Cities and urban planning Earth observations and monitoring Computer vision and remote sensing |
NeurIPS 2022 |
Interpretable Spatiotemporal Forecasting of Arctic Sea Ice Concentration at Seasonal Lead Times
(Proposals Track)
Abstract and authors: (click to expand)Abstract: There are many benefits from the accurate forecasting of Arctic sea ice, however existing models struggle to reliably predict sea ice concentration at long lead times. Many numerical models exist but can be sensitive to initial conditions, and while recent deep learning-based methods improve overall robustness, they either do not utilize temporal trends or rely on architectures that are not performant at learning long-term sequential dependencies. We propose a method of forecasting sea ice concentration using neural circuit policies, a form of continuous time recurrent neural architecture, which improve the learning of long-term sequential dependencies compared to existing techniques and offer the added benefits of adaptability to irregular sequence intervals and high interpretability. Authors: Matthew Beveridge (Independent Researcher); Lucas Pereira (ITI, LARSyS, Técnico Lisboa) |
Computer vision and remote sensing Climate science and climate modeling Earth observations and monitoring Earth science Oceans and marine systems Classification, regression, and supervised learning Interpretable ML Time-series analysis Uncertainty quantification and robustness |
NeurIPS 2022 |
CliMedBERT: A Pre-trained Language Model for Climate and Health-related Text
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Climate change is threatening human health in unprecedented orders and many ways. These threats are expected to grow unless effective and evidence-based policies are developed and acted upon to minimize or eliminate them. Attaining such a task requires the highest degree of the flow of knowledge from science into policy. The multidisciplinary, location-specific, and vastness of published science makes it challenging to keep track of novel work in this area, as well as making the traditional knowledge synthesis methods inefficient in infusing science into policy. To this end, we consider developing multiple domain-specific language models (LMs) with different variations from Climate- and Health-related information, which can serve as a foundational step toward capturing available knowledge to enable solving different tasks, such as detecting similarities between climate- and health-related concepts, fact-checking, relation extraction, evidence of health effects to policy text generation, and more. To our knowledge, this is the first work that proposes developing multiple domain-specific language models for the considered domains. We will make the developed models, resources, and codebase available for the researchers. Authors: Babak Jalalzadeh Fard (University of Nebraska Medical Center); Sadid A. Hasan (Microsoft); Jesse E. Bell (University of Nebraska Medical Center) |
Health Natural language processing |
NeurIPS 2022 |
Improving accuracy and convergence of federated learning edge computing methods for generalized DER forecasting applications in power grid
(Proposals Track)
Abstract and authors: (click to expand)Abstract: This proposal aims to develop more accurate federated learning (FL) methods with faster convergence properties and lower communication requirements, specifically for forecasting distributed energy resources (DER) such as renewables, energy storage, and loads in modern, low-carbon power grids. This will be achieved by (i) leveraging recently developed extensions of FL such as hierarchical and iterative clustering to improve performance with non-IID data, (ii) experimenting with different types of FL global models well-suited to time-series data, and (iii) incorporating domain-specific knowledge from power systems to build more general FL frameworks and architectures that can be applied to diverse types of DERs beyond just load forecasting, and with heterogeneous clients. Authors: Vineet Jagadeesan Nair (MIT) |
Power and energy systems Buildings Time-series analysis |
NeurIPS 2022 |
Personalizing Sustainable Agriculture with Causal Machine Learning
(Proposals Track)
Best Paper: Proposals
Abstract and authors: (click to expand)Abstract: To fight climate change and accommodate the increasing population, global crop production has to be strengthened. To achieve the "sustainable intensification" of agriculture, transforming it from carbon emitter to carbon sink is a priority, and understanding the environmental impact of agricultural management practices is a fundamental prerequisite to that. At the same time, the global agricultural landscape is deeply heterogeneous, with differences in climate, soil, and land use inducing variations in how agricultural systems respond to farmer actions. The "personalization" of sustainable agriculture with the provision of locally adapted management advice is thus a necessary condition for the efficient uplift of green metrics, and an integral development in imminent policies. Here, we formulate personalized sustainable agriculture as a Conditional Average Treatment Effect estimation task and use Causal Machine Learning for tackling it. Leveraging climate data, land use information and employing Double Machine Learning, we estimate the heterogeneous effect of sustainable practices on the field-level Soil Organic Carbon content in Lithuania. We thus provide a data-driven perspective for targeting sustainable practices and effectively expanding the global carbon sink. Authors: Georgios Giannarakis (National Observatory of Athens); Vasileios Sitokonstantinou (National Observatory of Athens); Roxanne Suzette Lorilla (National Observatory of Athens); Charalampos Kontoes (National Observatory of Athens) |
Causal and Bayesian methods Agriculture and food Carbon capture and sequestration Earth observations and monitoring Ecosystems and biodiversity Public policy Societal adaptation and resilience Classification, regression, and supervised learning Data mining |
NeurIPS 2022 |
Disaster Risk Monitoring Using Satellite Imagery
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: Natural disasters such as flood, wildfire, drought, and severe storms wreak havoc throughout the world, causing billions of dollars in damages, and uprooting communities, ecosystems, and economies. Unfortunately, flooding events are on the rise due to climate change and sea level rise. The ability to detect and quantify them can help us minimize their adverse impacts on the economy and human lives. Using satellites to study flood is advantageous since physical access to flooded areas is limited and deploying instruments in potential flood zones can be dangerous. We are proposing a hands-on tutorial to highlight the use of satellite imagery and computer vision to study natural disasters. Specifically, we aim to demonstrate the development and deployment of a flood detection model using Sentinel-1 satellite data. The tutorial will cover relevant fundamental concepts as well as the full development workflow of a deep learning-based application. We will include important considerations such as common pitfalls, data scarcity, augmentation, transfer learning, fine-tuning, and details of each step in the workflow. Importantly, the tutorial will also include a case study on how the application was used by authorities in response to a flood event. We believe this tutorial will enable machine learning practitioners of all levels to develop new technologies that tackle the risks posed by climate change. We expect to deliver the below learning outcomes: • Develop various deep learning-based computer vision solutions using hardware-accelerated open-source tools that are optimized for real-time deployment • Create an optimized pipeline for the machine learning development workflow • Understand different performance metrics for model evaluation that are relevant for real world datasets and data imbalances • Understand the public sector’s efforts to support climate action initiatives and point out where the audience can contribute Authors: Kevin Lee (NVIDIA); Siddha Ganju (NVIDIA); Edoardo Nemni (UNOSAT) |
Disaster management and relief Climate science and climate modeling Earth observations and monitoring Extreme weather Active learning Classification, regression, and supervised learning Computer vision and remote sensing Data mining Meta- and transfer learning |
NeurIPS 2022 |
Machine Learning for Predicting Climate Extremes
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: Climate change has led to a rapid increase in the occurrence of extreme weather events globally, including floods, droughts, and wildfires. In the longer term, some regions will experience aridification while others will risk sinking due to rising sea levels. Typically, such predictions are done via weather and climate models that simulate the physical interactions between the atmospheric, oceanic, and land surface processes that operate at different scales. Due to the inherent complexity, these climate models can be inaccurate or computationally expensive to run, especially for detecting climate extremes at high spatiotemporal resolutions. In this tutorial, we aim to introduce the participants to machine learning approaches for addressing two fundamental challenges. We will walk the participants through a hands-on tutorial for predicting climate extremes relating to temperature and precipitation in 2 setups: (1) temporal forecasting: the goal is to predict climate variables into the future (both direct single step approaches and iterative approaches that roll out the model for several timesteps), and (2) spatial downscaling: the goal is to learn a mapping that transforms low-resolution outputs of climate models into high-resolution regional forecasts. Through introductory presentations and colab notebooks, we aim to expose the participants to (a) APIs for accessing and navigating popular repositories that host global climate data, such as the Copernicus data store, (b) identifying relevant datasets, including auxiliary data (e.g., other climate variables such as geopotential), (c) scripts for downloading and preprocessing relevant datasets, (d) algorithms for training machine learning models, (d) metrics for evaluating model performance, and (e) visualization tools for both the dataset and predicted outputs. The coding notebooks will be in Python. No prior knowledge of climate science is required. Authors: Hritik Bansal (UCLA); Shashank Goel (University of California Los Angeles); Tung Nguyen (University of California, Los Angeles); Aditya Grover (UCLA) |
Climate science and climate modeling Earth observations and monitoring Earth science Extreme weather Classification, regression, and supervised learning Data mining Hybrid physical models |
NeurIPS 2022 |
FourCastNet: A practical introduction to a state-of-the-art deep learning global weather emulator
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: Accurate, reliable, and efficient means of forecasting global weather patterns are of paramount importance to our ability to mitigate and adapt to climate change. Currently, real-time weather forecasting requires repeated numerical simulation and data assimilation cycles on dedicated supercomputers, which restricts the ability to make reliable, high-resolution forecasts to a handful of organizations. However, recent advances in deep learning, specifically the FourCastNet model, have shown that data-driven approaches can forecast important atmospheric variables with excellent skill and comparable accuracy to standard numerical methods, but at orders-of-magnitude lower computational and energy cost during inference, enabling larger ensembles for better probabilistic forecasts. In this tutorial, we demonstrate various applications of FourCastNet for high-resolution global weather forecasting, with examples including real-time forecasts, uncertainty quantification for extreme events, and adaptation to specific variables or localized regions of interest. The tutorial will provide examples that will demonstrate the general workflow for formatting and working with global atmospheric data, running autoregressive inference to obtain daily global forecasts, saving/visualizing model predictions of atmospheric events such as hurricanes and atmospheric rivers, and computing quantitative evaluation metrics for weather models. The exercises will primarily use PyTorch and do not require detailed understanding of the climate and weather system. With this tutorial, we hope to equip attendees with basic knowledge about building deep learning-based weather model surrogates and obtaining forecasts of crucial atmospheric variables using these models. Authors: Jaideep Pathak (NVIDIA Corporation); Shashank Subramanian (Lawrence Berkeley National Laboratory); Peter Harrington (Lawrence Berkeley National Laboratory (Berkeley Lab)); Thorsten Kurth (Nvidia); Andre Graubner (Nvidia); Morteza Mardani (NVIDIA Corporation); David M. Hall (NVIDIA); Karthik Kashinath (Lawrence Berkeley National Laboratory); Anima Anandkumar (NVIDIA/Caltech) |
Earth science Climate science and climate modeling Extreme weather Classification, regression, and supervised learning Uncertainty quantification and robustness |
NeurIPS 2022 |
Automating the creation of LULC datasets for semantic segmentation
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: High resolution and accurate Land Use and Land Cover mapping (LULC) datasets are increasingly important and can be widely used in monitoring climate change impacts in agriculture, deforestation, and the carbon cycle. These datasets represent physical classifications of land types and spatial information over the surface of the Earth. These LULC datasets can be leveraged in a plethora of research topics and industries to mitigate and adapt to environmental changes. High resolution urban mappings can be used to better monitor and estimate building albedo and urban heat island impacts, and accurate representation of forests and vegetation can even be leveraged to better monitor the carbon cycle and climate change through improved land surface modelling. The advent of machine learning (ML) based CV techniques over the past decade provides a viable option to automate LULC mapping. One impediment to this has been the lack of large ML datasets. Large vector datasets for LULC are available, but can’t be used directly by ML practitioners due to a knowledge gap in transforming the input into a dataset of paired satellite images and segmentation masks. We demonstrate a novel end-to-end pipeline for LULC dataset creation that takes vector land cover data and provides a training-ready dataset. We will use Sentinel-2 satellite imagery and the European Urban Atlas LULC data. The pipeline manages everything from downloading satellite data, to creating and storing encoded segmentation masks and automating data checks. We then use the resulting dataset to train a semantic segmentation model. The aim of the pipeline is to provide a way for users to create their own custom datasets using various combinations of multispectral satellite and vector data. In addition to presenting the pipeline, we aim to provide an introduction to multispectral imagery, geospatial data and some of the challenges in using it for ML. Authors: Sambhav S Rohatgi (Spacesense.ai); Anthony Mucia (Spacesense.ai) |
Computer vision and remote sensing Cities and urban planning Climate science and climate modeling Forestry and other land use Classification, regression, and supervised learning Other |
AAAI FSS 2022 |
AI-Based Text Analysis for Evaluating Food Waste Policies
Abstract and authors: (click to expand)Abstract: Food waste is a major contributor to climate change, making the reduction of food waste one of the most important strategies to preserve threatened ecosystems and increase economic benefits. To evaluate the impact of food waste policies in this arena and provide actionable guidance to policymakers, we conducted an AI-based text analysis of food waste policy provisions. Specifically, we a) identified commonalities across state policy texts, b) clustered states by shared policy text, and c) examined relationships between state cluster memberships and food waste . This approach generated state clusters but demonstrated very limited convergent validity with policy ratings provided by subject matter experts and no predictive validity with food waste. We discuss the potential of using supervised machine learning to analyze food waste policy text as a next step. Authors: John Aitken (The MITRE Corporation), Denali Rao (The MITRE Corporation), Balca Alaybek (The MITRE Corporation), Amber Sprenger (The MITRE Corporation), Grace Mika (The MITRE Corporation), Rob Hartman (The MITRE Corporation) and Laura Leets (The MITRE Corporation) |
|
AAAI FSS 2022 |
Data-Driven Reduced-Order Model for Atmospheric CO2 Dispersion
Abstract and authors: (click to expand)Abstract: Machine learning frameworks have emerged as powerful tools for the enhancement of computational fluid dynamics simulations and the construction of reduced-order models (ROMs). The latter are particularly desired when their full-order counterparts portray multiple spatiotemporal features and demand high processing power and storage capacity, such as climate models. In this work, a ROM for CO2 dispersion across Earth‘s atmosphere was built from NASA’s gridded daily OCO-2 carbon dioxide assimilated dataset. For that, a proper orthogonal decomposition was performed, followed by a non-intrusive operator inference (OpInf). This scientific machine learning technique was capable of accurately representing and predicting the detailed CO2 concentration field for about one year ahead, with a normalized root-mean-square error below 5%. It suggests OpInf-based ROMs may be a reliable alternative for fast response climate-related predictions. Authors: Pedro Roberto Barbosa Rocha (IBM Research), Marcos Sebastião de Paula Gomes (Pontifical Catholic University of Rio de Janeiro), João Lucas de Sousa Almeida (IBM Research), Allan Moreira Carvalho (IBM Research) and Alberto Costa Nogueira Junior (IBM Research) |
|
AAAI FSS 2022 |
KnowUREnvironment: An Automated Knowledge Graph for Climate Change and Environmental Issues
Abstract and authors: (click to expand)Abstract: Despite climate change being one of the greatest threats to humanity, many people are still in denial or lack motivation for appropriate action. A structured source of knowledge can help increase public awareness while also helping crucial natural language understanding tasks such as information retrieval, question answering, and recommendation systems. We introduce KnowUREnvironment – a knowledge graph for climate change and related environmental issues, extracted from the scientific literature. We automatically identify 210,230 domain-specific entities/concepts and encode how these concepts are interrelated with 411,860 RDF triples backed up with evidence from the literature, without using any supervision or human intervention. Human evaluation shows our extracted triples are syntactically and factually correct (81.69% syntactic correctness and 75.85% precision). The proposed framework can be easily extended to any domain that can benefit from such a knowledge graph. Authors: Md Saiful Islam (University of Rochester), Adiba Proma (University of Rochester), Yilin Zhou (University of Rochester), Syeda Nahida Akter (Carnegie Mellon University), Caleb Wohn (University of Rochester) and Ehsan Hoque (University of Rochester) |
|
AAAI FSS 2022 |
Towards Generating Large Synthetic Phytoplankton Datasets for Efficient Monitoring of Harmful Algal Blooms
Abstract and authors: (click to expand)Abstract: Climate change is increasing the frequency and severity of harmful algal blooms (HABs), which cause significant fish deaths in aquaculture farms. This contributes to ocean pollution and greenhouse gas (GHG) emissions since dead fish are either dumped into the ocean or taken to landfills, which in turn negatively impacts the climate. Currently, the standard method to enumerate harmful algae and other phytoplankton is to manually observe and count them under a microscope. This is a time-consuming, tedious and error-prone process, resulting in compromised management decisions by farmers. Hence, automating this process for quick and accurate HAB monitoring is extremely helpful. However, this requires large and diverse datasets of phytoplankton images, and such datasets are hard to produce quickly. In this work, we explore the feasibility of generating novel high-resolution photorealistic synthetic phytoplankton images, containing multiple species in the same image, given a small dataset of real images. To this end, we employ Generative Adversarial Networks (GANs) to generate synthetic images. We evaluate three different GAN architectures: ProjectedGAN, FastGAN, and StyleGANv2 using standard image quality metrics. We empirically show the generation of high-fidelity synthetic phytoplankton images using a training dataset of only 961 real images. Thus, this work demonstrates the ability of GANs to create large synthetic datasets of phytoplankton from small training datasets, accomplishing a key step towards sustainable systematic monitoring of harmful algal blooms. Authors: Nitpreet Bamra (University of Waterloo), Vikram Voleti (Mila, University of Montreal), Alexander Wong (University of Waterloo) and Jason Deglint (University of Waterloo) |
|
AAAI FSS 2022 |
Generating physically-consistent high-resolution climate data with hard-constrained neural networks
Abstract and authors: (click to expand)Abstract: The availability of reliable, high-resolution climate and weather data is important to inform long-term decisions on climate adaptation and mitigation and to guide rapid responses to extreme events. Forecasting models are limited by computational costs and therefore often predict quantities at a coarse spatial resolution. Statistical downscaling can provide an efficient method of upsampling low-resolution data. In this field, deep learning has been applied successfully, often using methods from the super-resolution domain in computer vision. Despite often achieving visually compelling results, such models often violate conservation laws when predicting physical variables. In order to conserve important physical quantities, we develop methods that guarantee physical constraints are satisfied by a deep downscaling model while also increasing their performance according to traditional metrics. We introduce two ways of constraining the network: a renormalization layer added to the end of the neural network and a successive approach that scales with increasing upsampling factors. We show the applicability of our methods across different popular architectures and upsampling factors using ERA5 reanalysis data. Authors: Paula Harder (Fraunhofer Institute ITWM, Mila Quebec AI Institute), Qidong Yang (Mila Quebec AI Institute, New York University), Venkatesh Ramesh (Mila Quebec AI Institute, University of Montreal), Alex Hernandez-Garcia (Mila Quebec AI Institute, University of Montreal), Prasanna Sattigeri (IBM Research), Campbell D. Watson (IBM Research), Daniela Szwarcman (IBM Research) and David Rolnick (Mila Quebec AI Institute, McGill University). |
|
AAAI FSS 2022 |
Discovering Transition Pathways Towards Coviability with Machine Learning
Abstract and authors: (click to expand)Abstract: This paper presents our ongoing French-Brazilian collaborative project which aims at: (1) establishing a diagnosis of socio-ecological coviability for several sites of interest in Nordeste, the North-East region of Brazil (in the states of Paraiba, Ceara, Pernambuco, and Rio Grande do Norte known for their biodiversity hotspots and vulnerabilities to climate change) using advanced data science techniques for multisource and multimodal data fusion and (2) finding transition pathways towards coviability equilibrium using machine learning techniques. Data collected in the field by scientists, ecologists, local actors combined with volunteered information, pictures from smart-phones, and data available on-line from satellite imagery, social media, surveys, etc. can be used to compute various coviability indicators of interest for the local actors. These indicators are useful to characterize and monitor the socio-ecological coviability status along various dimensions of anthropization, human welfare, ecological and biodiversity balance, and ecosystem intactness and vulnerabilities. Authors: Laure Berti-Equille (IRD) and Rafael Raimundo (UFPB) |
|
AAAI FSS 2022 |
Wildfire Forecasting with Satellite Images and Deep Generative Model
Abstract and authors: (click to expand)Abstract: Wildfire prediction has been one of the most critical tasks that humanities want to thrive at. While it plays a vital role in protecting human life, it is also difficult because of its stochastic and chaotic properties. We tackled the problem by interpreting a series of wildfire images into a video and used it to anticipate how the fire would behave in the future. However, creating video prediction models that account for the inherent uncertainty of the future is challenging. The bulk of published attempts are based on stochastic image-autoregressive recurrent networks, which raise various performance and application difficulties such as computational cost and limited efficiency on massive datasets. Another possibility is to use entirely latent temporal models that combine frame synthesis with temporal dynamics. However, due to design and training issues, no such model for stochastic video prediction has yet been proposed in the literature. This paper addresses these issues by introducing a novel stochastic temporal model whose dynamics are driven in a latent space. It naturally predicts video dynamics by allowing our lighter, more interpretable latent model to beat previous state-of-the-art approaches on the GOES-16 dataset. Results are compared using various benchmarking models. Authors: Thai-Nam Hoang (University of Wisconsin - Madison), Sang Truong (Stanford University) and Chris Schmidt (University of Wisconsin - Madison) |
|
AAAI FSS 2022 |
From Ideas to Deployment - A Joint Industry-University Research Effort on Tackling Carbon Storage Challenges with AI
Abstract and authors: (click to expand)Abstract: Carbon capture and storage (CCS) offers a promising means for significant reductions in greenhouse gas emissions and climate change mitigation at a large scale. Modeling CO2 transport and pressure buildup is central to understanding the responses of geosystems after CO2 injection and assessing the suitability and safety of CO2 storage. However, numerical simulations of geological CO2 storage often suffer from its multi-physics nature and complex non-linear governing equations, and is further complicated by flexible injection designs including changes in injection rates, resulting in formidable computational costs. New ideas have emerged such as data-driven models to tackle such challenges but very few have been fully developed and deployed as reliable tools. With the joint efforts of industry and universities, we are currently working on a new mechanism of fostering cross-disciplinary collaboration, developing, deploying, and scaling data-driven tools for CCS. A deep learning suite that can act as an alternative to CCS variable rate injection simulation will be the first tool developed under this mechanism. Based on the surrogate model, optimal design of injection strategy under pressure buildup constraints will be enabled with machine learning. Authors: Junjie Xu (Tsinghua University), Jiesi Lei (Tsinghua University), Yang Li (Tsinghua University), Junfan Ren (College of Petroleum Engineering, China University of Petroleum (Beijing), Beijing, China), Jian Qiu (Product and Solution & Website Business Unit, Alibaba Cloud, Hangzhou, Zhejiang, China), Biao Luo (Product and Solution & Website Business Unit, Alibaba Cloud, Hangzhou, Zhejiang, China), Lei Xiao (Product and Solution & Website Business Unit, Alibaba Cloud, Hangzhou, Zhejiang, China) and Wenwen Zhou (Product and Solution & Website Business Unit, Alibaba Cloud, Hangzhou, Zhejiang, China) |
|
AAAI FSS 2022 |
NADBenchmarks - a compilation of Benchmark Datasets for Machine Learning Tasks related to Natural Disasters
Abstract and authors: (click to expand)Abstract: Climate change has increased the intensity, frequency, and duration of extreme weather events and natural disasters across the world. While the increased data on natural disasters improves the scope of machine learning(ML) for this field, progress is relatively slow. One bottleneck is the lack of benchmark datasets that would allow ML researchers to quantify their progress against a standard metric. The objective of this short paper is to explore the state of benchmark datasets for ML tasks related to natural disasters, categorizing the datasets according to the disaster management cycle. We compile a list of existing benchmark datasets that have been introduced in the past five years. We propose a web platform where researchers can search for benchmark datasets in this domain, and develop a preliminary version of such a platform using our compiled list. This paper is intended to aid researchers in finding benchmark datasets to train their ML models on, and provide general directions in for topics where they can contribute new benchmark datasets. Authors: Adiba Proma (University of Rochester), Md Saiful Islam (University of Rochester), Stela Ciko (University of Rochester), Raiyan Abdul Baten (University of Rochester) and Ehsan Hoque (University of Rochester) |
|
AAAI FSS 2022 |
Contrastive Learning for Climate Model Bias Correction and Super-Resolution
Abstract and authors: (click to expand)Abstract: Climate models often require post-processing in order to make accurate estimates of local climate risk. The most common post-processing applied is bias-correction and spatial resolution enhancement. However, the statistical methods typically used for this not only are incapable of capturing multivariate spatial correlation information but are also reliant on rich observational data often not available outside of developed countries, limiting their potential. Here we propose an alternative approach to this challenge based on a combination of image super resolution (SR) and contrastive learning generative adversarial networks (GANs). We benchmark performance against NASA’s flagship post-processed CMIP6 climate model product, NEX-GDDP. We find that our model successfully reaches a spatial resolution double that of NASA’s product while also achieving comparable or improved levels of bias correction in both daily precipitation and temperature. The resulting higher fidelity simulations of present and forward-looking climate can enable more local, accurate models of hazards like flooding, drought, and heatwaves. Authors: Tristan Ballard (Sust Global) and Gopal Erinjippurath (Sust Global) |
|
AAAI FSS 2022 |
Employing Deep Learning to Quantify Power Plant Greenhouse Gas Emissions via Remote Sensing Data
Abstract and authors: (click to expand)Abstract: Greenhouse gasses (GHG) emitted from fossil-fuel-burning power plants pose a global threat to climate and public health. GHG emissions degrade air quality and increase the frequency of natural disasters five-fold, causing 8.7 million deaths per year. Quantifying GHG emissions is crucial for the success of the planet. However, current methods to track emissions cost upwards of $520,000/plant. These methods are cost prohibitive for developing countries, and are not globally standardized, leading to inaccurate emissions reports from nations and companies. I developed a low-cost solution via an end-to-end deep learning pipeline that utilizes observations of emitted smoke plumes in satellite imagery to provide an accurate, precise system for quantifying power plant GHG emissions by segmentation of power plant smoke plumes, classification of the plant fossil fuels, and algorithmic prediction of power generation and CO2 emissions. The pipeline was able to achieve a segmentation Intersection Over Union (IoU) score of 0.841, fuel classification accuracy of 92%, and quantify power generation and CO2 emission rates with R2 values of 0.852 and 0.824 respectively. The results of this work serve as a step toward the low-cost monitoring and detection of major sources of GHG emissions, helping limit their catastrophic impacts on climate and our planet. Authors: Aryan Jain (Amador Valley High School) |
|
AAAI FSS 2022 |
ClimateBert: A Pretrained Language Model for Climate-Related Text
Abstract and authors: (click to expand)Abstract: Over the recent years, large pretrained language models (LM) have revolutionized the field of natural language processing (NLP). However, while pretraining on general language has been shown to work very well for common language, it has been observed that niche language poses problems. In particular, climate-related texts include specific language that common LMs can not represent accurately. We argue that this shortcoming of today's LMs limits the applicability of modern NLP to the broad field of text processing of climate-related texts. As a remedy, we propose ClimateBert, a transformer-based language model that is further pretrained on over 2 million paragraphs of climate-related texts, crawled from various sources such as common news, research articles, and climate reporting of companies. We find that ClimateBert leads to a 48% improvement on a masked language model objective which, in turn, leads to lowering error rates by 3.57% to 35.71% for various climate-related downstream tasks like text classification, sentiment analysis, and fact-checking Authors: Nicolas Webersinke (FAU Erlangen-Nürnberg), Mathias Kraus (FAU Erlangen-Nürnberg), Julia Anna Bingler (ETH Zurich) and Markus Leippold (UZH Zurich) |
|
AAAI FSS 2022 |
Curator: Creating Large-Scale Curated Labelled Datasets using Self-Supervised Learning
Abstract and authors: (click to expand)Abstract: Applying Machine learning to domains like Earth Sciences is impeded by the lack of labeled data, despite a large corpus of raw data available in such domains. For instance, training a wildfire classifier on satellite imagery requires curating a massive and diverse dataset, which is an expensive and time-consuming process that can span from weeks to months. Searching for relevant examples in over 40 petabytes of unlabelled data requires researchers to manually hunt for such images, much like finding a needle in a haystack. We present a no-code end-to-end pipeline, Curator, which dramatically minimizes the time taken to curate an exhaustive labeled dataset. Curator is able to search massive amounts of unlabelled data by combining self-supervision, scalable nearest neighbor search, and active learning to learn and differentiate image representations. The pipeline can also be readily applied to solve problems across different domains. Overall, the pipeline makes it practical for researchers to go from just one reference image to a comprehensive dataset in a diminutive span of time. Authors: Tarun Narayanan (SpaceML), Ajay Krishnan (SpaceML), Anirudh Koul (Pinterest, SpaceML, FDL) and Siddha Ganju (NVIDIA, SpaceML, FDL) |
|
AAAI FSS 2022 |
De-risking Carbon Capture and Sequestration with Explainable CO2 Leakage Detection in Time-lapse Seismic Monitoring Images
Abstract and authors: (click to expand)Abstract: With the growing global deployment of carbon capture and sequestration technology to combat climate change, monitoring and detection of potential CO2 leakage through existing or storage induced faults are critical to the safe and long-term viability of the technology. Recent work on time-lapse seismic monitoring of CO2 storage has shown promising results in its ability to monitor the growth of the CO2 plume from surface recorded seismic data. However, due to the low sensitivity of seismic imaging to CO2 concentration, additional developments are required to efficiently interpret the seismic images for leakage. In this work, we introduce a binary classification of time-lapse seismic images to delineate CO2 plumes (leakage) using state-of-the-art deep learning models. Additionally, we localize the leakage region of CO2 plumes by leveraging Class Activation Mapping (CAM) methods. Authors: Huseyin Tuna Erdinc (Georgia Institute of Technology), Abhinav Prakash Gahlot (Georgia Institute of Technology), Ziyi Yin (Georgia Institute of Technology), Mathias Louboutin (Georgia Institute of Technology) and Felix J. Herrmann (Georgia Institute of Technology) |
|
AAAI FSS 2022 |
Predicting Wildfire Risk Under Novel 21st-Century Climate Conditions
Abstract and authors: (click to expand)Abstract: Wildfires are one of the most impactful hazards associated with climate change, and in a hotter, drier world, wildfires will be much more common than they have historically been. However, the exact severity and frequency of future wildfires are difficult to estimate, because climate change will create novel combinations of vegetation and fire weather outside what has been historically observed. This provides a challenge for AI-based approaches to long-term fire risk modeling, as much future fire risk is outside of the available feature space provided by the historical record. Here, we give an overview of this problem that is inherent to many climate change impacts and propose a restricted model form that makes monotonic and interpretable predictions in novel fire weather environments. We then show how our model outperforms other neural networks and logistic regression models when making predictions on unseen data from a decade into the future. Authors: Matthew Cooper (Sust Global). |
|
AAAI FSS 2022 |
Probabilistic Machine Learning in Polar Earth and Climate Science: A Review of Applications and Opportunities
Abstract and authors: (click to expand)Abstract: Our world’s climate future is on thin ice. The study of longterm weather patterns in the polar regions is an important building block in tackling Climate Change. Our understanding of the past, the present and the future of the earth system, and the inherent uncertainty, informs planning, mitigation, and adaptation strategies. In this work we review previous applications of machine learning and statistical computing to polar climate research, and we highlight promising probabilistic machine learning methods that address the modelling needs of climate-related research in the Arctic and the Antarctic. We discuss common challenges in this interdisciplinary field and provide an overview of opportunities for future work in this novel area of research. Authors: Kim Bente (The University of Sydney), Judy Kay (The University of Sydney) and Roman Marchant (Commonwealth Scientific and Industrial Research Organisation (CSIRO)) |
|
AAAI FSS 2022 |
Rethinking Machine Learning for Climate Science: A Dataset Perspective
Abstract and authors: (click to expand)Abstract: The growing availability of data sources is a predominant factor enabling the widespread success of machine learning (ML) systems across a wide range of applications. Typically, training data in such systems constitutes a source of ground-truth, such as measurements about a physical object (e.g., natural images) or a human artifact (e.g., natural language). In this position paper, we take a critical look at the validity of this assumption for datasets for climate science. We argue that many such climate datasets are uniquely biased due to the pervasive use of external simulation models (e.g., general circulation models) and proxy variables (e.g., satellite measurements) for imputing and extrapolating in-situ observational data. We discuss opportunities for mitigating the bias in the training and deployment of ML systems using such datasets. Finally, we share views on improving the reliability and accountability of ML systems for climate science applications. Authors: Aditya Grover (UCLA) |
|
AAAI FSS 2022 |
Graph Representation Learning for Energy Demand Data: Application to Joint Energy System Planning under Emissions Constraints
Abstract and authors: (click to expand)Abstract: To meet the mid-century goal of CO2 emissions reduction requires a rapid transformation of current electric power and natural gas (NG) infrastructure. This necessitates a long-term planning of the joint power-NG system under representative demand patterns, operational constraints, and policy considerations. Our work is motivated by the computational and practical challenges associated with solving the generation and transmission expansion problem (GTEP) for joint planning of power-NG systems. Specifically, we focus on efficiently extracting a set of representative days from power and NG demand data in respective networks and using this set to reduce the computational burden required to solve the GTEP. We propose a Graph Autoencoder for Multiple time resolution Energy Systems (GAMES) to capture the spatio-temporal demand patterns in interdependent networks and account for differences in the temporal resolution of available data. The resulting embeddings are used in a clustering algorithm to select representative days. We evaluate the effectiveness of our approach in solving a GTEP formulation calibrated for the joint power-NG system in New England. This formulation accounts for the physical interdependencies between power and NG systems, including the joint emissions constraint. Our results show that the set of representative days obtained from GAMES not only allows us to tractably solve the GTEP formulation, but also achieves a lower cost of implementing the joint planning decisions. Authors: Aron Brenner (MIT), Rahman Khorramfar (MIT), Dharik Mallapragada (MIT) and Saurabh Amin (MIT) |
|
AAAI FSS 2022 |
Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N
Abstract and authors: (click to expand)Abstract: Comprehensive global cooperation is essential to limit global temperature increases while continuing economic development, e.g., reducing severe inequality or achieving long-term economic growth. Achieving long-term cooperation on climate change mitigation with n strategic agents poses a complex game-theoretic problem. For example, agents may negotiate and reach climate agreements, but there is no central authority to enforce adherence to those agreements. Hence, it is critical to design negotiation and agreement frameworks that foster cooperation, allow all agents to meet their individual policy objectives, and incentivize long-term adherence. This is an interdisciplinary challenge that calls for collaboration between researchers in machine learning, economics, climate science, law, policy, ethics, and other fields. In particular, we argue that machine learning is a critical tool to address the complexity of this domain. To facilitate this research, here we introduce RICE-N, a multi-region integrated assessment model that simulates the global climate and economy, and which can be used to design and evaluate the strategic outcomes for different negotiation and agreement frameworks. We also describe how to use multi-agent reinforcement learning to train rational agents using RICE-N. This framework underpins AI for Global Climate Cooperation, a working group collaboration and competition on climate negotiation and agreement design. Here, we invite the scientific community to design and evaluate their solutions using RICE-N, machine learning, economic intuition, and other domain knowledge. More information can be found on www.ai4climatecoop.org. Authors: Tianyu Zhang (Université de Montréal, MILA), Andrew Williams (Université de Montréal, MILA), Soham Phade (Salesforce Research), Sunil Srinivasa (Salesforce Research), Yang Zhang (MILA), Prateek Gupta (MILA, University of Oxford, The Alan Turing Institute), Yoshua Bengio (Université de Montréal, MILA, CIFAR) and Stephan Zheng (Salesforce Research) |
|
AAAI FSS 2022 |
Self-Supervised Representations of Geo-located Weather Time Series - an Evaluation and Analysis
Abstract and authors: (click to expand)Abstract: Self-supervised learning (SSL) is gaining traction in various domains, and demonstrated their potential particularly in tasks where labelled data is limited and costly to collect. In this work, we evaluate the performance existing self-supervised multivariate time series learning algorithms on weather-driven applications involving regression, classification and forecasting tasks. We experiment with a two-step protocol. In the first step, we employ and SSL algorithm and learn generic weather representations from multivariate weather data. Then, in the next step, we use these representations and fine-tune them for multiple downstream tasks. Through our initial experiments on air quality prediction and renewable energy generation tasks, we highlight the benefits of self-supervised weather representations, including improved performance in such tasks, ability to generalize with limited in-task data, and reduction in training time and carbon emissions. We expect such a direction to be relevant in multiple weather-driven applications supporting climate change mitigation and adaptation efforts. Authors: Arjun Ashok (IBM Research), Devyani Lambhate (IBM Research) and Jitendra Singh (IBM Research) |
|
AAAI FSS 2022 |
Predicting Daily Ozone Air Pollution With Transformers
Abstract and authors: (click to expand)Abstract: Surface ozone is an air pollutant that contributes to hundreds of thousands of premature deaths annually. Ozone at the surface also has considerable negative impacts on vegetation and crop yields. Ozone concentrations are affected by environmental factors, including temperature, which means that ozone concentrations are likely to change in future climates, posing risks to human health. This effect is known as the ozone climate penalty, and recent work suggests that currently polluted areas are likely to become more polluted by ozone in future climates. In light of recent stricter WHO regulations on surface ozone concentrations, we aim to build a predictive data-driven model for recent ozone concentrations, which could be used to make predictions of ozone concentrations in future climates, better quantifying future risks to human health and gaining insight into the variables driving ozone concentrations. We use observational station data from three European countries to train a transformer-based model to make predictions of daily maximum 8-hour ozone. Authors: Sebastian Hickman (University of Cambridge), Paul Griffiths (University of Cambridge), Peer Nowack (University of East Anglia) and Alex Archibald (University of Cambridge) |
|
AAAI FSS 2022 |
The Impact of TCFD Reporting - A New Application of Zero-Shot Analysis to Climate-Related Financial Disclosures
Abstract and authors: (click to expand)Abstract: We examine climate-related disclosures in 3,335 reports based on a sample of 188 banks that officially endorsed the recommendations of the Task Force for Climate-related Financial Disclosures (TCFD). In doing so, we introduce a new application of zero-shot text classification based on the BART model and a MNLI task. By developing a set of robust and fine-grained labels, we show that zero-shot analysis provides high accuracy in analyzing companies’ climate-related reporting without further model training. We are able to demonstrate that banks that support the TCFD increase their level of disclosure after officially declaring their support for the guidelines, although we also find significant differences depending on the topic of disclosure. Our findings yield important conclusions for the design of climate-related disclosures. Authors: Alix Auzepy (Justus-Liebig-Universität Gießen), Elena Tönjes (Justus-Liebig-Universität Gießen) and Christoph Funk (Justus-Liebig-Universität Gießen) |
|
AAAI FSS 2022 |
Using Natural Language Processing for Automating the Identification of Climate Action Interlinkages within the Sustainable Development Goals
Abstract and authors: (click to expand)Abstract: Climate action, Goal 13 of the UN Sustainable Development Goals (SDG), cuts across almost all SDGs. Achieving climate goals can reinforce the achievements in many other goals, but at the same time climate mitigation and adaptation measures may generate trade-offs, such as levelling the cost of energy and transitioning away from fossil fuels. Leveraging the synergies and minimizing the trade-offs among the climate goals and other SDGs is an imperative task for ensuring policy coherence. Understanding the interlinkages between climate action and other SDGs can help inform about the synergies and trade-offs. This paper presents a novel methodology by using natural language processing (NLP) to automate the process of systematically identifying the key interlinkages between climate action and SDGs from a large amount of climate literature. A qualitative SDG interlinkages model for climate action was automatically generated and visualized in a network graph. This work contributes to the conference thematic topic on using AI for policy alignment for climate change goals, SDGs and associated environmental, social and governance (ESG) frameworks. Authors: Xin Zhou (Institute for Global Environmental Strategies (IGES)), Kshitij Jain (Google Inc.), Mustafa Moinuddin (Institute for Global Environmental Strategies (IGES)) and Patrick McSharry (Carnegie Mellon University Africa; Oxford Man Institute of Quantitative Finance, Oxford University) |
|
AAAI FSS 2022 |
Imputation of Missing Streamflow Data at Multiple Gauging Stations in Benin Republic
Abstract and authors: (click to expand)Abstract: Streamflow observation data is vital for flood monitoring, agricultural, and settlement planning. However, such streamflow data are commonly plagued with missing observations due to various causes such as harsh environmental conditions and constrained operational resources. This problem is often more pervasive in under-resourced areas such as Sub-Saharan Africa. In this work, we reconstruct streamflow time series data through bias correction of the GEOGloWS ECMWF streamflow service (GESS) forecasts at ten river gauging stations in Benin. We perform bias correction by fitting Quantile Mapping, Gaussian Process, and Elastic Net regression in a constrained training period. We show by simulating missingness in a testing period that GESS forecasts have a significant bias that results in low predictive skill over the ten Beninese stations. Our findings suggest that overall bias correction by Elastic Net and Gaussian Process regression achieves superior skill relative to traditional imputation by Random Forest, k-Nearest Neighbour, and GESS lookup. The findings of this work provide a basis for integrating global GESS streamflow data into operational early-warning decision-making systems (e.g., flood alert) in countries vulnerable to drought and flooding due to extreme weather events. Authors: Rendani Mbuvha (Queen Mary University of London), Julien Yise Peniel Adounkpe (International Water Management Institute (IWMI)), Wilson Tsakane Mongwe (University of Johannesburg), Mandela Houngnibo (Agence Nationale de la Météorologie du Benin Meteo Benin), Nathaniel Newlands (Summerland Research and Development Centre, Agriculture and Agri-Food Canada) and Tshilidzi Marwala (University of Johannesburg) |
|
AAAI FSS 2022 |
Intermediate and Future Frame Prediction of Geostationary Satellite Imagery With Warp and Refine Network
Abstract and authors: (click to expand)Abstract: Geostationary satellite imagery has applications in climate and weather forecasting, planning natural energy resources, and predicting extreme weather events. For precise and accurate prediction, higher spatial and temporal resolution of geostationary satellite imagery is important. Although recent geostationary satellite resolution has improved, the long-term analysis of climate applications is limited to using multiple satellites from the past to the present due to the different resolutions. To solve this problem, we proposed warp and refine network (WR-Net). WR-Net is divided into an optical flow warp component and a warp image refinement component.We used the TV-L1 algorithm instead of deep learning-based approaches to extract the optical flow warp component. The deep-learning-based model is trained on the human-centric view of the RGB channel and does not work on geostationary satellites, which is gray-scale one-channel imagery.The refinement network refines the warped image through a multi-temporal fusion layer. We evaluated WR-Net by interpolation of temporal resolution at 4 min intervals to 2 min intervals in large-scale GK2A geostationary meteorological satellite imagery. Furthermore, we applied WR-Net to the future frame prediction task and showed that the explicit use of optical flow can help future frame prediction. Authors: Minseok Seo (SI Analytics), Yeji Choi (SI Analytics), Hyungon Ryu (NVIDIA), Heesun Park (National Institute of Meteorological Science), Hyungkun Bae (SI Analytics), Hyesook Lee (National Institute of Meteorological Science) and Wanseok Seo (NVIDIA) |
|
AAAI FSS 2022 |
Machine Learning Methods in Climate Finance: A Systematic Review
Abstract and authors: (click to expand)Abstract: Preventing the materialization of climate change is one of the main challenges of our time. The involvement of the financial sector is a fundamental pillar in this task, which has led to the emergence of a new field in the literature, climate finance. In turn, the use of Machine Learning (ML) as a tool to analyze climate finance is on the rise, due to the need to use big data to collect new climate-related information and model complex non-linear relationships. Considering the potential for the use of ML in climate finance and the proliferation of articles in this field, we propose a survey of the academic literature to assess how ML is enabling climate finance to scale up. The contribution of this paper is threefold. First, we do a systematic search in three scientific databases to assemble a corpus of relevant studies. Using topic modeling (Latent Dirichlet Allocation) we uncover representative thematic clusters. This allows us to statistically identify seven granular application domains where ML is playing a significant role in climate finance literature: natural hazards, biodiversity, agricultural risk, carbon markets, energy economics, ESG factors & investing, and climate data. Secondly, we do an analysis highlighting publication trends; and thirdly, we show a breakdown of ML methods applied by research area, aiming to spur further innovative work from ML experts. Authors: Andres Alonso-Robisco (Banco de España), Jose Manuel Carbo (Banco de España) and Jose Manuel Marques (Banco de España) |
|
NeurIPS 2021 |
Flood Segmentation on Sentinel-1 SAR Imagery with Semi-Supervised Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Floods wreak havoc throughout the world, causing billions of dollars in damages, and uprooting communities, ecosystems and economies. The NASA Impact Emerging Techniques in Computational Intelligence (ETCI) competition on Flood Detection tasked participants with predicting flooded pixels after training with synthetic aperture radar (SAR) images in a supervised setting. We propose a semi-supervised learning pseudo-labeling scheme that derives confidence estimates from U-Net ensembles, thereby progressively improving accuracy. Concretely, we use a cyclical approach involving multiple stages (1) training an ensemble model of multiple U-Net architectures with the provided high confidence hand-labeled data and, generated pseudo labels or low confidence labels on the entire unlabeled test dataset, and then, (2) filter out quality generated labels and, (3) combine the generated labels with the previously available high confidence hand-labeled dataset. This assimilated dataset is used for the next round of training ensemble models. This cyclical process is repeated until the performance improvement plateaus. Additionally, we post process our results with Conditional Random Fields. Our approach sets a high score, and a new state-of-the-art on the Sentinel-1 dataset for the ETCI competition with 0.7654 IoU, an impressive improvement over the 0.60 IOU baseline. Our method, which we release with all the code including trained models, can also be used as an open science benchmark for the Sentinel-1 released dataset. Authors: Siddha Ganju (Nvidia Corporation); Sayak Paul (Carted) |
Unsupervised and semi-supervised learning Disaster prediction, management, and relief Earth science and monitoring Computer vision and remote sensing |
NeurIPS 2021 |
Short-term Solar Irradiance Prediction from Sky Images
(Papers Track)
Abstract and authors: (click to expand)Abstract: Solar irradiance forecasting is essential for the integration of the solar power into the power grid system while maintaining its stability. This paper focuses on short-term solar irradiance forecasting (upto 4-hour ahead-of-time prediction) from a past sky image sequence. Most existing work aims for the prediction of the most likely future of the solar irradiance. While it is likely deterministic for intra-hourly prediction, the future solar irradiance is naturally diverse over a relatively long-term horizon (>1h). We therefore introduce approaches to deterministic and stochastic predictions to capture the most likely as well as the diverse future of the solar irradiance. To enable the autoregressive prediction capability of the model, we proposed deep neural networks to predict the future sky images in a deterministic as well as stochastic manner. We evaluate our approaches on benchmark datasets and demonstrate that our approaches achieve superior performance. Authors: Hoang Chuong Nguyen (Australia National University); Miaomiao Liu (The Australian National University) |
Computer vision and remote sensing Power and energy |
NeurIPS 2021 |
Towards Representation Learning for Atmospheric Dynamics
(Papers Track)
Abstract and authors: (click to expand)Abstract: The prediction of future climate scenarios under anthropogenic forcing is critical to understand climate change and to assess the impact of potentially counter-acting technologies. Machine learning and hybrid techniques for this prediction rely on informative metrics that are sensitive to pertinent but often subtle influences. For atmospheric dynamics, a critical part of the climate system, no well established metric exists and visual inspection is currently still often used in practice. However, this ``eyeball metric'' cannot be used for machine learning where an algorithmic description is required. Motivated by the success of intermediate neural network activations as basis for learned metrics, e.g. in computer vision, we present a novel, self-supervised representation learning approach specifically designed for atmospheric dynamics. Our approach, called AtmoDist, trains a neural network on a simple, auxiliary task: predicting the temporal distance between elements of a randomly shuffled sequence of atmospheric fields (e.g. the components of the wind field from reanalysis or simulation). The task forces the network to learn important intrinsic aspects of the data as activations in its layers and from these hence a discriminative metric can be obtained. We demonstrate this by using AtmoDist to define a metric for GAN-based super resolution of vorticity and divergence. Our upscaled data matches both visually and in terms of its statistics a high resolution reference closely and it significantly outperform the state-of-the-art based on mean squared error. Since AtmoDist is unsupervised, only requires a temporal sequence of fields, and uses a simple auxiliary task, it has the potential to be of utility in a wide range of applications. Authors: Sebastian Hoffmann (University of Magdeburg); Christian Lessig (Otto-von-Guericke-Universitat Magdeburg) |
Climate and Earth science Earth science and monitoring Generative modeling Time-series analysis |
NeurIPS 2021 |
Addressing Deep Learning Model Uncertainty in Long-Range Climate Forecasting with Late Fusion
(Papers Track)
Abstract and authors: (click to expand)Abstract: Global warming leads to the increase in frequency and intensity of climate extremes that cause tremendous loss of lives and property. Accurate long-range climate prediction allows more time for preparation and disaster risk management for such extreme events. Although machine learning approaches have shown promising results in long-range climate forecasting, the associated model uncertainties may reduce their reliability. To address this issue, we propose a late fusion approach that systematically combines the predictions from multiple models to reduce the expected errors of the fused results. We also propose a network architecture with the novel denormalization layer to gain the benefits of data normalization without actually normalizing the data. The experimental results on long-range 2m temperature forecasting show that the framework outperforms the 30-year climate normals, and the accuracy can be improved by increasing the number of models. Authors: Ken C. L. Wong (IBM Research – Almaden Research Center); Hongzhi Wang (IBM Almaden Research Center); Etienne E Vos (IBM); Bianca Zadrozny (IBM Research); Campbell D Watson (IBM Reserch); Tanveer Syeda-Mahmood (IBM Research) |
Uncertainty quantification and robustness Climate and Earth science Classification, regression, and supervised learning Time-series analysis |
NeurIPS 2021 |
Predicting Discharge in Catchment Outlet Using Deep Learning: Case Study of the Ansongo-Niamey Basin
(Papers Track)
Abstract and authors: (click to expand)Abstract: Hydrological models are one of the key challenges in hydrology. Their goal is to understand, predict and manage water resources. Most of the hydrological models so far were either physical or conceptual models. But in the past two decades, fully data-driven (empirical) models started to emerge with the breakthroughs of novel deep learning methods in runoff prediction. These breakthroughs were mostly favored by the large volume, variety and velocity of water-related data. Long Short-Term Memory and Gated Recurrent Unit neural networks, particularly achieved the outstanding milestone of outperforming classic hydrological models in less than a decade. Moreover, they have the potential to change the way hydrological modeling is performed. In this study, precipitation, minimal and maximum temperature at the Ansongo-Niamey basin combined with the discharge at Ansongo and Kandadji were used to predict the discharge at Niamey using artificial neural networks. After data preprocessing and hyperparameter optimization, the deep learning models performed well with the LSTM and GRU respectively scoring a Nash-Sutcliffe Efficiency of 0.933 and 0.935. This performance matches those of well-known physically-based models used to simulate Niamey’s discharge and therefore demonstrates the efficiency of deep learning methods in a West African context, especially in Niamey which has been facing severe floods due to climate change. Authors: Peniel J. Y. Adounkpe (WASCAL); Eric Alamou (Université d'Abomey-Calavi); Belko Diallo (WASCAL); Abdou Ali (AGRHYMET Regional Centre) |
Disaster prediction, management, and relief Causal and Bayesian methods Classification, regression, and supervised learning Time-series analysis |
NeurIPS 2021 |
Accurate and Timely Forecasts of Geologic Carbon Storage using Machine Learning Methods
(Papers Track)
Abstract and authors: (click to expand)Abstract: Carbon capture and storage is one strategy to reduce greenhouse gas emissions. One approach to storing the captured CO2 is to inject it into deep saline aquifers. However, dynamics of the injected CO2 plume is uncertain and the potential for leakage back to the atmosphere must be assessed. Thus, accurate and timely forecasts of CO2 storage via real-time measurements integration becomes very crucial. This study proposes a learning-based, inverse-free prediction method that can accurately and rapidly forecast CO2 movement and distribution with uncertainty quantification based on limited simulation and observation data. The machine learning techniques include dimension reduction, multivariate data analysis, and Bayesian learning. The outcome is expected to provide CO2 storage site operators with an effective tool for real-time decision making. Authors: Dan Lu (Oak Ridge National Laboratory); Scott Painter (Oak Ridge National Laboratory); Nicholas Azzolina (University of North Dakota); Matthew Burton-Kelly (University of North Dakota) |
Carbon capture and sequestration Uncertainty quantification and robustness |
NeurIPS 2021 |
Towards debiasing climate simulations using unsuperviserd image-to-image translation networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate models form the basis of a vast portion of earth system research, and inform our climate policy. Due to the complex nature of our climate system, and the approximations which must necessarily be made in simulating it, these climate models may not perfectly match observations. For further research, these outputs must be bias corrected against observations, but current methods of debiasing do not take into account spatial correlations. We evaluate unsupervised image-to-image translation networks, specifically the UNIT model architecture, for their ability to produce more spatially realistic debiasing than the standard techniques used in the climate community. Authors: James Fulton (University of Edinburgh); Ben Clarke (Oxford University) |
Climate and Earth science Generative modeling |
NeurIPS 2021 |
Using Non-Linear Causal Models to Study Aerosol-Cloud Interactions in the Southeast Pacific
(Papers Track)
Abstract and authors: (click to expand)Abstract: Aerosol-cloud interactions include a myriad of effects that all begin when aerosol enters a cloud and acts as cloud condensation nuclei (CCN). An increase in CCN results in a decrease in the mean cloud droplet size (r$_{e}$). The smaller droplet size leads to brighter, more expansive, and longer lasting clouds that reflect more incoming sunlight, thus cooling the earth. Globally, aerosol-cloud interactions cool the Earth, however the strength of the effect is heterogeneous over different meteorological regimes. Understanding how aerosol-cloud interactions evolve as a function of the local environment can help us better understand sources of error in our Earth system models, which currently fail to reproduce the observed relationships. In this work we use recent non-linear, causal machine learning methods to study the heterogeneous effects of aerosols on cloud droplet radius. Authors: Andrew Jesson (University of Oxford); Peter Manshausen (University of Oxford); Alyson Douglas (University of Oxford); Duncan Watson-Parris (University of Oxford); Yarin Gal (University of Oxford); Philip Stier (University of Oxford) |
Climate and Earth science Causal and Bayesian methods |
NeurIPS 2021 |
Memory to Map: Improving Radar Flood Maps With Temporal Context and Semantic Segmentation
(Papers Track)
Abstract and authors: (click to expand)Abstract: Global flood risk has increased due to worsening extreme weather events and human migration into growing flood-prone areas. Accurate, high-resolution, and near-real time flood maps can address flood risk by reducing financial loss and damage. We propose Model to Map, a novel machine learning approach that utilizes bi-temporal context to improve flood water segmentation performance for Sentinel-1 imagery. We show that the inclusion of unflooded context for the area, or "memory," allows the model to tap into a "prior state" of pre-flood conditions, increasing performance in geographic regions in which single-image radar-based flood mapping methods typically underperform (e.g. deserts). We focus on accuracy across different biomes to ensure global performance. Our experiments and novel data processing technique show that the confluence of pre-flood and permanent water context provides a 21% increase in mIoU over the baseline overall, and over 87% increase in deserts. Authors: Veda Sunkara (Cloud to Street); Nicholas Leach (Cloud to Street); Siddha Ganju (Nvidia) |
Computer vision and remote sensing Climate justice Earth science and monitoring |
NeurIPS 2021 |
Hurricane Forecasting: A Novel Multimodal Machine Learning Framework
(Papers Track)
Abstract and authors: (click to expand)Abstract: This paper describes a machine learning (ML) framework for tropical cyclone intensity and track forecasting, combining multiple distinct ML techniques and utilizing diverse data sources. Our framework, which we refer to as Hurricast (HURR), is built upon the combination of distinct data processing techniques using gradient-boosted trees and novel encoder-decoder architectures, including CNN, GRU and Transformers components. We propose a deep-learning feature extractor methodology to mix spatial-temporal data with statistical data efficiently. Our multimodal framework unleashes the potential of making forecasts based on a wide range of data sources, including historical storm data, and visual data such as reanalysis atmospheric images. We evaluate our models with current operational forecasts in North Atlantic (NA) and Eastern Pacific (EP) basins on 2016-2019 for 24-hour lead time, and show our models consistently outperform statistical-dynamical models and compete with the best dynamical models. Furthermore, the inclusion of Hurricast into an operational forecast consensus model leads to a significant improvement of 5% - 15% over NHC's official forecast, thus highlighting the complementary properties with existing approaches. Authors: Léonard Boussioux (MIT, CentraleSupélec); Cynthia Zeng (MIT); Dimitris Bertsimas (MIT); Théo J Guenais (Harvard University) |
Hybrid physical models Climate and Earth science Disaster prediction, management, and relief Classification, regression, and supervised learning Computer vision and remote sensing Time-series analysis Unsupervised and semi-supervised learning |
NeurIPS 2021 |
Improved Drought Forecasting Using Surrogate Quantile And Shape (SQUASH) Loss
(Papers Track)
Abstract and authors: (click to expand)Abstract: Droughts are amongst the most damaging natural hazard with cascading impacts across multiple sectors of the economy and society. Improved forecasting of drought conditions ahead of time can significantly improve strategic planning to mitigate the impacts and enhance resilience. Though significant progress in forecasting approaches has been made, the current approaches focus on the overall improvement of the forecast, with less attention on the extremeness of drought events. In this paper, we focus on improving the accuracy of forecasting extreme and severe drought events by introducing a novel loss function Surrogate Quantile and Shape loss (SQUASH) that combines weighted quantile loss and dynamic time-warping-based shape loss. We show the effectiveness of the proposed loss functions for imbalanced time-series drought forecasting tasks on two regions in India and the USA. Authors: Devyani Lambhate Lambhate (Indian Institute of Science); Smit Marvaniya (IBM Research India); Jitendra Singh (IBM Research - India); David Gold (IBM) |
Disaster prediction, management, and relief Agriculture, forestry and other land use Climate and Earth science Time-series analysis |
NeurIPS 2021 |
Global ocean wind speed estimation with CyGNSSnet
(Papers Track)
Abstract and authors: (click to expand)Abstract: The CyGNSS (Cyclone Global Navigation Satellite System) satellite system measures GNSS signals reflected off the Earth's surface. A global ocean wind speed dataset is derived, which fills a gap in Earth observation data, will improve cyclone forecasting, and could be used to mitigate effects of climate change. We propose CyGNSSnet, a deep learning model for predicting wind speed from CyGNSS observables, and evaluate its potential for operational use. With CyGNSSnet, performance improves by 29\% over the current operational model. We further introduce a hierarchical model, that combines an extreme value classifier and a specialized CyGNSSnet and slightly improves predictions for high winds. Authors: Caroline Arnold (German Climate Computing Center); Milad Asgarimehr (German Research Centre for Geosciences) |
Computer vision and remote sensing Disaster prediction, management, and relief Earth science and monitoring Classification, regression, and supervised learning |
NeurIPS 2021 |
Predicting Critical Biogeochemistry of the Southern Ocean for Climate Monitoring
(Papers Track)
Abstract and authors: (click to expand)Abstract: The Biogeochemical-Argo (BGC-Argo) program is building a network of globally distributed, sensor-equipped robotic profiling floats, improving our understanding of the climate system and how it is changing. These floats, however, are limited in the number of variables measured. In this study, we train neural networks to predict silicate and phosphate values in the Southern Ocean from temperature, pressure, salinity, oxygen, nitrate, and location and apply these models to earth system model (ESM) and BGC-Argo data to expand the utility of this ocean observation network. We trained our neural networks on observations from the Global Ocean Ship-Based Hydrographic Investigations Program (GO-SHIP) and use dropout regularization to provide uncertainty bounds around our predicted values. Our neural network significantly improves upon linear regression but shows variable levels of uncertainty across the ranges of predicted variables. We explore the generalization of our estimators to test data outside our training distribution from both ESM and BGC-Argo data. Our use of out-of-distribution test data to examine shifts in biogeochemical parameters and calculate uncertainty bounds around estimates advance the state-of-the-art in oceanographic data and climate monitoring. We make our data and code publicly available. Authors: Ellen Park (MIT); Jae Deok Kim (MIT-WHOI); Nadege Aoki (MIT); Yumeng Cao (MIT); Yamin Arefeen (Massachusetts Institute of Technology); Matthew Beveridge (Massachusetts Institute of Technology); David P Nicholson (Woods Hole Oceanographic Institution); Iddo Drori (MIT) |
Earth science and monitoring Ecosystems and natural systems Classification, regression, and supervised learning Other |
NeurIPS 2021 |
On the Generalization of Agricultural Drought Classification from Climate Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is expected to increase the likelihood of drought events, with severe implications for food security. Unlike other natural disasters, droughts have a slow onset and depend on various external factors, making drought detection in climate data difficult. In contrast to existing works that rely on simple relative drought indices as ground-truth data, we build upon soil moisture index (SMI) obtained from a hydrological model. This index is directly related to insufficiently available water to vegetation. Given ERA5-Land climate input data of six months with land use information from MODIS satellite observation, we compare different models with and without sequential inductive bias in classifying droughts based on SMI. We use PR-AUC as the evaluation measure to account for the class imbalance and obtain promising results despite a challenging time-based split. We further show in an ablation study that the models retain their predictive capabilities given input data of coarser resolutions, as frequently encountered in climate models. Authors: Julia Gottfriedsen (1Deutsches Zentrum für Luft- und Raumfahrt (DLR), LMU); Max Berrendorf (Ludwig-Maximilians-Universität München); Pierre Gentine (Columbia University); Markus Reichstein (Max Planck Institute for Biogeochemistry, Jena; Michael Stifel Center Jena for Data-Driven and Simulation Science, Jena); Katja Weigel (niversity of Bremen, Institute of Environmental Physics (IUP), Bremen, Germany); Birgit Hassler (Deutsches Zentrum für Luft- und Raumfahrt e.V. (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany); Veronika Eyring (Deutsches Zentrum für Luft- und Raumfahrt e.V. (DLR), Institut für Physik der Atmosphäre, Oberpfaffenhofen, Germany; University of Bremen, Institute of Environmental Physics (IUP), Bremen, Germany) |
Climate and Earth science Agriculture, forestry and other land use Disaster prediction, management, and relief |
NeurIPS 2021 |
Mapping Post-Climate Change Biogeographical Regions with Deep Latent Variable Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Forecasting future changes to biodiversity due to shifts in climate is challenging due to nonlinear interactions between species as recorded in their presence/absence data. This work proposes using variational autoencoders with environmental covariates to identify low-dimensional structure in species’ joint co-occurrence patterns and leveraging this simplified representation to provide multivariate predictions of their habitat extent under future climate scenarios. We pursue a latent space clustering approach to map biogeographical regions of frequently co-occurring species and apply this methodology to a dataset from northern Belgium, generating predictive maps illustrating how these regions may expand or contract with changing temperature under a future climate scenario. Authors: Christopher Krapu (Oak Ridge National Lab - Oak Ridge, TN) |
Generative modeling Agriculture, forestry and other land use Causal and Bayesian methods |
NeurIPS 2021 |
Rotation Equivariant Deforestation Segmentation and Driver Classification
(Papers Track)
Abstract and authors: (click to expand)Abstract: Deforestation has become a significant contributing factor to climate change and, due to this, both classifying the drivers and predicting segmentation maps of deforestation has attracted significant interest. In this work, we develop a rotation equivariant convolutional neural network model to predict the drivers and generate segmentation maps of deforestation events from Landsat 8 satellite images. This outperforms previous methods in classifying the drivers and predicting the segmentation map of deforestation, offering a 9% improvement in classification accuracy and a 7% improvement in segmentation map accuracy. In addition, this method predicts stable segmentation maps under rotation of the input image, which ensures that predicted regions of deforestation are not dependent upon the rotational orientation of the satellite. Authors: Joshua Mitton (University of Glasgow); Roderick Murray-Smith (University of Glasgow) |
Computer vision and remote sensing Classification, regression, and supervised learning |
NeurIPS 2021 |
WiSoSuper: Benchmarking Super-Resolution Methods on Wind and Solar Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: The transition to green energy grids depends on detailed wind and solar forecasts to optimize the siting and scheduling of renewable energy generation. Operational forecasts from numerical weather prediction models, however, only have a spatial resolution of 10 to 20-km, which leads to sub-optimal usage and development of renewable energy farms. Weather scientists have been developing super-resolution methods to increase the resolution, but often rely on simple interpolation techniques or computationally expensive differential equation-based models. Recently, machine learning-based models, specifically the physics-informed resolution-enhancing generative adversarial network (PhIREGAN), have outperformed traditional downscaling methods. We provide a thorough and extensible benchmark of leading deep learning-based super-resolution techniques, including the enhanced super-resolution generative adversarial network (ESRGAN) and an enhanced deep super-resolution (EDSR) network, on wind and solar data. We accompany the benchmark with a novel public, processed, and machine learning-ready dataset for benchmarking super-resolution methods on wind and solar data. Authors: Rupa Kurinchi-Vendhan (Caltech); Björn Lütjens (MIT); Ritwik Gupta (University of California, Berkeley); Lucien D Werner (California Institute of Technology); Dava Newman (MIT); Steven Low (California Institute of Technology) |
Computer vision and remote sensing Climate and Earth science Earth science and monitoring Power and energy Classification, regression, and supervised learning Generative modeling |
NeurIPS 2021 |
Meta-Learned Bayesian Optimization for Calibrating Building Simulation Models with Multi-Source Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Well-calibrated building simulation models are key to reducing greenhouse gas emissions and optimizing building performance. Current calibration algorithms do not leverage data collected during previous calibration tasks. In this paper, we employ attentive neural processes (ANP) to meta-learn a distribution using multi-source data acquired during previously seen calibration tasks. The ANP informs a meta-learned Bayesian optimizer to accelerate calibration of new, unseen tasks. The few-shot nature of our proposed algorithm is demonstrated on a library of residential buildings validated by the United States Department of Energy (USDoE). Authors: Sicheng Zhan (NUS); Gordon Wichern (Mitsubishi Electric Research Laboratories (MERL)); Christopher Laughman (Mitsubishi Electric Research Laboratories); Ankush Chakrabarty (Mitsubishi Electric Research Labs) |
Buildings and cities Meta- and transfer learning |
NeurIPS 2021 |
MS-nowcasting: Operational Precipitation Nowcasting with Convolutional LSTMs at Microsoft Weather
(Papers Track)
Abstract and authors: (click to expand)Abstract: We present the encoder-forecaster convolutional long short-term memory (LSTM) deep-learning model that powers Microsoft Weather's operational precipitation nowcasting product. This model takes as input a sequence of weather radar mosaics and deterministically predicts future radar reflectivity at lead times up to 6 hours. By stacking a large input receptive field along the feature dimension and conditioning the model's forecaster with predictions from the physics-based High Resolution Rapid Refresh (HRRR) model, we are able to outperform optical flow and HRRR baselines by 20-25% on multiple metrics averaged over all lead times. Authors: Sylwester Klocek (Microsoft Corporation); Haiyu Dong (Microsoft); Matthew Dixon (Microsoft Corporation); Panashe Kanengoni (Microsoft Corporation); Najeeb Kazmi (Microsoft); Pete Luferenko (Microsoft Corporation); Zhongjian Lv (Microsoft Corporation); Shikhar Sharma (); Jonathan Weyn (Microsoft); Siqi Xiang (Microsoft Corporation) |
Computer vision and remote sensing Climate and Earth science Disaster prediction, management, and relief Earth science and monitoring Classification, regression, and supervised learning Hybrid physical models |
NeurIPS 2021 |
SunCast: Solar Irradiance Nowcasting from Geosynchronous Satellite Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: When cloud layers cover photovoltaic (PV) panels, the amount of power the panels produce fluctuates rapidly. Therefore, to maintain enough energy on a power grid to match demand, utilities companies rely on reserve power sources that typically come from fossil fuels and therefore pollute the environment. Accurate short-term PV power prediction enables operators to maximize the amount of power obtained from PV panels and safely reduce the reserve energy needed from fossil fuel sources. While several studies have developed machine learning models to predict solar irradiance at specific PV generation facilities, little work has been done to model short-term solar irradiance on a global scale. Furthermore, models that have been developed are proprietary and have architectures that are not publicly available or rely on computationally demanding Numerical Weather Prediction (NWP) models. Here, we propose a Convolutional Long Short-Term Memory Network model that treats solar nowcasting as a next frame prediction problem, is more efficient than NWP models and has a straightforward, reproducible architecture. Our models can predict solar irradiance for entire North America for up to 3 hours in under 60 seconds on a single machine without a GPU and has a RMSE of 120 W/m^2 when evaluated on 2 months of data. Authors: Dhileeban Kumaresan (UC Berkeley); Richard Wang (UC Berkeley); Ernesto A Martinez (UC Berkeley); Richard Cziva (UC Berkeley); Alberto Todeschini (UC Berkeley); Colorado J Reed (University of California, Berkeley); Puya Vahabi (UC Berkeley) |
Power and energy Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2021 |
Synthetic Imagery Aided Geographic Domain Adaptation for Rare Energy Infrastructure Detection in Remotely Sensed Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Object detection in remotely sensed data is frequently stymied by applications in geographies that are different from that of the training data. When objects are rare, the problem is exacerbated further. This is true of assessments of energy infrastructure such as generation, transmission, and end-use consumption; key to electrification planning as well as for effective assessment of natural disaster impacts which are varying in frequency and intensity due to climate change. We propose an approach to domain adaptation that requires only unlabeled samples from the target domain and generates synthetic data to augment training data for targeted domain adaptation. This approach is shown to work consistently across four geographically diverse domains, improving object detection average precision by 15.5\% on average for small sample sizes. Authors: Wei Hu (Duke University); Tyler Feldman (Duke University); Eddy Lin (Duke University); Jose Luis Moscoso (Duke); Yanchen J Ou (Duke University); Natalie Tarn (Duke University); Baoyan Ye (Duke University); Wendy Zhang (Duke University); Jordan Malof (Duke University); Kyle Bradbury (Duke University) |
Computer vision and remote sensing Earth science and monitoring Power and energy Classification, regression, and supervised learning |
NeurIPS 2021 |
Being the Fire: A CNN-Based Reinforcement Learning Method to Learn How Fires Behave Beyond the Limits of Physics-Based Empirical Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Wildland fires pose an increasing threat in light of anthropogenic climate change. Fire-spread models play an underpinning role in many areas of research across this domain, from emergency evacuation to insurance analysis. We study paths towards advancing such models through deep reinforcement learning. Aggregating 21 fire perimeters from the Western United States in 2017, we construct 11-layer raster images representing the state of the fire area. A convolution neural network based agent is trained offline on one million sub-images to create a generalizable baseline for predicting the best action - burn or not burn - given the then-current state on a particular fire edge. A series of online, TD(0) Monte Carlo Q-Learning based improvements are made with final evaluation conducted on a subset of holdout fire perimeters. We examine the performance of the learned agent/model against the FARSITE fire-spread model. We also make available a novel data set and propose more informative evaluation metrics for future progress. Authors: William L Ross (Stanford) |
Disaster prediction, management, and relief Societal adaptation Computer vision and remote sensing Reinforcement learning and control |
NeurIPS 2021 |
Subseasonal Solar Power Forecasting via Deep Sequence Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: To help mitigate climate change, power systems need to integrate renewable energy sources, such as solar, at a rapid pace. Widespread integration of solar energy into the power system requires major improvements in solar irradiance forecasting, in order to reduce the uncertainty associated with solar power output. While recent works have addressed short lead-time forecasting (minutes to hours ahead), week(s)-ahead and longer forecasts, coupled with uncertainty estimates, will be extremely important for storage applications in future power systems. In this work, we propose machine learning approaches for these longer lead-times as an important new application area in the energy domain. We demonstrate the potential of several deep sequence learning techniques for both point predictions and probabilistic predictions at these longer lead-times. We compare their performance for subseasonal forecasting (forecast lead-times of roughly two weeks) using the SURFRAD data set for 7 stations across the U.S. in 2018. The results are encouraging; the deep sequence learning methods outperform the current benchmark for machine learning-based probabilistic predictions (previously applied at short lead-times in this domain), along with relevant baselines. Authors: Saumya Sinha (University of Colorado, Boulder); Bri-Mathias S Hodge (University of Colorado Boulder); Claire Monteleoni (University of Colorado Boulder) |
Power and energy Time-series analysis |
NeurIPS 2021 |
A Transfer Learning-Based Surrogate Model for Geological Carbon Storage with Multi-Fidelity Training Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Geologic carbon storage (GCS) entails injecting large volumes of carbon dioxide (CO2) in deep geologic formations to prevent its release to the atmosphere. Reservoir simulation is widely used in GCS applications to predict subsurface pressure and CO2 saturation. High fidelity numerical models are prohibitively expensive for data assimilation and uncertainty quantification, which require a large number of simulation runs. Deep learning-based surrogate models have shown a great promise to alleviate the high computational cost. However, the training cost is high as thousands of high-fidelity simulations are often necessary for generating the training data. In this work, we explore the use of a transfer learning approach to reduce the training cost. Compared with the surrogate model trained with high-fidelity simulations, our new transfer learning-based model shows comparable accuracy but reduces the training cost by 80%. Authors: Su Jiang (Stanford University) |
Carbon capture and sequestration Computer vision and remote sensing |
NeurIPS 2021 |
National Cropland Classification with Agriculture Census Information and EO Datasets
(Papers Track)
Abstract and authors: (click to expand)Abstract: National cropland classification is critical to monitor food security, comprehend environmental circumstances and climate change, and participate in agricultural policy development. The increasing earth observation datasets, especially the free available Sentinel and Landsat, open unprecedented large-scale mapping opportunities. However, most applied machine learning techniques have relied on substantial training datasets, which are not always available and may be expensive to create or collect. Focusing on Japan, this work indicates what kinds of information can be extracted from agriculture census information then used for mapping different crop types. Different classification approaches of pixel-based and parcel-based are compared. Then, the efficient method is used to generate Japan's first national cropland classification with Sentinel-1 C-band and Landsat-8 time series. For 2015, the overall accuracies for the prefectures range between 71\% and 94\%. This national cropland classification map, which particularly succeeds in extracting high-precision rice products for the whole of Japan and other classes for different prefectures, can be treated as the base map of Japan for future studies related to agriculture, environment, and climate change. Authors: Junshi Xia (RIKEN); Naoto Yokoya (The University of Tokyo); Bruno Adriano (RIKEN Center for Advanced Intelligence Project (AIP)) |
Agriculture, forestry and other land use Classification, regression, and supervised learning |
NeurIPS 2021 |
FIgLib & SmokeyNet: Dataset and Deep Learning Model for Real-Time Wildland Fire Smoke Detection
(Papers Track)
Abstract and authors: (click to expand)Abstract: The size and frequency of wildland fires in the western United States have dramatically increased in recent years. On high fire-risk days, a small fire ignition can rapidly grow and get out of control. Early detection of fire ignitions from initial smoke can assist response to such fires before they become difficult to manage. Past deep learning approaches for wildfire smoke detection have suffered from small or unreliable datasets that make it difficult to extrapolate performance to real-world scenarios. In this work, we present the Fire Ignition Library (FIgLib), a publicly-available dataset of nearly 25,000 labeled wildfire smoke images as seen from fixed-view cameras deployed in Southern California. We also introduce SmokeyNet, a novel deep learning architecture using spatio-temporal information from camera imagery for real-time wildfire smoke detection. When trained on the FIgLib dataset, SmokeyNet outperforms comparable baselines. We hope that the availability of the FIgLib dataset and the SmokeyNet architecture will inspire further research into deep learning methods for wildfire smoke detection, leading to automated notification systems to reduce the time to wildfire response. Authors: Anshuman Dewangan (University of California, San Diego); Mai Nguyen (University of California, San Diego); Garrison Cottrell (UC San Diego) |
Disaster prediction, management, and relief Computer vision and remote sensing |
NeurIPS 2021 |
Multitask Learning for Estimating Power Plant Greenhouse Gas Emissions from Satellite Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: The burning of fossil fuels produces large amounts of carbon dioxide (CO2), a major Greenhouse Gas (GHG) and a main driver of Climate Change. Quantifying GHG emissions is crucial for accurate predictions of climate effects and to enforce emission trading schemes. The reporting of such emissions is only required in some countries, resulting in insufficient global coverage. In this work, we propose an end-to-end method to predict power generation rates for fossil fuel power plants from satellite images based on which we estimate GHG emission rates. We present a multitask deep learning approach able to simultaneously predict: (i) the pixel-area covered by plumes from a single satellite image of a power plant, (ii) the type of fired fuel, and (iii) the power generation rate. We then convert the predicted power generation rate into estimates for the rate at which CO2 is being emitted. Experimental results show that our model approach allows us to estimate the power generation rate of a power plant to within 139 MW (MAE, for a mean sample power plant capacity of 1177 MW) from a single satellite image and CO2 emission rates to within 311 t/h. This multitask learning approach improves the power generation estimation MAE by 39 % compared to a single-task network trained on the same dataset. Authors: Joëlle Hanna (University of St. Gallen); Michael Mommert (University of St. Gallen); Linus M. Scheibenreif (University of St. Gallen); Damian Borth (University of St. Gallen) |
Computer vision and remote sensing Power and energy |
NeurIPS 2021 |
ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Numerical simulations of Earth's weather and climate require substantial amounts of computation. This has led to a growing interest in replacing subroutines that explicitly compute physical processes with approximate machine learning (ML) methods that are fast at inference time. Within weather and climate models, atmospheric radiative transfer (RT) calculations are especially expensive. This has made them a popular target for neural network-based emulators. However, prior work is hard to compare due to the lack of a comprehensive dataset and standardized best practices for ML benchmarking. To fill this gap, we introduce the \climart dataset, with more than \emph{10 million samples from present, pre-industrial, and future climate conditions}. ClimART poses several methodological challenges for the ML community, such as multiple out-of-distribution test sets, underlying domain physics, and a trade-off between accuracy and inference speed. We also present several novel baselines that indicate shortcomings of the datasets and network architectures used in prior work. Authors: Salva Rühling Cachay (Technical University of Darmstadt); Venkatesh Ramesh (MILA); Jason N. S. Cole (Environment and Climate Change Canada); Howard Barker (Environment and Climate Change Canada); David Rolnick (McGill University, Mila) |
Climate and Earth science Classification, regression, and supervised learning |
NeurIPS 2021 |
A GNN-RNN Approach for Harnessing Geospatial and Temporal Information: Application to Crop Yield Prediction
(Papers Track)
Best Paper: ML Innovation
Abstract and authors: (click to expand)Abstract: Climate change poses new challenges to agricultural production, as crop yields are extremely sensitive to climatic variation. Accurately predicting the effects of weather patterns on crop yield is crucial for addressing issues such as food insecurity, supply stability, and economic planning. Recently, there have been many attempts to use machine learning models for crop yield prediction. However, these models either restrict their tasks to a relatively small region or a short time-period (e.g. a few years), which makes them hard to generalize spatially and temporally. They also view each location as an i.i.d sample, ignoring spatial correlations in the data. In this paper, we introduce a novel graph-based recurrent neural network for crop yield prediction, which incorporates both geographical and temporal structure. Our method is trained, validated, and tested on over 2000 counties from 41 states in the US mainland, covering years from 1981 to 2019. As far as we know, this is the first machine learning method that embeds geographical knowledge in crop yield prediction and predicts crop yields at the county level nationwide. Experimental results show that our proposed method consistently outperforms a wide variety of existing state-of-the-art methods, validating the effectiveness of geospatial and temporal information. Authors: Joshua Fan (Cornell University); Junwen Bai (Cornell University); Zhiyun Li (Cornell University); Ariel Ortiz-Bobea (Cornell); Carla P Gomes (Cornell University) |
Agriculture, forestry and other land use Classification, regression, and supervised learning Time-series analysis |
NeurIPS 2021 |
Learned Benchmarks for Subseasonal Forecasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: We develop a subseasonal forecasting toolkit of simple learned benchmark models that outperform both operational practice and state-of-the-art machine learning and deep learning methods. Our new models include (a) Climatology++, an adaptive alternative to climatology that, for precipitation, is 9% more accurate and 250\% more skillful than the United States operational Climate Forecasting System (CFSv2); (b) CFSv2++, a learned CFSv2 correction that improves temperature and precipitation accuracy by 7-8% and skill by 50-275%; and (c) Persistence++, an augmented persistence model that combines CFSv2 forecasts with lagged measurements to improve temperature and precipitation accuracy by 6-9% and skill by 40-130%. Across the contiguous U.S., these models consistently outperform standard meteorological baselines, state-of-the-art learning methods, and the European Centre for Medium-Range Weather Forecasts ensemble. Overall, we find that augmenting traditional forecasting approaches with learned enhancements yields an effective and computationally inexpensive strategy for building the next generation of subseasonal forecasting benchmarks. Authors: Soukayna Mouatadid (University of Toronto); Paulo Orenstein (IMPA); Genevieve E Flaspohler (MIT); Miruna Oprescu (Microsoft Research); Judah Cohen (AER); Franklyn Wang (Harvard University); Sean Knight (MIT); Maria Geogdzhayeva (MIT); Sam Levang (Salient Predictions Inc.); Ernest Fraenkel (MIT); Lester Mackey (Microsoft Research) |
Climate and Earth science Classification, regression, and supervised learning |
NeurIPS 2021 |
Emissions-aware electricity network expansion planning via implicit differentiation
(Papers Track)
Abstract and authors: (click to expand)Abstract: We consider a variant of the classical problem of designing or expanding an electricity network. Instead of minimizing only investment and production costs, however, we seek to minimize some mixture of cost and greenhouse gas emissions, even if the underlying dispatch model does not tax emissions. This enables grid planners to directly minimize consumption-based emissions, when expanding or modifying the grid, regardless of whether or not the carbon market incorporates a carbon tax. We solve this problem using gradient descent with implicit differentiation, a technique recently popularized in machine learning. To demonstrate the method, we optimize transmission and storage resources on the IEEE 14-bus test network and compare our solution to one generated by standard planning with a carbon tax. Our solution significantly reduces emissions for the same levelized cost of electricity. Authors: Anthony Degleris (Stanford University); Lucas Fuentes (Stanford); Abbas El Gamal (Stanford University); Ram Rajagopal (Stanford University) |
Power and energy Climate policy Other |
NeurIPS 2021 |
Amortized inference of Gaussian process hyperparameters for improved concrete strength trajectory prediction
(Papers Track)
Abstract and authors: (click to expand)Abstract: Designing and utilizing alternative concrete formulations which supplant the use of ordinary portland cement with alternative binders have been identified as central goals in reducing the greenhouse gas impact of the concrete industry. Given the variability in availability and quality of alternatives, these goals call for an optimal design of experiment approach to designing formulations, which can be adapted to local needs. The realization of this goal hinges on an ability to predict key properties. Here, we present and benchmark a Gaussian process (GP) model for predicting the trajectory of concrete strength, an essential performance measure. GPs are a desirable model class for the application because of their ability to estimate uncertainty and update predictions given additional data. In this work, rather than manually tuning hyperparameters for different concrete mix models, we propose a new method based on amortized inference leveraging mixture attributes, leading to models which are better fit for use in Bayesian optimization of concrete formulation. We demonstrate the success of the approach using a large, industrial concrete dataset. Authors: Kristen Severson (Microsoft Research); Olivia Pfeiffer (MIT); Jie Chen (IBM Research); Kai Gong (MIT); Jeremy Gregory (Massachusetts Institute of Technology); Richard Goodwin (IBM Research); Elsa Olivetti (Massachusetts Institute of Technology) |
Causal and Bayesian methods Industry |
NeurIPS 2021 |
A data integration pipeline towards reliable monitoring of phytoplankton and early detection of harmful algal blooms
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is making oceans warmer and more acidic. Under these conditions phytoplankton can produce harmful algal blooms which cause rapid oxygen depletion and consequent death of marine plants and animals. Some species are even capable of releasing toxic substances endangering water quality and human health. Monitoring of phytoplankton and early detection of harmful algal blooms is essential for protection of marine flaura and fauna. Recent technological advances have enabled in-situ plankton image capture in real-time at low cost. However, available phytoplankton image databases have several limitations that prevent the practical usage of artificial intelligent models. We present a pipeline for integration of heterogeneous phytoplankton image datasets from around the world into a unified database that can ultimately serve as a benchmark dataset for phytoplankton research and therefore act as an important tool in building versatile machine learning models for climate adaptation planning. A machine learning model for early detection of harmful algal blooms is part of ongoing work. Authors: Bruna Guterres (Universidade Federal do Rio Grande - FURG); Sara khalid (University of Oxford); Marcelo Pias (Federal University of Rio Grande); Silvia Botelho (Federal University of Rio Grande) |
Disaster prediction, management, and relief Ecosystems and natural systems Classification, regression, and supervised learning |
NeurIPS 2021 |
Identifying Distributional Differences in Convective Evolution Prior to Rapid Intensification in Tropical Cyclones
(Papers Track)
Abstract and authors: (click to expand)Abstract: Tropical cyclone (TC) intensity forecasts are issued by human forecasters who evaluate spatio-temporal observations (e.g., satellite imagery) and model output (e.g., numerical weather prediction, statistical models) to produce forecasts every 6 hours. Within these time constraints, it can be challenging to draw insight from such data. While high-capacity machine learning methods are well suited for prediction problems with complex sequence data, extracting interpretable scientific information with such methods is difficult. Here we leverage powerful AI prediction algorithms and classical statistical inference to identify patterns in the evolution of TC convective structure leading up to the rapid intensification of a storm, hence providing forecasters and scientists with key insight into TC behavior. Authors: Irwin H McNeely (Carnegie Mellon University); Galen Vincent (Carnegie Mellon University); Rafael Izbicki (UFSCar); Kimberly Wood (Mississippi State University); Ann B. Lee (Carnegie Mellon University) |
Disaster prediction, management, and relief Interpretable ML |
NeurIPS 2021 |
Predicting Atlantic Multidecadal Variability
(Papers Track)
Best Paper: Pathway to Impact
Abstract and authors: (click to expand)Abstract: Atlantic Multidecadal Variability (AMV) describes variations of North Atlantic sea surface temperature with a typical cycle of between 60 and 70 years. AMV strongly impacts local climate over North America and Europe, therefore prediction of AMV, especially the extreme values, is of great societal utility for understanding and responding to regional climate change. This work tests multiple machine learning models to improve the state of AMV prediction from maps of sea surface temperature, salinity, and sea level pressure in the North Atlantic region. We use data from the Community Earth System Model 1 Large Ensemble Project, a state-of-the-art climate model with 3,440 years of data. Our results demonstrate that all of the models we use outperform the traditional persistence forecast baseline. Predicting the AMV is important for identifying future extreme temperatures and precipitation as well as hurricane activity, in Europe and North America up to 25 years in advance. Authors: Glenn Liu (Massachusetts Institute of Technology); Peidong Wang (MIT); Matthew Beveridge (Massachusetts Institute of Technology); Young-Oh Kwon (Woods Hole Oceanographic Institution); Iddo Drori (MIT) |
Climate and Earth science Classification, regression, and supervised learning Other |
NeurIPS 2021 |
Identifying the atmospheric drivers of drought and heat using a smoothed deep learning approach
(Papers Track)
Abstract and authors: (click to expand)Abstract: Europe was hit by several, disastrous heat and drought events in recent summers. Besides thermodynamic influences, such hot and dry extremes are driven by certain atmospheric situations including anticyclonic conditions. Effects of climate change on atmospheric circulations are complex and many open research questions remain in this context, e.g., on future trends of anticyclonic conditions. Based on the combination of a catalog of labeled circulation patterns and spatial atmospheric variables, we propose a smoothed convolutional neural network classifier for six types of anticyclonic circulations that are associated with drought and heat. Our work can help to identify important drivers of hot and dry extremes in climate simulations, which allows to unveil the impact of climate change on these drivers. We address various challenges inherent to circulation pattern classification that are also present in other climate patterns, e.g., subjective labels and unambiguous transition periods. Authors: Magdalena Mittermeier (Ludwig-Maximilians-Universität München); Maximilian Weigert (Ludwig-Maximilians-Universität München); David Ruegamer (LMU Munich) |
Climate and Earth science Classification, regression, and supervised learning |
NeurIPS 2021 |
Learning to identify cracks on wind turbine blade surfaces using drone-based inspection images
(Papers Track)
Abstract and authors: (click to expand)Abstract: Wind energy is expected to be one of the leading ways to achieve the goals of the Paris Agreement but it in turn heavily depends on effective management of its operations and maintenance (O&M) costs. Blade failures account for one-third of all O&M costs thus making accurate detection of blade damages, especially cracks, very important for sustained operations and cost savings. Traditionally, damage inspection has been a completely manual process thus making it subjective, error-prone, and time-consuming. Hence in this work, we bring more objectivity, scalability, and repeatability in our damage inspection process, using deep learning, to miss fewer cracks. We build a deep learning model trained on a large dataset of blade damages, collected by our drone-based inspection, to correctly detect cracks. Our model is already in production and has processed more than a million damages with a recall of 0.96. We also focus on model interpretability using class activation maps to get a peek into the model workings. The model not only performs as good as human experts but also better in certain tricky cases. Thus, in this work, we aim to increase wind energy adoption by decreasing one of its major hurdles - the O&M costs resulting from missing blade failures like cracks. Authors: Akshay B Iyer (SkySpecs, Inc.); Linh V Nguyen (SkySpecs Inc); Shweta Khushu (SkySpecs Inc.) |
Power and energy Climate finance and economics Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2021 |
Evaluating Pretraining Methods for Deep Learning on Geophysical Imaging Datasets
(Papers Track)
Abstract and authors: (click to expand)Abstract: Machine learning has the potential to automate the analysis of vast amounts of raw geophysical data, allowing scientists to monitor changes in key aspects of our climate such as cloud cover in real-time and at fine spatiotemporal scales. However, the lack of large labeled training datasets poses a significant barrier for effectively applying machine learning to these applications. Transfer learning, which involves first pretraining a neural network on an auxiliary “source” dataset and then finetuning on the “target” dataset, has been shown to improve accuracy for machine learning models trained on small datasets. Across prior work on machine learning for geophysical imaging, different choices are made about what data to pretrain on, and the impact of these choices on model performance is unclear. To address this, we systematically explore various settings of transfer learning for cloud classification, cloud segmentation, and aurora classification. We pretrain on different source datasets, including the large ImageNet dataset as well as smaller geophysical datasets that are more similar to the target datasets. We also experiment with multiple transfer learning steps where we pretrain on more than one source dataset. Despite the smaller source datasets’ similarity to the target datasets, we find that pretraining on the large, general-purpose ImageNet dataset yields significantly better results across all of our experiments. Transfer learning is especially effective for smaller target datasets, and in these cases, using multiple source datasets can give a marginal added benefit. Authors: James Chen (Kirby School) |
Computer vision and remote sensing Earth science and monitoring Meta- and transfer learning |
NeurIPS 2021 |
An Automated System for Detecting Visual Damages of Wind Turbine Blades
(Papers Track)
Abstract and authors: (click to expand)Abstract: Wind energy’s ability to compete with fossil fuels on a market level depends on lowering wind’s high operational costs. Since damages on wind turbine blades are the leading cause for these operational problems, identifying blade damages is critical. However, recent works in visual identification of blade damages are still experimental and focus on optimizing the traditional machine learning metrics such as IoU. In this paper, we argue that pushing models to production long before achieving the "optimal" model performance can still generate real value for this use case. We discuss the performance of Blue, our damage’s suggestion model in production and how this system works in coordination with humans as part of a commercialized product and how it can contribute towards lowering wind energy’s operational costs. Authors: Linh V Nguyen (SkySpecs Inc); Akshay B Iyer (SkySpecs, Inc.); Shweta Khushu (SkySpecs Inc.) |
Computer vision and remote sensing Climate finance and economics Classification, regression, and supervised learning |
NeurIPS 2021 |
Predicting Power System Dynamics and Transients: A Frequency Domain Approach
(Papers Track)
Abstract and authors: (click to expand)Abstract: With the ambition of reducing carbon emissions and mitigating climate change, many regions have set up the goal to generate electricity with close to 100% renewables. However, actual renewable generations are often curtailed by operators because it is too hard to check the dynamic stability of the electric grid under the high uncertainties introduced by the renewables. The dynamics of a power grid are governed by a large number of nonlinear ordinary differential equations (ODEs). To safely operate the system, operators need to check that the states described by this set of ODEs stay within prescribed limits after various potential faults. Limited by the size and stiffness of the ODEs, current numerical integration techniques are often too slow to be useful in real-time or large-scale resource allocation problems. In addition, detailed system parameters are often not exactly known. Machine learning approaches have been proposed to reduce the computational efforts, but existing methods generally suffer from overfitting and failures to predict unstable behaviors. This paper proposes a novel framework for power system dynamic predictions by learning in the frequency domain. The intuition is that although the system behavior is complex in the time domain, there are relatively few dominate modes in the frequency domain. Therefore, we learn to predict by constructing neural networks with Fourier transform and filtering layers. System topology and fault information are encoded by taking a multi-dimensional Fourier transform, allowing us to leverage the fact that the trajectories are sparse both in time and spatial (across different buses) frequencies. We show that the proposed approach does not need detailed system parameters, speeds up prediction computations by orders of magnitude and is highly accurate for different fault types. Authors: Wenqi Cui (University of Washington); Weiwei Yang (Microsoft Research); Baosen Zhang (University of Washington) |
Power and energy Classification, regression, and supervised learning |
NeurIPS 2021 |
HyperionSolarNet: Solar Panel Detection from Aerial Images
(Papers Track)
Abstract and authors: (click to expand)Abstract: With the effects of global climate change impacting the world, collective efforts are needed to reduce greenhouse gas emissions. The energy sector is the single largest contributor to climate change and many efforts are focused on reducing dependence on carbon-emitting power plants and moving to renewable energy sources, such as solar power. A comprehensive database of the location of solar panels is important to assist analysts and policymakers in defining strategies for further expansion of solar energy. In this paper we focus on creating a world map of solar panels. We identify locations and total surface area of solar panels within a given geographic area. We use deep learning methods for automated detection of solar panel locations and their surface area using aerial imagery. The framework, which consists of a two-branch model using an image classifier in tandem with a semantic segmentation model, is trained on our created dataset of satellite images. Our work provides an efficient and scalable method for detecting solar panels, achieving an accuracy of 0.96 for classification and an IoU score of 0.82 for segmentation performance. Authors: Poonam Parhar (UCBerkeley); Ryan Sawasaki (UCBerkeley); Alberto Todeschini (UC Berkeley); Colorado Reed (UC Berkeley); Hossein Vahabi (University California Berkeley); Nathan Nusaputra (UC Berkeley); Felipe Vergara (UC Berkeley) |
Power and energy Buildings and cities Climate policy Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2021 |
Prediction of Household-level Heat-Consumption using PSO enhanced SVR Model
(Papers Track)
Abstract and authors: (click to expand)Abstract: In combating climate change, an effective demand-based energy supply operation of the district energy system (DES) for heating or cooling is indispensable. As a consequence, an accurate forecast of heat consumption on the consumer side poses an important first step towards an optimal energy supply. However, due to the non-linearity and non-stationarity of heat consumption data, the prediction of the thermal energy demand of DES remains challenging. In this work, we propose a forecasting framework for thermal energy consumption within a district heating system (DHS) based on kernel Support Vector Regression (kSVR) using real-world smart meter data. Particle Swarm Optimization (PSO) is employed to find the optimal hyper-parameter for the kSVR model which leads to the superiority of the proposed methods when compared to a state-of-the-art ARIMA model. The average MAPE is reduced to 2.07% and 2.64% for the individual meter-specific forecasting and for forecasting of societal consumption, respectively. Authors: Satyaki Chatterjee (Pattern Recognition Lab, FAU Erlangen-Nuremberg); Siming Bayer (Pattern Recognition Lab, Friedrich-Alexander University); Andreas K Maier (Pattern Recognition Lab, FAU Erlangen-Nuremberg) |
Classification, regression, and supervised learning Power and energy Other Time-series analysis |
NeurIPS 2021 |
EcoLight: Reward Shaping in Deep Reinforcement Learning for Ergonomic Traffic Signal Control
(Papers Track)
Abstract and authors: (click to expand)Abstract: Mobility, the environment, and human health are all harmed by sub-optimal control policies in transportation systems. Intersection traffic signal controllers are a crucial part of today's transportation infrastructure, as sub-optimal policies may lead to traffic jams and as a result increased levels of air pollution and wasted time. Many adaptive traffic signal controllers have been proposed in the literature, but research on their relative performance differences is limited. On the other hand, to the best of our knowledge there has been no work that directly targets CO2 emission reduction, even though pollution is currently a critical issue. In this paper, we propose a reward shaping scheme for various RL algorithms that not only produces lowers CO2 emissions, but also produces respectable outcomes in terms of other metrics such as travel time. We compare multiple RL algorithms --- sarsa, and A2C --- as well as diverse scenarios with a mix of different road users emitting varied amounts of pollution. Authors: Pedram Agand (Simon Fraser University); Alexey Iskrov (Breeze Labs Inc.); Mo Chen (Simon Fraser University) |
Transportation Power and energy Reinforcement learning and control |
NeurIPS 2021 |
Data Driven Study of Estuary Hypoxia
(Papers Track)
Abstract and authors: (click to expand)Abstract: This paper presents a data driven study of dissolved oxygen times series collected in Atlantic Canada. The main motivation of presented work was to evaluate if machine learning techniques could help to understand and anticipate hypoxic episodes in nutrient-impacted estuaries, a phenomenon that is exacerbated by increasing temperature expected to arise due to changes in climate. A major constraint was to limit ourselves to the use of dissolved oxygen time series only. Our preliminary findings shows that recurring neural networks and in particular LSTM may be suitable to predict short horizon levels while traditional results could benefit in longer range hypoxia prevention. Authors: Md Monwer Hussain (University of New-Brunswick); Guillaume Durand (National Research Council Canada); Michael Coffin (Department of Fisheries and Oceans Canada); Julio J Valdés (National Research Council Canada); Luke Poirier (Department of Fisheries and Oceans Canada) |
Ecosystems and natural systems Time-series analysis |
NeurIPS 2021 |
Decentralized Safe Reinforcement Learning for Voltage Control
(Papers Track)
Abstract and authors: (click to expand)Abstract: Inverter-based distributed energy resources provide the possibility for fast time-scale voltage control by quickly adjusting their reactive power. The power-electronic interfaces allow these resources to realize almost arbitrary control law, but designing these decentralized controllers is nontrivial. Reinforcement learning (RL) approaches are becoming increasingly popular to search for policy parameterized by neural networks. It is difficult, however, to enforce that the learned controllers are safe, in the sense that they may introduce instabilities into the system. This paper proposes a safe learning approach for voltage control. We prove that the system is guaranteed to be exponentially stable if each controller satisfies certain Lipschitz constraints. The set of Lipschitz bound is optimized to enlarge the search space for neural network controllers. We explicitly engineer the structure of neural network controllers such that they satisfy the Lipschitz constraints by design. A decentralized RL framework is constructed to train local neural network controller at each bus in a model-free setting. Authors: Wenqi Cui (University of Washington); Jiayi Li (University of Washington); Baosen Zhang (University of Washington) |
Power and energy Reinforcement learning and control |
NeurIPS 2021 |
NoFADE: Analyzing Diminishing Returns on CO2 Investment
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change continues to be a pressing issue that currently affects society at-large. It is important that we as a society, including the Computer Vision (CV) community take steps to limit our impact on the environment. In this paper, we (a) analyze the effect of diminishing returns on CV methods, and (b) propose a \textit{``NoFADE''}: a novel entropy-based metric to quantify model--dataset--complexity relationships. We show that some CV tasks are reaching saturation, while others are almost fully saturated. In this light, NoFADE allows the CV community to compare models and datasets on a similar basis, establishing an agnostic platform. Authors: Andre Fu (University of Toronto); Justin B Tran (University of Toronto); Andy Xie (University of Toronto); Jonathan T Spraggett (University of Toronto); Elisa Ding (University of Toronto); Chang-Won Lee (University of Toronto); Kanav Singla (University of Toronto); Mahdi S. Hosseini (University of New Brunswick); Konstantinos N Plataniotis (UofT) |
Carbon capture and sequestration Computer vision and remote sensing |
NeurIPS 2021 |
High-resolution rainfall-runoff modeling using graph neural network
(Papers Track)
Abstract and authors: (click to expand)Abstract: Time-series modeling has shown great promise in recent studies using the latest deep learning algorithms such as LSTM (Long Short-Term Memory). These studies primarily focused on watershed-scale rainfall-runoff modeling or streamflow forecasting, but the majority of them only considered a single watershed as a unit. Although this simplification is very effective, it does not take into account spatial information, which could result in significant errors in large watersheds. Several studies investigated the use of GNN (Graph Neural Networks) for data integration by decomposing a large watershed into multiple sub-watersheds, but each sub-watershed is still treated as a whole, and the geoinformation contained within the watershed is not fully utilized. In this paper, we propose the GNRRM (Graph Neural Rainfall-Runoff Model), a novel deep learning model that makes full use of spatial information from high-resolution precipitation data, including flow direction and geographic information. When compared to baseline models, GNRRM has less over-fitting and significantly improves model performance. Our findings support the importance of hydrological data in deep learning-based rainfall-runoff modeling, and we encourage researchers to include more domain knowledge in their models. Authors: Zhongrun Xiang (University of Iowa); Ibrahim Demir (The University of Iowa) |
Time-series analysis Agriculture, forestry and other land use Climate and Earth science Disaster prediction, management, and relief Earth science and monitoring Ecosystems and natural systems Classification, regression, and supervised learning Data mining |
NeurIPS 2021 |
Machine Learning for Snow Stratigraphy Classification
(Papers Track)
Abstract and authors: (click to expand)Abstract: Snow-layer segmentation and classification is an essential diagnostic task for a wide variety of cryospheric science and climate research applications. To this end a Snow Micro Pen (SMP) can be used - a portable high-resolution snow penetrometer. However, the penetration-force measurements of the SMP must be labeled manually, which is a time-intensive task that requires training and becomes infeasible for large datasets. Here, we evaluate how well machine learning models can automatically segment and classify SMP profiles. Fourteen different models are trained on the MOSAiC SMP dataset, a unique and large SMP dataset of snow on Arctic sea-ice profiles. Depending on the user's task and needs, the long short-term memory neural network and the random forests are performing the best. The findings presented here facilitate and accelerate SMP data analysis and in consequence, help scientists to analyze the effects of climate change on the cryosphere more efficiently. Authors: Julia Kaltenborn (McGill University); Viviane Clay (Osnabrück University); Amy R. Macfarlane (WSL Institute for Snow and Avalanche Research SLF); Martin Schneebeli (WSL Institute for Snow and Avalanche Research SLF) |
Other Earth science and monitoring Classification, regression, and supervised learning Time-series analysis |
NeurIPS 2021 |
DEM Super-Resolution with EfficientNetV2
(Papers Track)
Abstract and authors: (click to expand)Abstract: Efficient climate change monitoring and modeling rely on high-quality geospatial and environmental datasets. Due to limitations in technical capabilities or resources, the acquisition of high-quality data for many environmental disciplines is costly. Digital Elevation Model (DEM) datasets are such examples whereas their low-resolution versions are widely available, high-resolution ones are scarce. In an effort to rectify this problem, we propose and assess an EfficientNetV2 based model. The proposed model increases the spatial resolution of DEMs up to 16 times without additional information. Authors: Bekir Z Demiray (University of Iowa); Muhammed A Sit (The University of Iowa); Ibrahim Demir (The University of Iowa) |
Computer vision and remote sensing Disaster prediction, management, and relief Earth science and monitoring Classification, regression, and supervised learning |
NeurIPS 2021 |
Learning to Dissipate Traffic Jams with Piecewise Constant Control
(Papers Track)
Abstract and authors: (click to expand)Abstract: Greenhouse gases (GHGs), particularly carbon dioxide, are a key contributor to climate change. The transportation sector makes up 35% of CO2 emissions in the US and more than 70% of it is due to land transport. Previous work shows that simple driving interventions have the ability to significantly improve traffic flow on the road. Recent work shows that 5% of vehicles using piecewise constant controllers, designed to be compatible to the reaction times of human drivers, can prevent the formation of stop-and-go traffic congestion on a single-lane circular track, thereby mitigating land transportation emissions. Our work extends these results to consider more extreme traffic settings, where traffic jams have already formed, and environments with limited cooperation. We show that even with the added realism of these challenges, piecewise constant controllers, trained using deep reinforcement learning, can essentially eliminate stop-and-go traffic when actions are held fixed for up to 5 seconds. Even up to 10-second action holds, such controllers show congestion benefits over a human driving baseline. These findings are a stepping-stone for near-term deployment of vehicle-based congestion mitigation. Authors: Mayuri Sridhar (MIT); Cathy Wu () |
Transportation Reinforcement learning and control |
NeurIPS 2021 |
Multi-objective Reinforcement Learning Controller for Multi-Generator Industrial Wave Energy Converter
(Papers Track)
Abstract and authors: (click to expand)Abstract: Waves are one of the greatest sources of renewable energy and are a promising resource to tackle climate challenges by decarbonizing energy generation. Lowering the Levelized Cost of Energy (LCOE) for wave energy converters is key to competitiveness with other forms of clean energy like wind and solar. Also, the complexity of control has gone up significantly with the state-of-the-art multi-generator multi-legged industrial Wave Energy Converters (WEC). This paper introduces a Multi-Agent Reinforcement Learning controller (MARL) architecture that can handle these multiple objectives for LCOE, helping the increase in energy capture efficiency, boosting revenue, reducing structural stress to limit maintenance and operating cost, and adaptively and proactively protect the wave energy converter from catastrophic weather events, preserving investments and lowering effective capital cost. We use a MARL implementing proximal policy optimization (PPO) with various optimizations to help sustain the training convergence in the complex hyperplane. The MARL is able to better control the reactive forces of the generators on multiple tethers (legs) of WEC than the commonly deployed spring damper controller. The design for trust is implemented to assure the operation of WEC within a safe zone of mechanical compliance and guarantee mechanical integrity. This is achieved through reward shaping for multiple objectives of energy capture and penalty for harmful motions to minimize stress and lower the cost of maintenance. We achieved double-digit gains in energy capture efficiency across the waves of different principal frequencies over the baseline Spring Damper controller with the proposed MARL controllers. Authors: Soumyendu Sarkar (Hewlett Packard Enterprise); Vineet Gundecha (Hewlett Packard Enterpise); Alexander Shmakov (UC Irvine); Sahand Ghorbanpour (Hewlett Packard Enterprise); Ashwin Ramesh Babu (Hewlett Packard Enterprise Labs); Paolo Faraboschi (HPE); mathieu Cocho (Carnegie Clean Energy); Alexandre Pichard (Carnegie Clean Energy); Jonathan Fievez (Carnegie Clean Energy) |
Power and energy Reinforcement learning and control |
NeurIPS 2021 |
Resolving Super Fine-Resolution SIF via Coarsely-Supervised U-Net Regression
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change presents challenges to crop productivity, such as increasing the likelihood of heat stress and drought. Solar-Induced Chlorophyll Fluorescence (SIF) is a powerful way to monitor how crop productivity and photosynthesis are affected by changing climatic conditions. However, satellite SIF observations are only available at a coarse spatial resolution (e.g. 3-5km) in most places, making it difficult to determine how individual crop types or farms are doing. This poses a challenging coarsely-supervised regression task; at training time, we only have access to SIF labels at a coarse resolution (3 km), yet we want to predict SIF at a very fine spatial resolution (30 meters), a 100x increase. We do have some fine-resolution input features (such as Landsat reflectance) that are correlated with SIF, but the nature of the correlation is unknown. To address this, we propose Coarsely-Supervised Regression U-Net (CSR-U-Net), a novel approach to train a U-Net for this coarse supervision setting. CSR-U-Net takes in a fine-resolution input image, and outputs a SIF prediction for each pixel; the average of the pixel predictions is trained to equal the true coarse-resolution SIF for the entire image. Even though this is a very weak form of supervision, CSR-U-Net can still learn to predict accurately, due to its inherent localization abilities, plus additional enhancements that facilitate the incorporation of scientific prior knowledge. CSR-U-Net can resolve fine-grained variations in SIF more accurately than existing averaging-based approaches, which ignore fine-resolution spatial variation during training. CSR-U-Net could also be useful for a wide range of "downscaling'" problems in climate science, such as increasing the resolution of global climate models. Authors: Joshua Fan (Cornell University); Di Chen (Cornell University); Jiaming Wen (Cornell University); Ying Sun (Cornell University); Carla P Gomes (Cornell University) |
Agriculture, forestry and other land use Ecosystems and natural systems Computer vision and remote sensing Unsupervised and semi-supervised learning |
NeurIPS 2021 |
PreDisM: Pre-Disaster Modelling With CNN Ensembles for At-Risk Communities
(Papers Track)
Abstract and authors: (click to expand)Abstract: The machine learning community has recently had increased interest in the climate and disaster damage domain due to a marked increased occurrences of natural hazards (e.g., hurricanes, forest fires, floods, earthquakes). However, not enough attention has been devoted to mitigating probable destruction from impending natural hazards. We explore this crucial space by predicting building-level damages on a before-the-fact basis that would allow state actors and non-governmental organizations to be best equipped with resource distribution to minimize or preempt losses. We introduce PreDisM that employs an ensemble of ResNets and fully connected layers over decision trees to capture image-level and meta-level information to accurately estimate weakness of man-made structures to disaster-occurrences. Our model performs well and is responsive to tuning across types of disasters and highlights the space of preemptive hazard damage modelling. Authors: Vishal Anand (Columbia University); Yuki Miura (Columbia University) |
Disaster prediction, management, and relief Buildings and cities Computer vision and remote sensing Interpretable ML |
NeurIPS 2021 |
A hybrid convolutional neural network/active contour approach to segmenting dead trees in aerial imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: The stability and ability of an ecosystem to withstand climate change is directly linked to its biodiversity. Dead trees are a key indicator of overall forest health, housing one-third of forest ecosystem biodiversity, and constitute 8% of the global carbon stocks. They are decomposed by several natural factors, e.g. climate, insects and fungi. Accurate detection and modeling of dead wood mass is paramount to understanding forest ecology, the carbon cycle and decomposers. We present a novel method to construct precise shape contours of dead trees from aerial photographs by combining established convolutional neural networks with a novel active contour model in an energy minimization framework. Our approach yields superior performance accuracy over state-of-the-art in terms of precision, recall, and intersection over union of detected dead trees. This improved performance is essential to meet emerging challenges caused by climate change (and other man-made perturbations to the systems), particularly to monitor and estimate carbon stock decay rates, monitor forest health and biodiversity, and the overall effects of dead wood on and from climate change. Authors: Jacquelyn Shelton (Hong Kong Polytechnic University); Przemyslaw Polewski (TomTom Location Technology Germany GmbH); Wei Yao (The Hong Kong Polytechnic University); Marco Heurich (Bavarian Forest National Park) |
Computer vision and remote sensing Carbon capture and sequestration Ecosystems and natural systems Agriculture, forestry and other land use |
NeurIPS 2021 |
Semi-Supervised Classification and Segmentation on High Resolution Aerial Images
(Papers Track)
Abstract and authors: (click to expand)Abstract: FloodNet is a high-resolution image dataset acquired by a small UAV platform, DJI Mavic Pro quadcopters, after Hurricane Harvey. The dataset presents a unique challenge of advancing the damage assessment process for post-disaster scenarios using unlabeled and limited labeled dataset. We propose a solution to address their classification and semantic segmentation challenge. We approach this problem by generating pseudo labels for both classification and segmentation during training and slowly incrementing the amount by which the pseudo label loss affects the final loss. Using this semi-supervised method of training helped us improve our baseline supervised loss by a huge margin for classification, allowing the model to generalize and perform better on the validation and test splits of the dataset. In this paper, we compare and contrast the various methods and models for image classification and semantic segmentation on the FloodNet dataset. Authors: Sahil S Khose (Manipal Institute of Technology); Abhiraj Tiwari (Manipal Institute of Technology); Ankita Ghosh (Manipal Institute of Technology) |
Disaster prediction, management, and relief Classification, regression, and supervised learning Meta- and transfer learning Unsupervised and semi-supervised learning |
NeurIPS 2021 |
Reducing the Barriers of Acquiring Ground-truth from Biodiversity Rich Audio Datasets Using Intelligent Sampling Techniques
(Papers Track)
Abstract and authors: (click to expand)Abstract: The potential of passive acoustic monitoring (PAM) as a method to reveal the consequences of climate change on the biodiversity that make up natural soundscapes can be undermined by the discrepancy between the low barrier of entry to acquire large field audio datasets and the higher barrier of acquiring reliable species level training, validation, and test subsets from the field audio. These subsets from a deployment are often required to verify any machine learning models used to assist researchers in understanding the local biodiversity. Especially as many models convey promising results from various sources that may not translate to the collected field audio. Labeling such datasets is a resource intensive process due to the lack of experts capable of identifying bioacoustics at a species level as well as the overwhelming size of many PAM audiosets. To address this challenge, we have tested different sampling techniques on an audio dataset collected over a two-week long August audio array deployment on the Scripps Coastal Reserve (SCR) Biodiversity Trail in La Jolla, California. These sampling techniques involve creating four subsets using stratified random sampling, limiting samples to the daily bird vocalization peaks, and using a hybrid convolutional neural network (CNN) and recurrent neural network (RNN) trained for bird presence/absence audio classification. We found that a stratified random sample baseline only achieved a bird presence rate of 44% in contrast with a sample that randomly selected clips with high hybrid CNN-RNN predictions that were collected during bird activity peaks at dawn and dusk yielding a bird presence rate of 95%. The significantly higher bird presence rate demonstrates how intelligent, machine learning-assisted selection of audio data can significantly reduce the amount of time that domain experts listen to audio without vocalizations of interest while building a ground truth for machine learning models. Authors: Jacob G Ayers (UC San Diego); Sean Perry (UC San Diego); Vaibhav Tiwari (UC San Diego); Mugen Blue (Cal Poly San Luis Obispo); Nishant Balaji (UC San Diego); Curt Schurgers (UC San Diego); Ryan Kastner (University of California San Diego); Mathias Tobler (San Diego Zoo Wildlife Alliance); Ian Ingram (San Diego Zoo Wildlife Alliance) |
Ecosystems and natural systems Computer vision and remote sensing |
NeurIPS 2021 |
Two-phase training mitigates class imbalance for camera trap image classification with CNNs
(Papers Track)
Abstract and authors: (click to expand)Abstract: By leveraging deep learning to automatically classify camera trap images, ecologists can monitor biodiversity conservation efforts and the effects of climate change on ecosystems more efficiently. Due to the imbalanced class-distribution of camera trap datasets, current models are biased towards the majority classes. As a result, they obtain good performance for a few majority classes but poor performance for many minority classes. We used two-phase training to increase the performance for these minority classes. We trained, next to a baseline model, four models that implemented a different versions of two-phase training on a subset of the highly imbalanced Snapshot Serengeti dataset. Our results suggest that two-phase training can improve performance for many minority classes, with limited loss in performance for the other classes. We find that two-phase training based on majority undersampling increases class-specific F1-scores up to 3.0%. We also find that two-phase training outperforms using only oversampling or undersampling by 6.1% in F1-score on average. Finally, we find that a combination of over- and undersampling leads to a better performance than using them individually. Authors: Farjad Malik (KU Leuven); Simon Wouters (KU Leuven); Ruben Cartuyvels (KULeuven); Erfan Ghadery (KU Leuven); Sien Moens (KU Leuven) |
Ecosystems and natural systems Computer vision and remote sensing |
NeurIPS 2021 |
Capturing Electricity Market Dynamics in the Optimal Trading of Strategic Agents using Neural Network Constrained Optimization
(Papers Track)
Abstract and authors: (click to expand)Abstract: In competitive electricity markets the optimal trading problem of an electricity market agent is commonly formulated as a bi-level program, and solved as mathematical program with equilibrium constraints (MPEC). In this paper, an alternative paradigm, labeled as mathematical program with neural network constraint (MPNNC), is developed to incorporate complex market dynamics in the optimal bidding strategy. This method uses input-convex neural networks (ICNNs) to represent the mapping between the upper-level (agent) decisions and the lower-level (market) outcomes, i.e., to replace the lower-level problem by a neural network. In a comparative analysis, the optimal bidding problem of a load agent is formulated via the proposed MPNNC and via the classical bi-level programming method, and compared against each other. Authors: Mihály Dolányi (KU Leuven); Kenneth Bruninx (KU Leuven); Jean-François Toubeau (Faculté Polytechnique (FPMs), Université de Mons (UMONS)); Erik Delaue (KU Leuven) |
Hybrid physical models Climate finance and economics Recommender systems |
NeurIPS 2021 |
Leveraging Machine Learning to Predict the Autoconversion Rates from Satellite Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: One way of reducing the uncertainty involved in determining the radiative forcing of climate change is by understanding the interaction between aerosols, clouds, and precipitation processes. This can be studied using high-resolution simulations such as the ICOsahedral Non-hydrostatic Large-Eddy Model (ICON-LEM). However, due to the extremely high computational cost required, this simulation-based approach can only be run for a limited amount of time within a limited area. To address this, we developed new models using emerging machine learning approaches that leverage a plethora of satellite observations providing long-term global spatial coverage. In particular, our machine learning models are capable of capturing the key process of precipitation formation which greatly control cloud lifetime, namely autoconversion rates -- the term used to describe the collision and coalescence of cloud droplets responsible for raindrop formation. We validate the performance of our models against simulation data, showing that our models are capable of predicting the autoconversion rates fairly well. Authors: Maria C Novitasari (University College London); Johannes Quaas (University of Leipzig); Miguel Rodrigues (University College London) |
Classification, regression, and supervised learning Climate and Earth science Computer vision and remote sensing |
NeurIPS 2021 |
Towards Automatic Transformer-based Cloud Classification and Segmentation
(Papers Track)
Abstract and authors: (click to expand)Abstract: Clouds have been demonstrated to have a huge impact on the energy balance, temperature, and weather of the Earth. Classification and segmentation of clouds and coverage factors is crucial for climate modelling, meteorological studies, solar energy industry, and satellite communication. For example, clouds have a tremendous impact on short-term predictions or 'nowcasts' of solar irradiance and can be used to optimize solar power plants and effectively exploit solar energy. However even today, cloud observation requires the intervention of highly-trained professionals to document their findings, which introduces bias. To overcome these issues and contribute to climate change technology, we propose, to the best of our knowledge, the first two transformer-based models applied to cloud data tasks. We use the CCSD Cloud classification dataset and achieve 90.06% accuracy, outperforming all other methods. To demonstrate the robustness of transformers in this domain, we perform Cloud segmentation on SWIMSWG dataset and achieve 83.2% IoU, also outperforming other methods. With this, we signal a potential shift away from pure CNN networks. Authors: Roshan Roy (Birla Institute of Technology and Science, Pilani); Ahan M R (BITS Pilani); Vaibhav Soni (MANIT Bhopal); Ashish Chittora (BITS Pilani) |
Computer vision and remote sensing Climate and Earth science Earth science and monitoring Classification, regression, and supervised learning Data mining |
NeurIPS 2021 |
Scalable coastal inundation mapping using machine learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Coastal flooding is a significant climate hazard with impacts across economic sectors and society. This study provides a proof of concept for data-driven models for coastal flood inundation at the country scale, incorporating storm dynamics and geospatial characteristics to improve upon simpler geomorphological models. The best fit machine learning model scores an AUC of 0.92 in predicting flooded locations. For a case study storm event in December 2013 we find that all models over-predict flood extents, but that the machine learning model extents were closest to those observed. Authors: Ophelie Meuriot (IBM Research Europe); Anne Jones (IBM Research) |
Classification, regression, and supervised learning Climate and Earth science Other |
NeurIPS 2021 |
Machine Learning in Automating Carbon Sequestration Site Assessment
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Carbon capture and sequestration are viewed as an indispensable component to achieve the Paris Agreement climate goal, i.e., keep the global warming within 2 degrees Celsius from pre-industrial levels. Once captured, most CO2 needs to be stored securely for at least decades, preferably in deep underground geological formations. It is economical to inject and store CO2 near/around a depleted gas/oil reservoir or well, where a geological trap for CO2 with good sealing properties and some minimum infrastructure exist. In this proposal, with our preliminary work, it is shown that Machine Learning tools like Optical Character Recognition and Natural Language Processing can aid in screening and selection of injection sites for CO2 storage, facilitate identification of possible CO2 leakage paths in the subsurface, and assist in locating a depleted gas/oil well suitable for CO2 injection and long-term storage. The automated process based on ML tools can also drastically decrease the decision-making cycle time in site selection and assessment phase by reducing human effort. In the longer term, we expect ML tools like Deep Neural Networks to be utilized in CO2 storage monitoring, injection optimization etc. By injecting CO2 into a trapping geological underground formation in a safe and sustainable manner, the Energy industry can contribute substantially to reducing global warming and achieving the goals of the Paris Agreement by the end of this century. Authors: Jay Chen (Shell); Ligang Lu (Shell); Mohamed Sidahmed (Shell); Taixu Bai (Shell); Ilyana Folmar (Shell); Puneet Seth (Shell); Manoj Sarfare (Shell); Duane Mikulencak (Shell); Ihab Akil (Shell) |
Carbon capture and sequestration Climate finance and economics Active learning Computer vision and remote sensing |
NeurIPS 2021 |
A Risk Model for Predicting Powerline-induced Wildfires in Distribution System
(Proposals Track)
Abstract and authors: (click to expand)Abstract: The power grid is one of the most common causes of wildfires that result in tremendous economic loss and significant life risk. In this study, we propose to use machine learning techniques to build a risk model for predicting powerline-induced wildfires in distribution system. We collect weather, vegetation, and infrastructure data for all feeders in Pacific Gas & Electricity territory. This study will contribute to a deeper understanding of powerline-induced wildfire prediction and provide valuable suggestions for wildfire mitigation planning. Authors: Mengqi Yao (University of California Berkeley) |
Disaster prediction, management, and relief Power and energy Classification, regression, and supervised learning Data mining |
NeurIPS 2021 |
Detecting Abandoned Oil And Gas Wells Using Machine Learning And Semantic Segmentation
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Around the world, there are millions of unplugged abandoned oil and gas wells, leaking methane into the atmosphere. The locations of many of these wells, as well as their greenhouse gas emissions impacts, are unknown. Machine learning methods in computer vision and remote sensing, such as semantic segmentation, have made it possible to quickly analyze large amounts of satellite imagery to detect salient information. This project aims to automatically identify undocumented oil and gas wells in the province of Alberta, Canada to aid in documentation, estimation of emissions and maintenance of high-emitting wells. Authors: Michelle Lin (McGill University); David Rolnick (McGill University, Mila) |
Power and energy Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2021 |
Machine learning-enabled model-data integration for predicting subsurface water storage
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Subsurface water storage (SWS) is a key variable of the climate system and a storage component for precipitation and radiation anomalies, inducing persistence in the climate system. It plays a critical role in climate-change projections and can mitigate the impacts of climate change on ecosystems. However, because of the difficult accessibility of the underground, hydrologic properties and dynamics of SWS are poorly known. Direct observations of SWS are limited, and accurate incorporation of SWS dynamics into Earth system land models remains challenging. We propose a machine learning-enabled model-data integration framework to improve the SWS prediction at local to conus scales in a changing climate by leveraging all the available observation and simulation resources, as well as to inform the model development and guide the observation collection. The accurate prediction will enable an optimal decision of water management and land use and improve the ecosystem's resilience to the climate change. Authors: Dan Lu (Oak Ridge National Laboratory); Eric Pierce (Oak Ridge National Laboratory); Shih-Chieh Kao (Oak Ridge National Laboratory); David Womble (Oak Ridge National Laboratory); LI LI (Pennsylvania State University); Daniella Rempe (The University of Texas at Austin) |
Ecosystems and natural systems Earth science and monitoring |
NeurIPS 2021 |
Hybrid physics-based and data-driven modeling with calibrated uncertainty for lithium-ion battery degradation diagnosis and prognosis
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Advancing lithium-ion batteries (LIBs) in both design and usage is key to promoting electrification in the coming decades to mitigate human-caused climate change. Inadequate understanding of LIB degradation is an important bottleneck that limits battery durability and safety. Here, we propose hybrid physics-based and data-driven modeling for online diagnosis and prognosis of battery degradation. Compared to existing battery modeling efforts, we aim to build a model with physics as its backbone and statistical learning techniques as enhancements. Such a hybrid model has better generalizability and interpretability together with a well-calibrated uncertainty associated with its prediction, rendering it more valuable and relevant to safety-critical applications under realistic usage scenarios. Authors: Jing Lin (Institute for Infocomm Research); Yu Zhang (I2R); Edwin Khoo (Institute for Infocomm Research) |
Hybrid physical models Power and energy Causal and Bayesian methods Classification, regression, and supervised learning Time-series analysis Uncertainty quantification and robustness |
NeurIPS 2021 |
On the use of Deep Generative Models for "Perfect" Prognosis Climate Downscaling
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Deep Learning has recently emerged as a "perfect" prognosis downscaling technique to compute high-resolution fields from large-scale coarse atmospheric data. Despite their promising results to reproduce the observed local variability, they are based on the estimation of independent distributions at each location, which leads to deficient spatial structures, especially when downscaling precipitation. This study proposes the use of generative models to improve the spatial consistency of the high-resolution fields, very demanded by some sectoral applications (e.g., hydrology) to tackle climate change. Authors: Jose González-Abad (Institute of Physics of Cantabria); Jorge Baño-Medina (Institute of Physics of Cantabria); Ignacio Heredia (Institute of Physics of Cantabria) |
Climate and Earth science Classification, regression, and supervised learning Generative modeling |
NeurIPS 2021 |
A Deep Learning application towards transparent communication for Payment for Forest Environmental Services (PES)
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Deforestation accounts for more than 20% of global emission. Payments for Environmental Services (PES) is seen by both policy makers and practitioners as an effective market-based instrument to provide financial incentives for forest owners, particularly poor and indigenous households in developing countries. It is a critical instrument to protect forests, and ultimately to mitigate climate change and reduce emission from deforestation. However, previous studies have pointed out a key challenge for PES is to ensure transparent payment to local people, due to i) weak monitoring and evaluation and ii) indigenous inaccessibility to e-banking and complying with procedural and administrative paper works to receive payments. Specifically, the amount and the complexity of forms along with the language barriers is a key issue; and most transactions need several intermediaries and transaction costs which reduce the payments reaching landowners. To address these issues, we propose a communication platform that links across the stakeholders and processes. Our proposal will utilize Machine Learning techniques to lower the language barrier and provide technology solutions to help indigenous people to access payments. This would also help improve the effectiveness and transparency of PES schemes. Specifically, we propose the use of Natural Language Processing techniques in providing a speech-to-text and auto translation capability, and the use of Graph Neural Network to provide link predictions of transaction types, volumes and values. The pathway to impact will be forest protection and local livelihood through providing financial incentives, and subsequently contribution to more carbon sequestration and storage – a key issue in climate change mitigation. Authors: Lan HOANG (IBM Research); Thuy Thu Phan (Center for International Forestry Research (CIFOR)) |
Agriculture, forestry and other land use Natural language processing |
NeurIPS 2021 |
A NLP-based Analysis of Alignment of Organizations' Climate-Related Risk Disclosures with Material Risks and Metrics
(Proposals Track)
Abstract and authors: (click to expand)Abstract: The Sustainability Accounting Standards Board (SASB) establishes standards to guide the disclosures of material sustainability and ESG (Environment, Social, Governance)-related information across industries. The availability of quality, comparable and decision-useful information is required to assess risks and opportunities later integrated into financial decision-making. Particularly, standardized, industry-specific climate risk metrics and topics can support these efforts. SASB’s latest climate risk technical bulletin introduces three climate-related risks that are financially material - physical, transition and regulatory risks - and maps these across industries. The main objective of this work is to create a framework that can analyze climate related risk disclosures using an AI-based tool that automatically extracts and categorizes climate-related risks and related metrics from company disclosures based on SASB’s latest climate risk guidance. This process will help with automating large-scale analysis and add much-needed transparency vis-a-vis the current state of climate-related disclosures, while also assessing how far along companies are currently disclosing information on climate risks relevant to their industry. As it stands, this much needed type of analysis is made mostly manually or using third-party metrics, often opaque and biased, as proxies. In this work, we will first create a climate risk glossary that will be trained on a large amount of climate risk text. By combining climate risk keywords in this glossary with recent advances in natural language processing (NLP), we will then be able to quantitatively and qualitatively compare climate risk information in different sectors and industries using a novel climate risk score that will be based on SASB standards. Authors: Elham Kheradmand (University of Montreal); Didier Serre (Clearsum); Manuel Morales (University of Montreal); Cedric B Robert (Clearsum) |
Climate finance and economics Classification, regression, and supervised learning Natural language processing |
NeurIPS 2021 |
Unsupervised Machine Learning framework for sensor placement optimization: analyzing methane leaks
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Methane is one of the most potent greenhouse gases, with the global oil and gas industry being the second largest source of anthropogenic methane emissions, accounting for about 63% of the whole energy sector. This underscores the importance of detecting and remediating methane leaks for the entire oil and gas value chain. Methane sensor networks are a promising technology to detect methane leaks in a timely manner. While they provide near-real-time monitoring of an area of interest, the density of the network can be cost prohibitive, and the identification of the source of the leak is not apparent, especially where there could be more than one source. To address these issues, we developed a machine learning framework that leverages various data sources including oil and gas facilities data, historical methane leak rate distribution and meteorological data, to optimize sensor placement. The determination of sensor locations follows the objective to maximize the detection of possible methane leaks with a limited sensor budget. Authors: Shirui Wang (University of Houston); Sara Malvar (Microsoft); Leonardo Nunes (Microsoft); Kim Whitehall (Microsoft); YAGNA DEEPIKA ORUGANTI (MICROSOFT); Yazeed Alaudah (Microsoft); Anirudh Badam (Microsoft) |
Earth science and monitoring Other Unsupervised and semi-supervised learning |
NeurIPS 2021 |
Multi-agent reinforcement learning for renewable integration in the electric power grid
(Proposals Track)
Abstract and authors: (click to expand)Abstract: As part of the fight against climate change, the electric power system is transitioning from fuel-burning generators to renewable sources of power like wind and solar. To allow for the grid to rely heavily on renewables, important operational changes must be done. For example, novel approaches for frequency regulation, i.e., for balancing in real-time demand and generation, are required to ensure the stability of a renewable electric system. Demand response programs in which loads adjust in part their power consumption for the grid's benefit, can be used to provide frequency regulation. In this proposal, we present and motivate a collaborative multi-agent reinforcement learning approach to meet the algorithmic requirements for providing real-time power balancing with demand response. Authors: Vincent Mai (Mila, Université de Montréal); Tianyu Zhang (Mila, Université de Montréal); Antoine Lesage-Landry (Polytechnique Montréal & GERAD) |
Power and energy Reinforcement learning and control |
NeurIPS 2021 |
Machine Learning Speeding Up the Development of Portfolio of New Crop Varieties to Adapt to and Mitigate Climate Change
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Climate change poses serious challenges to achieving food security in a time of a need to produce more food to keep up with the world’s increasing demand for food. There is an urgent need to speed up the development of new high yielding varieties with traits of adaptation and mitigation to climate change. Mathematical approaches, including ML approaches, have been used to search for such traits, leading to unprecedented results as some of the traits, including heat traits that have been long sought-for, have been found within a short period of time. Authors: Abdallah Bari (OperAI Canada - Operational AI); Hassan Ouabbou (INRA); Abderrazek Jilal (INRA); Frederick Stoddard (University of Helsinki); Mikko Sillanpää (University of Oulu); Hamid Khazaei (World Vegetable Center) |
Agriculture, forestry and other land use Carbon capture and sequestration Causal and Bayesian methods Classification, regression, and supervised learning Time-series analysis Unsupervised and semi-supervised learning |
NeurIPS 2021 |
Toward Foundation Models for Earth Monitoring: Proposal for a Climate Change Benchmark
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Recent progress in self-supervision shows that pre-training large neural networks on vast amounts of unsupervised data can lead to impressive increases in generalisation for downstream tasks. Such models, recently coined as foundation models, have been transformational to the field of natural language processing. While similar models have also been trained on large corpuses of images, they are not well suited for remote sensing data. To stimulate the development of foundation models for Earth monitoring, we propose to develop a new benchmark comprised of a variety of downstream tasks related to climate change. We believe that this can lead to substantial improvements in many existing applications and facilitate the development of new applications. This proposal is also a call for collaboration with the aim of developing a better evaluation process to mitigate potential downsides of foundation models for Earth monitoring. Authors: Alexandre Lacoste (ServiceNow); Evan D Sherwin (Stanford University, Energy and Resources Engineering); Hannah R Kerner (University of Maryland); Hamed Alemohammad (Radiant Earth Foundation); Björn Lütjens (MIT); Jeremy A Irvin (Stanford); David Dao (ETH Zurich); Alex Chang (Service Now); Mehmet Gunturkun (Element Ai); Alexandre Drouin (ServiceNow); Pau Rodriguez (Element AI); David Vazquez (ServiceNow) |
Computer vision and remote sensing Agriculture, forestry and other land use Climate and Earth science Disaster prediction, management, and relief Ecosystems and natural systems Meta- and transfer learning |
NeurIPS 2021 |
Optimization of Agricultural Management for Soil Carbon Sequestration based on Deep Reinforcement Learning and Large-Scale Simulations
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Soil carbon sequestration in croplands has tremendous potential to help mitigate climate change; however, it is challenging to develop the optimal management practices for maximization of the sequestered carbon as well as the crop yield. This project aims to develop an intelligent agricultural management system using deep reinforcement learning (RL) and large-scale soil and crop simulations. To achieve this, we propose to build a simulator to model and simulate the complex soil-water-plant-atmosphere interaction. By formulating the management decision as an RL problem, we can leverage the state-of-the-art algorithms to train management policies through extensive interactions with the simulated environment. The trained policies are expected to maximize the stored organic carbon while maximizing the crop yield in the presence of uncertain weather conditions. The whole system will be tested using data of soil and crops in both mid-west of the United States and the central region of Portugal. The proposed research will impact food security and climate change, two of the most significant challenges currently facing humanity. Authors: Jing Wu (University of Illinois Urbana-Champaign); Pan Zhao (University of Illinois Urbana-Champaign); Ran Tao (University of Illinois Urbana-Champaign); Naira Hovakimyan (UIUC); Guillermo Marcillo (University of Illinois at Urbana-Champaign); Nicolas Martin (University of Illinois at Urbana-Champaign); Carla Ferreira (Royal Institute of Technology); Zahra Kalantari (Royal Institute of Technology); Jennifer Hobbs (IntelinAir Inc.) |
Reinforcement learning and control Carbon capture and sequestration |
NeurIPS 2021 |
Leveraging machine learning for identify hydrological extreme events under global climate change
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Hydrological extreme events, such as droughts and floods, are highly destructive natural disasters and its occurrence is expected to increase under the future climate change. Accurate and efficient approach to detect such events will provide timely information to assist management strategies for minimizing socio-economic damages. Despite the threshold approach has established to detect extreme events, the missing data from hydroclimate data and accurately identifying these events are still major challenges. The advent of machine learning models can help to identify the occurrence of droughts and floods events accurately and efficiently. Therefore, this proposed study will develop a machine learning model with semi-supervised anomaly detection approach to identify hydrological extreme events with ground-based data. As a test case, we will use 45-years record of hydroclimate data in coastal California, where was the driest region in 2012-2015, following with flash floods events. The expected results will increase communities’ awareness for hydrological extreme events and enable environmental planning and resource management under climate change Authors: Ying-Jung C Deweese (Georgia Insititute of Technology) |
Earth science and monitoring Classification, regression, and supervised learning |
NeurIPS 2021 |
Predicting Cascading Failures in Power Systems using Graph Convolutional Networks
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Worldwide targets are set for the increase of renewable power generation in electricity networks on the way to combat climate change. Consequently, a secure power system that can handle the complexities resulted from the increased renewable power integration is crucial. One particular complexity is the possibility of cascading failures — a quick succession of multiple component failures that takes down the system and might also lead to a blackout. Viewing the prediction of cascading failures as a binary classification task, we explore the efficacy of Graph Convolution Networks (GCNs), to detect the early onset of a cascading failure. We perform experiments based on simulated data from a benchmark IEEE test system. Our preliminary findings show that GCNs achieve higher accuracy scores than other baselines which bodes well for detecting cascading failures. It also motivates a more comprehensive study of graph-based deep learning techniques for the current problem. Authors: Tabia Ahmad (University of Strathclyde); Yongli Zhu (Texas A&M Universersity); Panagiotis Papadopoulos (University of Strathclyde) |
Power and energy Carbon capture and sequestration Classification, regression, and supervised learning Data mining Unsupervised and semi-supervised learning |
NeurIPS 2021 |
DeepQuake: Artificial Intelligence for Earthquake Forecasting Using Fine-Grained Climate Data
(Proposals Track)
Best Paper: Proposals
Abstract and authors: (click to expand)Abstract: Earthquakes are one of the most catastrophic natural disasters, making accurate, fine-grained, and real-time earthquake forecasting extremely important for the safety and security of human lives. In this work, we propose DeepQuake, a hybrid physics and deep learning model for fine-grained earthquake forecasting using time-series data of the horizontal displacement of earth’s surface measured from continuously operating Global Positioning System (cGPS) data. Recent studies using cGPS data have established a link between transient deformation within earth's crust to climate variables. DeepQuake’s physics-based pre-processing algorithm extracts relevant features including the x, y, and xy components of strain in earth’s crust, capturing earth’s elastic response to these climate variables, and feeds it into a deep learning neural network to predict key earthquake variables such as the time, location, magnitude, and depth of a future earthquake. Results across California show promising correlations between cGPS derived strain patterns and the earthquake catalog ground truth for a given location and time. Authors: Yash Narayan (The Nueva School) |
Time-series analysis Climate and Earth science Earth science and monitoring Other |
NeurIPS 2021 |
A day in a sustainable life
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: In this notebook, we show the reader how to use an electrical battery to minimize the operational carbon intensity of a building. The central idea is to charge the battery when the carbon intensity of the grid energy mix is low, and vice versa. The same methodology is used in practice to optimise for a number of different objective functions, including energy costs. Taking the hypothetical case of Pi, an eco-conscious and tech-savvy householder in the UK, we walk the reader through getting carbon intensity data, and how to use this with a number of different optimisation algorithms to decarbonise. Starting off with easy-to-understand, brute force search, we establish a baseline for subsequent (hopefully smarter) optimization algorithms. This should come naturally, since in their day job Pi is a data scientist where they often use grid and random search to tune hyperparameters of ML models. The second optimization algorithm we explore is a genetic algorithm, which belongs to the class of derivative free optimizers and is consequently extremely versatile. However, the flexibility of these algorithms comes at the cost of computational speed and effort. In many situations, it makes sense to utilize an optimization method which can make use of the special structure in the problem. As the final step, we see how Pi can optimally solve the problem of minimizing their carbon intensity by formulating it as a linear program. Along the way, we also keep an eye out for some of the most important challenges that arise in practice. Authors: Hussain Kazmi (KU Leuven); Attila Balint (KU Leuven); Jolien Despeghel (KU Leuven) |
Buildings and cities Power and energy Classification, regression, and supervised learning Other Time-series analysis |
NeurIPS 2021 |
Open Catalyst Project: An Introduction to ML applied to Molecular Simulations
(Tutorials Track)
Abstract and authors: (click to expand)Abstract: As the world continues to battle energy scarcity and climate change, the future of our energy infrastructure is a growing challenge. Renewable energy technologies offer the opportunity to drive efficient carbon-neutral means for energy storage and generation. Doing so, however, requires the discovery of efficient and economic catalysts (materials) to accelerate associated chemical processes. A common approach in discovering high performance catalysts is using molecular simulations. Specifically, each simulation models the interaction of a catalyst surface with molecules that are commonly seen in electrochemical reactions. By predicting these interactions accurately, the catalyst's impact on the overall rate of a chemical reaction may be estimated. The Open Catalyst Project (OCP) aims to develop new ML methods and models to accelerate the catalyst simulation process for renewable energy technologies and improve our ability to predict properties across catalyst composition. The initial release of the Open Catalyst 2020 (OC20) dataset presented the largest open dataset of molecular combinations, spanning 55 unique elements and over 130M+ data points. We will present a comprehensive tutorial of the Open Catalyst Project repository, including (1) Accessing & visualizing the dataset, (2) Overview of the various tasks, (3) Training graph neural network (GNN) models, (4) Developing your own model for OCP, (5) Running ML-driven simulations, and (6) Visualizing the results. Primary tools include PyTorch and PyTorch Geometric. No background in chemistry is assumed. Following this tutorial we hope to better equip attendees with a basic understanding of the data and repository. Authors: Muhammed Shuaibi (Carnegie Mellon University); Anuroop Sriram (Facebook); Abhishek Das (Facebook AI Research); Janice Lan (Facebook AI Research); Adeesh Kolluru (Carnegie Mellon University); Brandon Wood (NERSC); Zachary Ulissi (Carnegie Mellon University); Larry Zitnick (Facebook AI Research) |
Carbon capture and sequestration Other Power and energy |
ICML 2021 |
Examining the nexus of environmental policy, climate physics, and maritime shipping with deep learning models and space-borne data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Ship-tracks are produced by ship exhaust interacting with marine low clouds. They provide an ideal lab for constraining a critical climate forcing. However, no global survey of ship ship-tracks has been made since its discovery 55 years ago, which limits research progress. Here we present the first global map of ship-tracks produced by applying deep segmentation models to large satellite data. Our model generalizes well and is validated against independent data. Large-scale ship-track data are at the nexus of environmental policy, climate physics, and maritime shipping industry: they can be used to study aerosol-cloud interactions, the largest uncertainty source in climate forcing; to evaluate compliance and impacts of environmental policies; and to study the impact of significant socioeconomic events on maritime shipping. Based on twenty years of global data, we show cloud physics responses in ship-tracks strongly depend on the cloud regime. Inter-annual fluctuation in ship-track frequency clearly reflects international trade/economic trends. Emission policies strongly affect the pattern of shipping routes and ship-track occurrence. The combination of stricter fuel standard and the COVID-19 pandemic pushed global ship-track frequency to the lowest level in the record. More applications of our technique and data are envisioned such as detecting illicit shipping activity and checking policy compliance of individual ships. Authors: Tianle Yuan (University of Maryland, NASA); Hua Song (NASA, SSAI); Chenxi Wang (University of Maryland, NASA); Kerry Meyer (NASA); Siobhan Light (University of Maryland); Sophia von Hippel (University of Arizona); Steven Platnick (NASA); Lazaros Oreopoulos (NASA); Robert Wood (University of Washington); Hans Mohrmann (University of Washington) |
Computer vision and remote sensing Climate policy Meta- and transfer learning |
ICML 2021 |
A human-labeled Landsat-8 contrails dataset
(Papers Track)
Abstract and authors: (click to expand)Abstract: Contrails (condensation trails) are the ice clouds that trail behind aircraft as they fly through cold and moist regions of the atmosphere. Avoiding these regions could potentially be an inexpensive way to reduce over half of aviation's impact on global warming. Development and evaluation of these avoidance strategies greatly benefits from the ability to detect contrails on satellite imagery. Since little to no public data is available to develop such contrail detectors, we construct and release a dataset of several thousand Landsat-8 scenes with pixel-level annotations of contrails. The dataset will continue to grow, but currently contains 4289 scenes (of which 47% have at least one contrail) representing 950+ person-hours of labeling time. Authors: Kevin McCloskey (Google); Scott Geraedts (Google); Brendan Jackman (Google); Vincent R. Meijer (Laboratory for Aviation and the Environment, Massachusetts Institute of Technology); Erica Brand (Google); Dave Fork (Google); John C. Platt (Google); Carl Elkin (Google); Christopher Van Arsdale (Google) |
Computer vision and remote sensing Climate and Earth science |
ICML 2021 |
Urban Tree Species Classification Using Aerial Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Urban trees help regulate temperature, reduce energy consumption, improve urban air quality, reduce wind speeds, and mitigating the urban heat island effect. Urban trees also play a key role in climate change mitigation and global warming by capturing and storing atmospheric carbon-dioxide which is the largest contributor to greenhouse gases. Automated tree detection and species classification using aerial imagery can be a powerful tool for sustainable forest and urban tree management. Hence, This study first offers a pipeline for generating labelled dataset of urban trees using Google Map's aerial images and then investigates how state of the art deep Convolutional Neural Network models such as VGG and ResNet handle the classification problem of urban tree aerial images under different parameters. Experimental results show our best model achieves an average accuracy of 60% over 6 tree species. Authors: Emily Waters (Anglia Ruskin University); Mahdi Maktabdar Oghaz (Anglia Ruskin University); Lakshmi Babu Saheer (Anglia Ruskin University) |
Carbon capture and sequestration Buildings and cities Climate and Earth science Agriculture, forestry and other land use Computer vision and remote sensing |
ICML 2021 |
Estimation of Corporate Greenhouse Gas Emissions via Machine Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: As an important step to fulfill the Paris Agreement and achieve net-zero emissions by 2050, the European Commission adopted the most ambitious package of climate impact measures in April 2021 to improve the flow of capital towards sustainable activities. For these and other international measures to be successful, reliable data is key. The ability to see the carbon footprint of companies around the world will be critical for investors to comply with the measures and hit climate neutrality. However, with only a small portion of companies volunteering to disclose their greenhouse gas (GHG) emissions, it is nearly impossible for investors to align their investment strategies with the measures. By training a machine learning model on disclosed GHG emissions, we are able to estimate the emissions of other companies globally who do not disclose their emissions. In this paper, we show that our model provides accurate estimates of corporate GHG emissions to investors such that they are able to align their investments with the regulatory measures and achieve net-zero goals. Authors: You Han (Bloomberg L.P.); Achintya Gopal (Bloomberg LP); Liwen Ouyang (Bloomberg L.P.); Aaron Key (Bloomberg LP) |
Climate finance and economics Industry Climate policy Classification, regression, and supervised learning Generative modeling Uncertainty quantification and robustness Unsupervised and semi-supervised learning |
ICML 2021 |
ANP-BBO: Attentive Neural Processes and Batch Bayesian Optimization for Scalable Calibration of Physics-Informed Digital Twins
(Papers Track)
Abstract and authors: (click to expand)Abstract: Physics-informed dynamical system models form critical components of digital twins of the built environment. These digital twins enable the design of energy-efficient infrastructure, but must be properly calibrated to accurately reflect system behavior for downstream prediction and analysis. Dynamical system models of modern buildings are typically described by a large number of parameters and incur significant computational expenditure during simulations. To handle large-scale calibration of digital twins without exorbitant simulations, we propose ANP-BBO: a scalable and parallelizable batch-wise Bayesian optimization (BBO) methodology that leverages attentive neural processes (ANPs). Authors: Ankush Chakrabarty (Mitsubishi Electric Research Labs); Gordon Wichern (Mitsubishi Electric Research Laboratories (MERL)); Christopher Laughman (Mitsubishi Electric Research Laboratories (MERL)) |
Buildings and cities Causal and Bayesian methods |
ICML 2021 |
Seasonal Sea Ice Presence Forecasting of Hudson Bay using Seq2Seq Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Accurate and timely forecasts of sea ice conditions are crucial for safe shipping operations in the Canadian Arctic and other ice-infested waters. Given the advancement of machine-learning methods and the recent observations on the declining trend of Arctic sea ice extent over the past decades due to global warming, new machine learning approaches are deployed to provide additional sea ice forecasting products. This study is novel in comparison with previous machine learning (ML) approaches in the sea-ice forecasting domain as it provides a daily spatial map of probability of ice presence in the domain up to 90 days. The predictions are further used to predict freeze-up/breakup dates and show their capability to capture both the variability and the increasing trend of open water season in the domain over the past decades. Authors: Nazanin Asadi (University of Waterloo); K Andrea Scott (University of Waterloo); Philippe Lamontagne (National Research Council Canada) |
Climate and Earth science Classification, regression, and supervised learning Computer vision and remote sensing Time-series analysis |
ICML 2021 |
Semantic Segmentation on Unbalanced Remote Sensing Classes for Active Fire
(Papers Track)
Abstract and authors: (click to expand)Abstract: Wildfires generate considerable research interest due to their high frequency of occurrence along with global climate change. Future wildfire detection sensors would equip an on-orbit processing module that filters the useless raw images before data transmission. To efficiently detect heat anomalies from the single large scene, we need to handle the unbalanced sample sets between small active fire pixels and large-size complex background information. In this study, we contribute to solving this problem by enhancing the target feature representation in three ways. We first preprocess training images by constraining sampling ranges and removing background patches. Then we use the object-contextual representation (OCR) module to strengthen the active fire pixel representation based on the self-attention unit. The HRNet backbone provides multi-scale pixel representation as input to the OCR module. Finally, the combined loss of weighted cross-entropy loss and Lovasz hinge loss improve the segmentation accuracy further by optimizing the IoU of the foreground class. The performance is tested on the aerial FLAME dataset, whose ratio between labeled active fire and background pixels is 5.6%. The proposed framework improves the mIoU from 83.10% (baseline U-Net) to 90.81%. Future research will expand the technique for active fire detection using satellite images. Authors: Xikun Hu (KTH Royal Institute of Technology); Alberto Costa Nogueira Junior (IBM Research Brazil); Tian Jin (College of Electronic Science, National University of Defense Technology) |
Disaster prediction, management, and relief Computer vision and remote sensing |
ICML 2021 |
Improving Image-Based Characterization of Porous Media with Deep Generative Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Micro- and nanoscale imaging are important for characterizing subsurface formations for carbon sequestration, shale gas recovery, and hydrogen storage. Common imaging techniques, however, are often sample-destructive, expensive, require high levels of expertise, or only acquire planar data. The resulting image datasets therefore may not allow for a representative estimation of rock properties. In this work, we address these challenges in image-based characterization of porous media using deep generative models. We present a machine learning workflow for characterizing porous media from limited imaging data. We develop methods for 3D image volume translation and synthesis from 2D training data, apply this method to grayscale and multimodal image datasets of sandstones and shales, and simulate flow through the generated volumes. Results show that the proposed image reconstruction and generation approaches produce realistic pore-scale 3D representations of rock samples using only 2D training data. The models proposed here expand our capabilities for characterization of rock samples and enable new understanding of pore-scale storage and recovery processes. Authors: Timothy Anderson (Stanford University); Kelly Guan (Stanford University); Bolivia Vega (Stanford University); Laura Froute (Stanford University); Anthony Kovscek (Stanford University) |
Generative modeling Carbon capture and sequestration |
ICML 2021 |
Forest Terrain Identification using Semantic Segmentation on UAV Images
(Papers Track)
Abstract and authors: (click to expand)Abstract: Beavers' habitat is known to alter the terrain, providing biodiversity in the area, and recently their lifestyle is linked to climatic changes by reducing greenhouse gases levels in the region. To analyse the impact of beavers’ habitat on the region, it is, therefore, necessary to estimate the terrain alterations caused by beaver actions. Furthermore, such terrain analysis can also play an important role in domains like wildlife ecology, deforestation, land-cover estimations, and geological mapping. Deep learning models are known to provide better estimates on automatic feature identification and classification of a terrain. However, such models require significant training data. Pre-existing terrain datasets (both real and synthetic) like CityScapes, PASCAL, UAVID, etc, are mostly concentrated on urban areas and include roads, pathways, buildings, etc. Such datasets, therefore, are unsuitable for forest terrain analysis. This paper contributes, by providing a finely labelled novel dataset of forest imagery around beavers’ habitat, captured from a high-resolution camera on an aerial drone. The dataset consists of 100 such images labelled and classified based on 9 different classes. Furthermore, a baseline is established on this dataset using state-of-the-art semantic segmentation models based on performance metrics including Intersection Over Union (IoU), Overall Accuracy (OA), and F1 score. Authors: Muhammad Umar (Anglia Ruskin University); Lakshmi Babu Saheer (Anglia Ruskin University); Javad Zarrin (Anglia Ruskin University) |
Agriculture, forestry and other land use Ecosystems and natural systems Classification, regression, and supervised learning Computer vision and remote sensing |
ICML 2021 |
Climate-based ensemble machine learning model to forecast Dengue epidemics
(Papers Track)
Abstract and authors: (click to expand)Abstract: Dengue fever is one of the most common and rapidly spreading arboviral diseases in the world, with major public health and economic consequences in tropical and sub-tropical regions. Countries such as Peru, 17.143 cases of dengue were reported in 2019, where 81.4% of cases concentrated in five of the 25 departments. When predicting infectious disease outbreaks, it is crucial to model the long-term dependency in time series data. However, this is challenging when performed on a countrywide level since dengue incidence varies across administrative areas. Therefore, this study developed and applied a climate-based ensemble model using multiple machine learning (ML) approaches to forecast dengue incidence rate (DIR) by department. The ensemble combined the outputs from Long Short-Term Memory (LSTM) recurrent neural network and Categorical Boosting (CatBoost) methods to predict DIR one month ahead for each department in Peru. Monthly dengue cases stratified by Peruvian departments were analysed in conjunction with associated demographic, geographic, and satellite-based meteorological data for the period January 2010–December 2019. The results demonstrated that the ensemble model was able to forecast DIR in low-transmission departments, while the model was less able to detect sudden DIR peaks in some departments. Air temperature and wind components demonstrated to be the significant predictors for DIR predictions. This dengue forecast model is timely and can help local governments to implement effective control measures and mitigate the effects of the disease. This study advances the state-of-the-art of climate services for the public health sector, by informing what are the key climate factors responsible for triggering dengue transmission. Finally, this project summarises how important it is to perform collaborative work with complementary expertise from intergovernmental organizations and public health universities to advance knowledge and address societal challenges. Authors: Rochelle Schneider (European Space Agency); Alessandro Sebastianelli (European Space Agency); Dario Spiller (Italian Space Agency); James Wheeler (European Space Agency); Raquel Carmo (European Space Agency); Artur Nowakowski (Warsaw University of Technology); Manuel Garcia-Herranz (UNICEF); Dohyung Kim (UNICEF); Hanoch Barlevi (UNICEF LACRO); Zoraya El Raiss Cordero (UNICEF LACRO); Silvia Liberata Ullo (University of Sannio); Pierre-Philippe Mathieu (European Space Agency); Rachel Lowe (London School of Hygiene & Tropical Medicine) |
Societal adaptation Climate Change and Health |
ICML 2021 |
Physics-Informed Graph Neural Networks for Robust Fault Location in Power Grids
(Papers Track)
Best Paper: ML Innovation
Abstract and authors: (click to expand)Abstract: The reducing cost of renewable energy resources, such as solar photovoltaics (PV) and wind farms, is accelerating global energy transformation to mitigate climate change. However, a high level of intermittent renewable energy causes power grids to have more stability issues. This accentuates the need for quick location of system failures and follow-up control actions. In recent events such as in California, line failures have resulted in large-scale wildfires leading to loss of life and property. In this article, we propose a two-stage graph learning framework to locate power grid faults in the challenging but practical regime characterized by (a) sparse observations, (b) low label rates, and (c) system variability. Our approach embeds the geometrical structure of power grids into the graph neural networks (GNN) in stage I for fast fault location, and then stage II further enhances the location accuracy by employing the physical similarity of the labeled and unlabeled data samples. We compare our approach with three baselines in the IEEE 123-node benchmark system and show that it outperforms the others by significant margins in various scenarios. Authors: Wenting Li (Los Alamos National Laboratory); Deepjyoti Deka (Los Alamos National Laboratory) |
Power and energy Classification, regression, and supervised learning Hybrid physical models Interpretable ML Uncertainty quantification and robustness Unsupervised and semi-supervised learning |
ICML 2021 |
Prediction of Boreal Peatland Fires in Canada using Spatio-Temporal Methods
(Papers Track)
Abstract and authors: (click to expand)Abstract: Peat fires are the largest fires on earth in terms of fuel consumption and are responsible for a significant portion of global carbon emissions. Predicting fires in the peatlands can help decision-makers and researchers monitor and prevent peat fires. Despite this, research on predicting peatland fires remains largely understudied as compared to the prediction of other forms of fires. However, peatland fires are unique among fires and therefore require datasets and architectures attuned to their particular characteristics. In this paper, we present a new dataset, PeatSet, designed specifically for the problem of peatland fire prediction. In addition, we propose several models to tackle the problem of fire prediction for the peatlands. We develop novel neural architectures for peatland fire prediction, PeatNet, and PT-Net, with a graph-based and a transformer-based architecture, respectively. Our results indicate that these new deep-learning architectures outperform a regression baseline from existing peatland research. Among all the tested models, PT-Net achieves the highest F1 score of 0.1006 and an overall accuracy of 99.84%. Authors: Shreya Bali (Carnegie Mellon University); Sydney Zheng (Carnegie Mellon University); Akshina Gupta (Carnegie Mellon University); Yue Wu (None); Blair Chen (Carnegie Mellon University); Anirban Chowdhury (Carnegie Mellon University); Justin Khim (Carnegie Mellon University) |
Disaster prediction, management, and relief Climate and Earth science Classification, regression, and supervised learning |
ICML 2021 |
Revealing the impact of global warming on climate modes using transparent machine learning and a suite of climate models
(Papers Track)
Abstract and authors: (click to expand)Abstract: The ocean is key to climate through its ability to store and transport heat and carbon. From studies of past climates, it is clear that the ocean can exhibit a range of dramatic variability that could have catastrophic impacts on society, such as changes in rainfall, severe weather, sea level rise and large scale climate patterns. The mechanisms of change remain obscure, but are explored using a transparent machine learning method, Tracking global Heating with Ocean Regimes (THOR) presented here. We investigate two future scenarios, one where CO2 is increased by 1% per year, and one where CO2 is abruptly quadrupled. THOR is engineered combining interpretable and explainable methods to reveal its source of predictive skill. At the core of THOR, is the identification of dynamically coherent regimes governing the circulation, a fundamental question within oceanography. Three key regions are investigated here. First, the North Atlantic circulation that delivers heat to the higher latitudes is seen to weaken and we identify associated dynamical changes. Second, the Southern Ocean circulation, the strongest circulation on earth, is seen to intensify where we reveal the implications for interactions with the ice on Antarctica. Third, shifts in ocean circulation regimes are identified in the tropical Pacific region, with potential impacts on the El Nino Southern Oscillation, Earth’s dominant source of year-to-year climate variations affecting weather extremes, ecosystems, agriculture, and fisheries. Together with revealing these climatically relevant ocean dynamics, THOR also constitutes a step towards trustworthy machine learning called for within oceanography and beyond because its predictions are physically tractable. We conclude with by highlighting open questions and potentially fruitful avenues of further machine learning applications to climate research. Authors: Maike Sonnewald (Princeton University); Redouane Lguensat (LSCE-IPSL); Aparna Radhakrishnan (Geophysical Fluid Dynamics Laboratory); Zoubero Sayibou (Bronx Community College); Venkatramani Balaji (Princeton University); Andrew Wittenberg (NOAA) |
Climate and Earth science Interpretable ML |
ICML 2021 |
Challenges in Applying Audio Classification Models to Datasets Containing Crucial Biodiversity Information
(Papers Track)
Abstract and authors: (click to expand)Abstract: The acoustic signature of a natural soundscape can reveal consequences of climate change on biodiversity. Hardware costs, human labor time, and expertise dedicated to labeling audio are impediments to conducting acoustic surveys across a representative portion of an ecosystem. These barriers are quickly eroding away with the advent of low-cost, easy to use, open source hardware and the expansion of the machine learning field providing pre-trained neural networks to test on retrieved acoustic data. One consistent challenge in passive acoustic monitoring (PAM) is a lack of reliability from neural networks on audio recordings collected in the field that contain crucial biodiversity information that otherwise show promising results from publicly available training and test sets. To demonstrate this challenge, we tested a hybrid recurrent neural network (RNN) and convolutional neural network (CNN) binary classifier trained for bird presence/absence on two Peruvian bird audiosets. The RNN achieved an area under the receiver operating characteristics (AUROC) of 95% on a dataset collected from Xeno-canto and Google’s AudioSet ontology in contrast to 65% across a stratified random sample of field recordings collected from the Madre de Dios region of the Peruvian Amazon. In an attempt to alleviate this discrepancy, we applied various audio data augmentation techniques in the network’s training process which led to an AUROC of 77% across the field recordings. Authors: Jacob G Ayers (UC San Diego); Yaman Jandali (University of California, San Diego); Yoo-Jin Hwang (Harvey Mudd College); Erika Joun (University of California, San Diego); Gabriel Steinberg (Binghampton University); Mathias Tobler (San Diego Zoo Wildlife Alliance); Ian Ingram (San Diego Zoo Wildlife Alliance); Ryan Kastner (University of California San Diego); Curt Schurgers (University of California San Diego) |
Ecosystems and natural systems Natural language processing |
ICML 2021 |
Learning Optimal Power Flow with Infeasibility Awareness
(Papers Track)
Abstract and authors: (click to expand)Abstract: Optimal power flow provides an energy-efficient operating point for power grids and therefore supports climate change mitigation. This function has to be run every few minutes day and night, thus a reliable and computationally efficient solution method is of vital importance. Deep learning seems a promising direction, and related works have emerged recently. However, considering feasible scenarios only during the learning process, existing works will mislead system operators in infeasible scenarios and pose a new threat to system resilience. Paying attention to infeasibility in the decision making process, this paper tackles this emerging threat with multi-task learning. Case studies on the IEEE test system validate the effectiveness of the proposed method. Authors: Gang Huang (Zhejiang Lab); Longfei Liao (Zhejiang Lab); Lechao Cheng (Zhejiang Lab); Wei Hua (Zhejiang Lab) |
Power and energy |
ICML 2021 |
Reconstructing Aerosol Vertical Profiles with Aggregate Output Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Aerosol-cloud interactions constitute the largest source of uncertainty in assessments of the anthropogenic climate change. This uncertainty arises in part from the inability to observe aerosol amounts at the cloud formation levels, and, more broadly, the vertical distribution of aerosols. Hence, we often have to settle for less informative two-dimensional proxies, i.e. vertically aggregated data. In this work, we formulate the problem of disaggregation of vertical profiles of aerosols. We propose some initial solutions for such aggregate output regression problem and demonstrate their potential on climate model data. Authors: Sofija Stefanovic (University of Oxford); Shahine Bouabid (University of Oxford); Philip Stier (University of Oxford); Athanasios Nenes (EPFL); Dino Sejdinovic (University of Oxford) |
Climate and Earth science Classification, regression, and supervised learning Unsupervised and semi-supervised learning |
ICML 2021 |
Self-Attentive Ensemble Transformer: Representing Ensemble Interactions in Neural Networks for Earth System Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Ensemble data from Earth system models has to be calibrated and post-processed. I propose a novel member-by-member post-processing approach with neural networks. I bridge ideas from ensemble data assimilation with self-attention, resulting into the self-attentive ensemble transformer. Here, interactions between ensemble members are represented as additive and dynamic self-attentive part. As proof-of-concept, I regress global ECMWF ensemble forecasts to 2-metre-temperature fields from the ERA5 reanalysis. I demonstrate that the ensemble transformer can calibrate the ensemble spread and extract additional information from the ensemble. As it is a member-by-member approach, the ensemble transformer directly outputs multivariate and spatially-coherent ensemble members. Therefore, self-attention and the transformer technique can be a missing piece for a non-parametric post-processing of ensemble data with neural networks. Authors: Tobias S Finn (Universität Hamburg) |
Climate and Earth science Classification, regression, and supervised learning |
ICML 2021 |
DeepOPF-NGT: A Fast Unsupervised Learning Approach for Solving AC-OPF Problems without Ground Truth
(Papers Track)
Abstract and authors: (click to expand)Abstract: AC optimal power flow (AC-OPF) problems need to be solved more frequently in the future to maintain the reliable and cost-effective operation of power systems. Recently, supervised-learning approaches have been developed to speed up the solving time of AC-OPF problems without incurring infeasibility or much optimality loss by learning the load-solution mapping embedded in the training dataset. However, it is non-trivial and computationally expensive to prepare the training dataset with single embedded mapping, due to that AC-OPF problems are non-convex and may admit multiple optimal solutions. In this paper, we develop an unsupervised learning approach (DeepOPF-NGT) for solving AC-OPF problems, which does not require training datasets with ground truth to operate. Instead, it uses a properly designed loss function to guide the tuning of the neural network parameters to directly learn one load-solution mapping. Preliminary results on the IEEE 30-bus test system show that the unsupervised DeepOPF-NGT approach can achieve comparable optimality, feasibility, and speedup performance against an existing supervised learning approach. Authors: Wanjun Huang (City University of Hong Kong); Minghua Chen (City University of Hong Kong) |
Power and energy Unsupervised and semi-supervised learning |
ICML 2021 |
Commercial Vehicle Traffic Detection from Satellite Imagery with Deep Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Road freight traffic is a major greenhouse gas emitter: commercial vehicles (CVs) contribute ∼7% to the global CO 2 emission budget, a fraction that is likely to increase in the future. The quantitative monitoring of CV traffic rates, while essential for the implementation of targeted road emission regulations, is costly and as such only available in developed regions. In this work, we investigate the feasibility of estimating hourly CV traffic rates from freely available Sentinel-2 satellite imagery. We train a modified Faster R-CNN object detection model to detect individual CVs in satellite images and feed the resulting counts into a regression model to predict hourly CV traffic rates. This architecture, when trained on ground-truth data for Switzerland, is able to estimate hourly CV traffic rates for any freeway section within 58% (MAPE) of the actual value; for freeway sections with historic information on CV traffic rates, we can predict hourly CV traffic rates up to within 4% (MAPE). We successfully apply our model to freeway sections in other coun tries and show-case its utility by quantifying the change in traffic patterns as a result of the first CoVID-19 lockdown in Switzerland. Our results show that it is possible to estimate hourly CV traffic rates from satellite images, which can guide civil engineers and policy makers, especially in developing countries, in monitoring and reducing greenhouse gas emissions from CV traffic. Authors: Moritz Blattner (University of St. Gallen); Michael Mommert (University of St. Gallen); Damian Borth (University of St. Gallen) |
Computer vision and remote sensing Transportation |
ICML 2021 |
Probabilistic Short-Term Low-Voltage Load Forecasting using Bernstein-Polynomial Normalizing Flows
(Papers Track)
Abstract and authors: (click to expand)Abstract: The transition to a fully renewable energy grid requires better forecasting of demand at the low-voltage level. However, high fluctuations and increasing electrification cause huge forecast errors with traditional point estimates. Probabilistic load forecasts take future uncertainties into account and thus enables various applications in low-carbon energy systems. We propose an approach for flexible conditional density forecasting of short-term load based on Bernstein-Polynomial Normalizing Flows where a neural network controls the parameters of the flow. In an empirical study with 363 smart meter customers, our density predictions compare favorably against Gaussian and Gaussian mixture densities and also outperform a non-parametric approach based on the pinball loss for 24h-ahead load forecasting for two different neural network architectures. Authors: Marcel Arpogaus (Konstanz University of Applied Sciences); Marcus Voß (Technische Universität Berlin (DAI-Labor)); Beate Sick (ZHAW and University of Zurich); Mark Nigge-Uricher (Bosch.IO GmbH); Oliver Dürr (Konstanz University of Applied Sciences) |
Power and energy Buildings and cities Industry Classification, regression, and supervised learning Generative modeling Interpretable ML Time-series analysis Uncertainty quantification and robustness |
ICML 2021 |
Guided A* Search for Scheduling Power Generation Under Uncertainty
(Papers Track)
Abstract and authors: (click to expand)Abstract: Increasing renewables penetration motivates the development of new approaches to operating power systems under uncertainty. We apply a novel approach combining self-play reinforcement learning (RL) and traditional planning to solve the unit commitment problem, an essential power systems scheduling task. Applied to problems with stochastic demand and wind generation, our results show significant cost reductions and improvements to security of supply as compared with an industry-standard mixed-integer linear programming benchmark. Applying a carbon price of \$50/tCO$_2$ achieves carbon emissions reductions of up to 10\%. Our results demonstrate scalability to larger problems than tackled in existing literature, and indicate the potential for RL to contribute to decarbonising power systems. Authors: Patrick de Mars (UCL); Aidan O'Sullivan (UCL) |
Power and energy Reinforcement learning and control |
ICML 2021 |
DroughtED: A dataset and methodology for drought forecasting spanning multiple climate zones
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change exacerbates the frequency, duration and extent of extreme weather events such as drought. Previous attempts to forecast drought conditions using machine learning have focused on regional models which have two major limitations for national drought management: (i) they are trained on localised climate data and (ii) their architectures prevent them from being applied to new heterogeneous regions. In this work, we present a new large-scale dataset for training machine learning models to forecast national drought conditions, named DroughtED. The dataset consists of globally available meteorological features widely used for drought prediction, paired with location meta-data which has not previously been utilised for drought forecasting. Here we also establish a baseline on DroughtED and present the first research to apply deep learning models - Long Short-Term Memory (LSTMs) and Transformers - to predict county-level drought conditions across the full extent of the United States. Our results indicate that DroughtED enables deep learning models to learn cross-region patterns in climate data that contribute to drought conditions and models trained on DroughtED compare favourably to state-of-the-art drought prediction models trained on individual regions. Authors: Christoph D Minixhofer (The University of Edinburgh); Mark Swan (The University of Edinburgh); Calum McMeekin (The University of Edinburgh); Pavlos Andreadis (The University of Edinburgh) |
Disaster prediction, management, and relief Agriculture, forestry and other land use Climate and Earth science Classification, regression, and supervised learning Data mining Meta- and transfer learning Time-series analysis |
ICML 2021 |
Estimation of Air Pollution with Remote Sensing Data: Revealing Greenhouse Gas Emissions from Space
(Papers Track)
Abstract and authors: (click to expand)Abstract: Air pollution is a major driver of climate change. Anthropogenic emissions from the burning of fossil fuels for transportation and power generation emit large amounts of problematic air pollutants, including Greenhouse Gases (GHGs). Despite the importance of limiting GHG emissions to mitigate climate change, detailed information about the spatial and temporal distribution of GHG and other air pollutants is difficult to obtain. Existing models for surface-level air pollution rely on extensive land-use datasets which are often locally restricted and temporally static. This work proposes a deep learning approach for the prediction of ambient air pollution that only relies on remote sensing data that is globally available and frequently updated. Combining optical satellite imagery with satellite-based atmospheric column density air pollution measurements enables the scaling of air pollution estimates (in this case NO2) to high spatial resolution (up to ~10m) at arbitrary locations and adds a temporal component to these estimates. The proposed model performs with high accuracy when evaluated against air quality measurements from ground stations (mean absolute error <6 microgram/m^3). Our results enable the identification and temporal monitoring of major sources of air pollution and GHGs. Authors: Linus M. Scheibenreif (University of St. Gallen); Michael Mommert (University of St. Gallen); Damian Borth (University of St. Gallen) |
Computer vision and remote sensing Climate and Earth science |
ICML 2021 |
Emulating Aerosol Microphysics with a Machine Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Aerosol particles play an important role in the climate system by absorbing and scattering radiation and influencing cloud properties. They are also one of the biggest sources of uncertainty for climate modeling. Many climate models do not include aerosols in sufficient detail. In order to achieve higher accuracy, aerosol microphysical properties and processes have to be accounted for. This is done in the ECHAM-HAM global climate aerosol model using the M7 microphysics model, but increased computational costs make it very expensive to run at higher resolutions or for a longer time. We aim to use machine learning to approximate the microphysics model at sufficient accuracy and reduce the computational cost by being fast at inference time. The original M7 model is used to generate data of input-output pairs to train a neural network on it. By using a special logarithmic transform we are able to learn the variables tendencies achieving an average $R^2$ score of $89\%$. On a GPU we achieve a speed-up of 120 compared to the original model. Authors: Paula Harder (Fraunhofer ITWM); Duncan Watson-Parris (University of Oxford); Dominik Strassel (Fraunhofer ITWM); Nicolas Gauger (TU Kaiserslautern); Philip Stier (University of Oxford); Janis Keuper (hs-offenburg) |
Climate and Earth science Classification, regression, and supervised learning |
ICML 2021 |
Automated Identification of Climate Risk Disclosures in Annual Corporate Reports
(Papers Track)
Abstract and authors: (click to expand)Abstract: It is important for policymakers to understand which financial policies are effective in increasing climate risk disclosure in corporate reporting. We use machine learning to automatically identify disclosures of five different types of climate-related risks. For this purpose, we have created a dataset of over 120 manually-annotated annual reports by European firms. Applying our approach to reporting of 337 firms over the last 20 years, we find that risk disclosure is increasing. Disclosure of transition risks grows more dynamically than physical risks, and there are marked differences across industries. Country-specific dynamics indicate that regulatory environments potentially have an important role to play for increasing disclosure. Authors: David Friederich (University of Bern); Lynn Kaack (ETH Zurich); Sasha Luccioni (Mila); Bjarne Steffen (ETH Zurich) |
Climate finance and economics Natural language processing |
ICML 2021 |
Forecasting Black Sigatoka Infection Risks with Latent Neural ODEs
(Papers Track)
Abstract and authors: (click to expand)Abstract: Black Sigatoka is the most widely-distributed and destructive disease affecting banana plants. Due to the heavy financial burden of managing this infectious disease, farmers in developing countries face significant banana crop losses. The spread of black Sigatoka is highly dependent on weather conditions and though scientists have produced mathematical models of infectious diseases, adapting these models to incorporate climate effects is difficult. We present MR. NODE (Multiple predictoR Neural ODE), a neural network that models the dynamics of black Sigatoka infection learnt directly from data via Neural Ordinary Differential Equations. Our method encodes external predictor factors into the latent space in addition to the variable that we infer, and it can also predict the infection risk at an arbitrary point in time. Empirically, we demonstrate on historical climate data that our method has superior generalization performance on time points up to one month in the future and unseen irregularities. We believe that our method can be a useful tool to control the spread of black Sigatoka. Authors: Yuchen Wang (University of Toronto); Matthieu Chan Chee (University of Toronto); Ziyad Edher (University of Toronto); Minh Duc Hoang (University of Toronto); Shion Fujimori (University of Toronto); Jesse Bettencourt (University of Toronto) |
Agriculture, forestry and other land use Time-series analysis |
ICML 2021 |
A Reinforcement Learning Approach to Home Energy Management for Modulating Heat Pumps and Photovoltaic Systems
(Papers Track)
Abstract and authors: (click to expand)Abstract: Efficient sector coupling in residential buildings plays a key role in supporting the energy transition. In this study, we analyze the potential of using reinforcement learning (RL) to control a home energy management system. We conduct this study by modeling a representative building with a modulating air-sourced heat pump, a photovoltaic system, a battery, and thermal storage systems for floor heating and hot-water supply. In our numerical analysis, we benchmark our reinforcement learning results using DDPG with the optimal solution generated with model predictive control using a mixed-integer linear model under full information. Our input data, models, and the RL environment, developed using the Julia programming language, will be available in an open-source manner. Authors: Lissy Langer (TU Berlin) |
Buildings and cities Reinforcement learning and control |
ICML 2021 |
Reinforcement Learning for Optimal Frequency Control: A Lyapunov Approach
(Papers Track)
Abstract and authors: (click to expand)Abstract: Renewable energy resources play a vital role in reducing carbon emissions and are becoming increasingly common in the grid. On one hand, they are challenging to integrate into a power system because the lack of rotating mechanical inertia can lead to frequency instabilities. On the other hand, these resources have power electronic interfaces that are capable of implementing almost arbitrary control laws. To design these controllers, reinforcement learning has emerged as a popular method to search for policy parameterized by neural networks. The key challenge with learning based approaches is enforcing the constraint that the learned controller need to be stabilizing. Through a Lyapunov function, we explicitly identify the structure of neural network-based controllers such that they guarantee system stability by design. A recurrent RL architecture is used to efficiently train the controllers and they outperform other approaches as demonstrated by simulations. Authors: Wenqi Cui (University of Washington); Baosen Zhang (University of Washington) |
Power and energy Reinforcement learning and control |
ICML 2021 |
Modeling Bird Migration by Disaggregating Population Level Observations
(Papers Track)
Abstract and authors: (click to expand)Abstract: Birds are shifting migratory routes and timing in response to climate change, but modeling migration to better understand these changes is difficult. Some recent work leverages fluid dynamics models, but this requires individual flight speed and directional data which may not be readily available. We developed an alternate modeling method which only requires population level positional data and use it to model migration routes of the American Woodcock (Scolopax minor). We use our model to sample simulated bird trajectories and compare them to real trajectories in order to evaluate the model. Authors: Miguel Fuentes (University of Massachusetts, Amherst); Benjamin Van Doren (Cornell University); Daniel Sheldon (University of Massachusetts, Amherst) |
Ecosystems and natural systems Other |
ICML 2021 |
Power Grid Cascading Failure Mitigation by Reinforcement Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: This paper proposes a cascading failure mitigation strategy based on Reinforcement Learning (RL) method. Firstly, the principles of RL are introduced. Then, the Multi-Stage Cascading Failure (MSCF) problem is formulated, and its challenges are investigated. The problem is then tackled by the RL based on DCOPF (Direct Current Optimal Power Flow). Designs of the RL framework (rewards, states, etc.) are illustrated in detail. Experiments on the IEEE 118-bus system by the proposed RL method demonstrate promising performance in reducing system collapses. Authors: Yongli Zhu (Texas A&M University) |
Disaster prediction, management, and relief Power and energy |
ICML 2021 |
Decadal Forecasts with ResDMD: a residual DMD neural network
(Papers Track)
Abstract and authors: (click to expand)Abstract: Significant investment is being made by operational forecasting centers to produce decadal (1-10 year) forecasts that can support long-term decision making for a more climate-resilient society. One method that has been employed for this task is the Dynamic Mode Decomposition (DMD) algorithm – also known as the Linear Inverse Model– which is used to fit linear dynamical models to data. While the DMD usually approximates non-linear terms in the true dynamics as a linear system with random noise, we investigate an extension to the DMD to explicitly represent the non-linear terms as a neural network. Our weight initialization allows the network to produce sensible results before training and then improve the prediction after training as data becomes available. In this short paper, we evaluate the proposed architecture for simulating global sea surface temperatures and compare the results with the standard DMD and seasonal forecasts produced by the state-of-the-art dynamical model, CFSv2. Authors: EDUARDO ROCHA RODRIGUES (IBM Research); Campbell Watson (IBM Reserch); Bianca Zadrozny (IBM Research); David Gold (IBM Global Business Services) |
Classification, regression, and supervised learning Climate and Earth science Time-series analysis |
ICML 2021 |
TweetDrought: A Deep-Learning Drought Impacts Recognizer based on Twitter Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Acquiring a better understanding of drought impacts becomes increasingly vital under a warming climate. Traditional drought indices describe mainly biophysical variables and not impacts on social, economic, and environmental systems. We utilized natural language processing and bidirectional encoder representation from Transformers (BERT) based transfer learning to fine-tune the model on the data from the news-based Drought Impact Report (DIR) and then apply it to recognize seven types of drought impacts based on the filtered Twitter data from the United States. Our model achieved a satisfying macro-F1 score of 0.89 on the DIR test set. The model was then applied to California tweets and validated with keyword-based labels. The macro-F1 score was 0.58. However, due to the limitation of keywords, we also spot-checked tweets with controversial labels. 83.5% of BERT labels were correct compared to the keyword labels. Overall, the fine-tuned BERT-based recognizer provided proper predictions and valuable information on drought impacts. The interpretation and analysis of the model were consistent with experiential domain expertise. Authors: Beichen Zhang (University of Nebraska-Lincoln); Frank Schilder (Thomson Reuters); Kelly Smith (National Drought Mitigation Center); Michael Hayes (University of Nebraska-Lincoln); Sherri Harms (University of Nebraska-Kearney); Tsegaye Tadesse (University of Nebraska-Lincoln) |
Disaster prediction, management, and relief Natural language processing |
ICML 2021 |
Graph Neural Networks for Learning Real-Time Prices in Electricity Market
(Papers Track)
Abstract and authors: (click to expand)Abstract: Solving the optimal power flow (OPF) problem in real-time electricity market improves the efficiency and reliability in the integration of low-carbon energy resources into the power grids. To address the scalability and adaptivity issues of existing end-to-end OPF learning solutions, we propose a new graph neural network (GNN) framework for predicting the electricity market prices from solving OPFs. The proposed GNN-for-OPF framework innovatively exploits the locality property of prices and introduces physics-aware regularization, while attaining reduced model complexity and fast adaptivity to varying grid topology. Numerical tests have validated the learning efficiency and adaptivity improvements of our proposed method over existing approaches. Authors: Shaohui Liu (University of Texas at Austin); Chengyang Wu (University of Texas at Austin); Hao Zhu (University of Texas at Austin) |
Power and energy Classification, regression, and supervised learning |
ICML 2021 |
Learning Granger Causal Feature Representations
(Papers Track)
Abstract and authors: (click to expand)Abstract: Tackling climate change needs to understand the complex phenomena occurring on the Planet. Discovering teleconnection patterns is an essential part of the endeavor. Events like El Niño Southern Oscillation (ENSO) impact essential climate variables at large distances, and influence the underlying Earth system dynamics. However, their automatic identification from the wealth of observational data is still unresolved. Nonlinearities, nonstationarities and the (ab)use of correlation analyses hamper the discovery of true causal patterns. We here introduce a deep learning methodology that extracts nonlinear latent functions from spatio-temporal Earth data and that are Granger causal with the index altogether. We illustrate its use to study the impact of ENSO on vegetation, which allows for a more rigorous study of impacts on ecosystems globally. Authors: Gherardo Varando (Universitat de València); Miguel-Ángel Fernández-Torres (Universitat de València); Gustau Camps-Valls (Universitat de València) |
Climate and Earth science Interpretable ML Unsupervised and semi-supervised learning |
ICML 2021 |
DeepPolicyTracker: Tracking Changes In Environmental Policy In The Brazilian Federal Official Gazette With Deep Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Even though most of its energy generation comes from renewable sources, Brazil is one of the largest emitters of greenhouse gases in the world, due to intense farming and deforestation of biomes, such as the Amazon Rainforest, whose preservation is essential for compliance with the Paris Agreement. Still, regardless of lobbies or prevailing political orientation, all government legal actions are published daily in the Federal Official Gazette. However, with hundreds of decrees issued every day by the authorities, it is absolutely burdensome to manually analyze all these processes and find out which ones can pose serious environmental hazards. In this paper, we propose the DeepPolicyTracker, a promising deep learning model that uses a state-of-the-art pre-trained natural language model to classify government acts and track harmful changes in the environmental policies. We also provide the used dataset annotated by domain experts and show some results already obtained. In the future, this system should serve to scale up the high-quality tracking of all oficial documents with a minimum of human supervision and contribute to increasing society's awareness of every government action. Authors: Flávio N Cação (University of Sao Paulo); Anna Helena Reali Costa (Universidade de São Paulo); Natalie Unterstell (Política por Inteiro); Liuca Yonaha (Política por Inteiro); Taciana Stec (Política por Inteiro); Fábio Ishisaki (Política por Inteiro) |
Natural language processing Agriculture, forestry and other land use Classification, regression, and supervised learning |
ICML 2021 |
Fast-Slow Streamflow Model Using Mass-Conserving LSTM
(Papers Track)
Abstract and authors: (click to expand)Abstract: Streamflow forecasting is key to effectively managing water resources and preparing for the occurrence of natural calamities being exacerbated by climate change. Here we use the concept of fast and slow flow components to create a new mass-conserving Long Short-Term Memory (LSTM) neural network model. It uses hydrometeorological time series and catchment attributes to predict daily river discharges. Preliminary results evidence improvement in skills for different scores compared to the recent literature. Authors: Miguel Paredes Quinones (IBM Research); Maciel Zortea (IBM Research); Leonardo Martins (IBM Research) |
Climate and Earth science Time-series analysis |
ICML 2021 |
Attention For Damage Assessment
(Papers Track)
Abstract and authors: (click to expand)Abstract: Due to climate change the hurricanes are getting stronger and having longer impacts. To reduce the detrimental effects of these hurricanes faster and accurate assessments of damages are essential to the rescue teams. Like other computer vision techniques semantic segmentation can identify the damages and help in proper and prompt damage assessment. Current segmentation methods can be classified into attention and non-attention based methods. Existing non-attention based methods suffers from low accuracy and therefore attention based methods are becoming popular. Self-attention based methods can map the mutual relationship and dependencies among pixels of an image and thus improve semantic segmentation accuracy. In this paper, we present a self-attention semantic segmentation method on UAV imageries to assess the damages inflicted by a natural disaster. The proposed method outperforms four state-of-art segmentation methods both quantitatively and qualitatively with a mean IoU score of 84.03 %. Authors: Tashnim Chowdhury (University of Maryland Baltimore County); Maryam Rahnemoonfar (University of Maryland Baltimore County) |
Disaster prediction, management, and relief Buildings and cities Computer vision and remote sensing |
ICML 2021 |
Online LSTM Framework for Hurricane Trajectory Prediction
(Papers Track)
Abstract and authors: (click to expand)Abstract: Hurricanes are high-intensity tropical cyclones that can cause severe damages when the storms make landfall. Accurate long-range prediction of hurricane trajectories is an important but challenging problem due to the complex interactions between the ocean and atmosphere systems. In this paper, we present a deep learning framework for hurricane trajectory forecasting by leveraging the outputs from an ensemble of dynamical (physical) models. The proposed framework employs a temporal decay memory unit for imputing missing values in the ensemble member outputs, coupled with an LSTM architecture for dynamic path prediction. The framework is extended to an online learning setting to capture concept drift present in the data. Empirical results suggest that the proposed framework significantly outperforms various baselines including the official forecasts from U.S. National Hurricane Center (NHC). Authors: Ding Wang (Michigan State University); Pang-Ning Tan (MSU) |
Disaster prediction, management, and relief Data mining |
ICML 2021 |
Controlling Weather Field Synthesis Using Variational Autoencoders
(Papers Track)
Abstract and authors: (click to expand)Abstract: One of the consequences of climate change is an observed increase in the frequency of extreme climate events. That poses a challenge for weather forecast and generation algorithms, which learn from historical data but should embed an often uncertain bias to create correct scenarios. This paper investigates how mapping climate data to a known distribution using variational autoencoders might help explore such biases and control the synthesis of weather fields towards more extreme climate scenarios. We experimented using a monsoon-affected precipitation dataset from southwest India, which should give a roughly stable pattern of rainy days and ease our investigation. We report compelling results showing that mapping complex weather data to a known distribution implements an efficient control for weather field synthesis towards more (or less) extreme scenarios. Authors: Dario Augusto Borges Oliveira (IBM Research); Jorge Luis Guevara Diaz (IBM Research); Bianca Zadrozny (IBM Research); Campbell Watson (IBM Reserch) |
Climate and Earth science Generative modeling |
ICML 2021 |
ForestViT: A Vision Transformer Network for Convolution-free Multi-label Image Classification in Deforestation Analysis
(Papers Track)
Abstract and authors: (click to expand)Abstract: Understanding the dynamics of deforestation as well as land uses of neighboring areas is of vital importance for the design and development of appropriate forest conservation and management policies. In this paper, we approach deforestation as a multi-label classification problem in an endeavor to capture the various relevant land uses from satellite images. To this end, we propose a multi-label vision transformer model, ForestViT, which leverages the benefits of self-attention mechanism, obviating any convolution operations involved in commonly used deep learning models utilized for deforestation detection. Authors: Maria Kaselimi (National Technical University of Athens); Athanasios Voulodimos (University of West Attica); Ioannis Daskalopoulos (University of West Attica); Nikolaos Doulamis (National Technical University of Athens); Anastasios Doulamis (Technical University of Crete) |
Agriculture, forestry and other land use Computer vision and remote sensing |
ICML 2021 |
Reducing Carbon in the Design of Large Infrastructure Scheme with Evolutionary Algorithms
(Papers Track)
Abstract and authors: (click to expand)Abstract: The construction and operations of large infrastructure schemes such as railways, roads, pipelines and power lines account for a significant proportion of global carbon emissions. Opportunities to reduce the embodied and operational carbon emissions of new infrastructure schemes are greatest during the design phase. However, schedule and cost constraints limit designers from assessing a large number of design options in detail to identify the solution with the lowest lifetime carbon emissions using conventional methods. Here, we develop an evolutionary algorithm to rapidly evaluate in detail the lifetime carbon emissions of thousands of possible design options for new water transmission pipeline schemes. Our results show that this approach can help designers in some cases to identify design solutions with more than 10% lower operational carbon emissions compared with conventional methods, saving more than 1 million tonnes in lifetime carbon emissions for a new water transmission pipeline scheme. We also find that this evolutionary algorithm can be applied to design other types of infrastructure schemes such as non-water pipelines, railways, roads and power lines. Authors: Matt Blythe (Continuum Industries) |
Buildings and cities Industry Power and energy Transportation |
ICML 2021 |
An Accurate and Scalable Subseasonal Forecasting Toolkit for the United States
(Papers Track)
Abstract and authors: (click to expand)Abstract: We develop a subseasonal forecasting toolkit of accurate and highly scalable benchmarks that outperform both the United States operational Climate Forecasting System (CFSv2) and state-of-the-art learning methods from the literature. Our new learned benchmarks include (a) Climatology++, an enhanced form of climatology using knowledge of only the day of the year; (b) CFSv2++, a learned correction for CFSv2; and (c) Persistence++, an augmented persistence model that combines lagged measurements with CFSv2forecasts. These methods alone improve upon CFSv2 accuracy by 9% for US precipitation and 6% for US temperature over 2011-2020. Ensembling our benchmarks with diverse forecasting methods leads to even further gains. Overall, we find that augmenting classical forecasting approaches with learned corrections yields an effective, low-cost strategy for building next-generation subseasonal forecasting models. Authors: Soukayna Mouatadid (University of Toronto); Paulo Orenstein (IMPA); Genevieve E Flaspohler (MIT); Miruna Oprescu (Microsoft Research); Judah Cohen (AER); Franklyn Wang (Harvard); Sean Knight (MIT); Ernest Fraenkel (MIT); Lester Mackey (Microsoft Research) |
Classification, regression, and supervised learning |
ICML 2021 |
Self-supervised Contrastive Learning for Irrigation Detection in Satellite Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change has caused reductions in river runoffs and aquifer recharge resulting in an increasingly unsustainable crop water demand from reduced freshwater availability. Achieving food security while deploying water in a sustainable manner will continue to be a major challenge necessitating careful monitoring and tracking of agricultural water usage. Historically, monitoring water usage has been a slow and expensive manual process with many imperfections and abuses. Ma-chine learning and remote sensing developments have increased the ability to automatically monitor irrigation patterns, but existing techniques often require curated and labelled irrigation data, which are expensive and time consuming to obtain and may not exist for impactful areas such as developing countries. In this paper, we explore an end-to-end real world application of irrigation detection with uncurated and unlabeled satellite imagery. We apply state-of-the-art self-supervised deep learning techniques to optical remote sensing data, and find that we are able to detect irrigation with up to nine times better precision, 90% better recall and 40% more generalization ability than the traditional supervised learning methods. Authors: Chitra Agastya (UC Berkeley, IBM); Sirak Ghebremusse (UC Berkeley); Ian Anderson (UC Berkeley); Colorado Reed (UC Berkeley); Hossein Vahabi (University California Berkeley); Alberto Todeschini (UC Berkeley) |
Unsupervised and semi-supervised learning Agriculture, forestry and other land use Climate and Earth science Classification, regression, and supervised learning Computer vision and remote sensing Meta- and transfer learning |
ICML 2021 |
Extreme Precipitation Seasonal Forecast Using a Transformer Neural Network
(Papers Track)
Abstract and authors: (click to expand)Abstract: An impact of climate change is the increase in frequency and intensity of extreme precipitation events. However, confidently predicting the likelihood of extreme precipitation at seasonal scales remains an outstanding challenge. Here, we present an approach to forecasting the quantiles of the maximum daily precipitation in each week up to six months ahead using the temporal fusion transformer (TFT) model. Through experiments in two regions, we compare TFT predictions with those of two baselines: climatology and a calibrated ECMWF SEAS5 ensemble forecast (S5). Our results show that, in terms of quantile risk at six month lead time, the TFT predictions significantly outperform those from S5 and show an overall small improvement compared to climatology. The TFT also responds positively to departures from normal that climatology cannot. Authors: Daniel Salles Civitarese (IBM Research, Brazil); Daniela Szwarcman (IBM Research); Bianca Zadrozny (IBM Research); Campbell Watson (IBM Reserch) |
Disaster prediction, management, and relief Climate and Earth science Time-series analysis |
ICML 2021 |
BERT Classification of Paris Agreement Climate Action Plans
(Papers Track)
Abstract and authors: (click to expand)Abstract: As the volume of text-based information on climate policy increases, natural language processing (NLP) tools can distill information from text to better inform decision making on climate policy. We investigate how large pretrained transformers based on the BERT architecture classify sentences on a dataset of climate action plans which countries submitted to the United Nations following the 2015 Paris Agreement. We use the document header structure to assign noisy policy-relevant labels such as mitigation, adaptation, energy, and land use to text elements. Our models provide an improvement in out-of-sample classification over simple heuristics though fall short of the consistency observed between human annotators. We hope to extend this framework to a wider class of textual climate change data such as climate legislation and corporate social responsibility filings and build tools to streamline the extraction of information from these documents for climate change researchers. Authors: Tom Corringham (Scripps Institution of Oceanography); Daniel Spokoyny (Carnegie Mellon University); Eric Xiao (University of California San Diego); Christopher Cha (University of California San Diego); Colin Lemarchand (University of California San Diego); Mandeep Syal (University of California San Diego); Ethan Olson (University of California San Diego); Alexander Gershunov (Scripps Institution of Oceanography) |
Natural language processing Climate policy |
ICML 2021 |
Quantification of Carbon Sequestration in Urban Forests
(Papers Track)
Abstract and authors: (click to expand)Abstract: Vegetation, trees in particular, sequester carbon by absorbing carbon dioxide from the atmosphere, however, the lack of efficient quantification methods of carbon stored in trees renders it difficult to track the process. Here we present an approach to estimate the carbon storage in trees based on fusing multispectral aerial imagery and LiDAR data to identify tree coverage, geometric shape, and tree species, which are crucial attributes in carbon storage quantification. We demonstrate that tree species information and their three-dimensional geometric shapes can be estimated from remote imagery in order to calculate the tree's biomass. Specifically, for Manhattan, New York City, we estimate a total of 52,000 tons of carbon sequestered in trees. Authors: Levente Klein (IBM Research); Wang Zhou (IBM Research); Conrad M Albrecht (IBM Research) |
Carbon capture and sequestration Climate and Earth science Agriculture, forestry and other land use |
ICML 2021 |
A comparative study of stochastic and deep generative models for multisite precipitation synthesis
(Papers Track)
Abstract and authors: (click to expand)Abstract: Future climate change scenarios are usually hypothesized using simulations from weather generators. However, there only a few works comparing and evaluating multisite weather generators, and there is no existing work contrasting promising deep learning approaches for weather generation against classical stochastic weather generators. This study shows preliminary results evaluating stochastic weather generators and deep generative models for multisite precipitation synthesis. Using a variety of metrics, we compare two open source weather generators: XWeathergen (an extension of the Weathergen library) and RGeneratePrec, and two deep generative models: GAN and VAE. Our preliminary results can serve as a guide for improving the design of deep learning architectures and algorithms for the multisite precipitation synthesis task. Authors: Jorge Luis Guevara Diaz (IBM Research); Dario Augusto Borges Oliveira (IBM Research); Bianca Zadrozny (IBM Research); Campbell Watson (IBM Reserch) |
Generative modeling Other |
ICML 2021 |
Self-Supervised Learning on Multispectral Satellite Data for Near-Term Solar Forecasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: With the unprecedented increase in distributed photovoltaic (PV) capacity across the globe, there is an increasing need for reliable and accurate forecasting of solar power generation. While PV output is affected by many factors, the atmosphere, i.e., cloud cover, plays a dominant role in determining the amount of downwelling solar irradiance that reaches PV modules. This paper demonstrates that self-supervised learning of multispectral satellite data from the recently launched GOES-R series of satellites can improve near-term (15 minutes) solar forecasting. We develop deep auto-regressive models using convolutional neural networks (CNN) and long short-term memory networks (LSTM) that are globally trained across many solar sites on the raw spatio-temporal data from GOES-R satellites. This self-supervised model provides estimates of future solar irradiance that can be fed directly to a regression model trained on smaller site-specific solar data to provide near-term solar PV forecasts at the site. The regression implicitly models site-specific characteristics, such as capacity, panel tilt, orientation, etc, while the self-supervised CNN-LSTM implicitly captures global atmospheric patterns affecting a site's solar irradiation. Results across 25 solar sites show the utility of such self-supervised modeling by providing accurate near-term forecast with errors close to that of a model using current ground-truth observations. Authors: Akansha Singh Bansal (University of Massachusetts Amherst); Trapit Bansal (University of Massachusetts Amherst); David Irwin (University of Massachusetts Amherst) |
Unsupervised and semi-supervised learning Other Time-series analysis |
ICML 2021 |
Refining Ice Layer Tracking through Wavelet combined Neural Networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: Rise in global temperatures is resulting in polar ice caps to melt away, which can lead to drastic sea level rise and coastal floods. Accurate calculation of the ice cap reduction is necessary in order to project its climatic impact. Ice sheets are monitored through Snow Radar sensors which give noisy profiles of subsurface ice layers. The sensors take snapshots of the entire ice sheet regularly, and thus result in large datasets. In this work, we use convolutional neural networks (CNNs) for their property of feature extraction and generalizability on large datasets. We also use wavelet transforms and embed them as a layer in the architecture to help in denoising the radar images and refine ice layer detection. Our results show that incorporating wavelets in CNNs helps in detecting the position of deep subsurface ice layers, which can be used to analyse their change overtime. Authors: Debvrat Varshney (University of Maryland Baltimore County); Masoud Yari (College of Engineering and Information Technology, University of Maryland Balitimore County); Tashnim Chowdhury (University of Maryland Baltimore County); Maryam Rahnemoonfar (University of Maryland Baltimore County) |
Computer vision and remote sensing Climate and Earth science |
ICML 2021 |
Forecasting Sea Ice Concentrations using Attention-based Ensemble LSTM
(Papers Track)
Abstract and authors: (click to expand)Abstract: Accurately forecasting Arctic sea ice from sub-seasonal to seasonal scales has been a major scientific effort with fundamental challenges at play. In addition to physics-based earth system models, researchers have been applying multiple statistical and machine learning models for sea ice forecasting. Looking at the potential of data-driven sea ice forecasting, we propose an attention-based Long Short Term Memory (LSTM) ensemble method to predict monthly sea ice extent up to 1 month ahead. Using daily and monthly satellite retrieved sea ice data from NSIDC and atmospheric and oceanic variables from ERA5 reanalysis product for 39 years, we show that our multi-temporal ensemble method outperforms several baseline and recently proposed deep learning models. This will substantially improve our ability in predicting future Arctic sea ice changes, which is fundamental for forecasting transporting routes, resource development, coastal erosion, threats to Arctic coastal communities and wildlife. Authors: Sahara Ali (University of Maryland, Baltimore County); Yiyi Huang (University of Maryland, Baltimore County); Xin Huang (University of Maryland, Baltimore County); Jianwu Wang (University of Maryland, Baltimore County) |
Climate and Earth science Ecosystems and natural systems Classification, regression, and supervised learning Time-series analysis |
ICML 2021 |
Toward efficient calibration of higher-resolution Earth System Models
(Papers Track)
Best Paper: Pathway to Impact
Abstract and authors: (click to expand)Abstract: Projections of future climate change to support decision-making require high spatial resolution, but this is computationally prohibitive with modern Earth system models (ESMs). A major challenge is the calibration (parameter tuning) process, which requires running large numbers of simulations to identify the optimal parameter values. Here we train a convolutional neural network (CNN) on simulations from two lower-resolution (and thus much less expensive) versions of the same ESM, and a smaller number of higher-resolution simulations. Cross-validated results show that the CNN's skill exceeds that of a climatological baseline for most variables with as few as 5-10 examples of the higher-resolution ESM, and for all variables (including precipitation) with at least 20 examples. This proof-of-concept study offers the prospect of significantly more efficient calibration of ESMs, by reducing the required CPU time for calibration by 20-40 %. Authors: Christopher Fletcher (University of Waterloo); William McNally (University of Waterloo); John Virgin (University of Waterloo) |
Generative modeling Climate and Earth science |
ICML 2021 |
Visual Question Answering: A Deep Interactive Framework for Post-Disaster Management and Damage Assessment
(Papers Track)
Abstract and authors: (click to expand)Abstract: Each natural disaster has left a trail of destruction and damage, which must be managed very effectively to reduce the disaster's impact. Lack of proper decision making in post-disaster managerial level can increase human suffering and waste a great amount of money. Our objective is to incorporate a deep interactive approach in the decision-making system especially in a rescue mission after any natural disaster for the systematic distribution of the limited resources and accelerating the recovery process. We believe that Visual Question Answering (VQA) is the finest way to address this issue. In visual question answering, a query-based answer regarding the situation of the affected area can add value to the decision-making system. Our main purpose of this study is to develop a Visual Question Answering model for post-disaster damage assessment purposes. With this aim, we collect the images by UAV (Unmanned Aerial Vehicle) after Hurricane Harvey and develop a dataset that includes the questions that are very important in the decision support system after a natural disaster. In addition, We propose a supervised attention-based approach in the modeling segment. We compare our approach with the two other baseline attention-based VQA algorithms namely Multimodal Factorized Bilinear (MFB) and Stacked Attention Network (SAN). Our approach outperforms in providing answers for several types of queries including simple counting, complex counting compares to the baseline models. Authors: Argho Sarkar (University of Maryland, Baltimore County); Maryam Rahnemoonfar (University of Maryland Baltimore County) |
Disaster prediction, management, and relief Computer vision and remote sensing |
ICML 2021 |
Designing Bounded min-knapsack Bandits algorithm for Sustainable Demand Response
(Papers Track)
Abstract and authors: (click to expand)Abstract: Around 40% of global energy produced is consumed by buildings. By using renewable energy resources we can alleviate the dependence on electrical grids. Recent trends focus on incentivizing consumers to reduce their demand consumption during peak hours for sustainable demand response. To minimize the loss, the distributor companies should target the right set of consumers and demand the right amount of electricity reductions. This paper proposes a novel bounded integer min-knapsack algorithm and shows that the algorithm, while allowing for multiple unit reduction, also optimizes the loss to the distributor company within a factor of two (multiplicative) and a problem-dependent additive constant. Existing CMAB algorithms fail to work in this setting due to non-monotonicity of reward function and time-varying optimal sets. We propose a novel algorithm Twin-MinKPDR-CB to learn these compliance probabilities efficiently. Twin-MinKPDR-CB works for non-monotone reward functions bounded min-knapsack constraints and time-varying optimal sets. We find that Twin-MinKPDR-CB achieves sub-linear regret of O(log T) with T being the number of rounds demand response is run. Authors: Akansha Singh (Indian Institute of Technology, Ropar); Meghana Reddy (Indian Institute of Technology, Ropar); Zoltan Nagy (University of Texas); Sujit P. Gujar (Machine Learning Laboratory, International Institute of Information Technology, Hyderabad); Shweta Jain (Indian Institute of Technology Ropar) |
Power and energy Buildings and cities Reinforcement learning and control Uncertainty quantification and robustness |
ICML 2021 |
Sky Image Prediction Using Generative Adversarial Networks for Solar Forecasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: Large-scale integration of solar photovoltaics (PV) is challenged by high variability in its power output, mainly due to local and short-term cloud events. To achieve accurate solar forecasting, it is paramount to accurately predict the movement of clouds. Here, we use generative adversarial networks (GANs) to predict future sky images based on past sky image sequences and show that our trained model can generate realistic future sky images and capture the dynamics of clouds in the context frames. The generated images are then evaluated for a downstream solar forecasting task; results show promising performance. Authors: Yuhao Nie (Stanford University); Andea Scott (Stanford University); Eric Zelikman (Stanford University); Adam Brandt (Stanford University) |
Generative modeling Climate and Earth science Computer vision and remote sensing |
ICML 2021 |
EVGen: Adversarial Networks for Learning Electric Vehicle Charging Loads and Hidden Representations
(Papers Track)
Abstract and authors: (click to expand)Abstract: The nexus between transportation, the power grid, and consumer behavior is much more pronounced than ever before as the race to decarbonize intensifies. Electrification in the transportation sector has led to technology shifts and rapid deployment of electric vehicles (EVs). The potential increase in stochastic and spatially heterogeneous charging load presents a unique challenge that is not well studied, and will have significant impacts on grid operations, emissions, and system reliability if not managed effectively. Realistic scenario generators can help operators prepare, and machine learning can be leveraged to this end. In this work, we develop generative adversarial networks (GANs) to learn distributions of electric vehicle (EV) charging sessions and disentangled representations. We show that this model successfully parameterizes unlabeled temporal and power patterns and is able to generate synthetic data conditioned on these patterns. We benchmark the generation capability of this model with Gaussian Mixture Models (GMMs), and empirically show that our proposed model framework is better at capturing charging distributions and temporal dynamics. Authors: Robert Buechler (Stanford University); Emmanuel O Balogun (Stanford University); Arun Majumdar (Stanford University); Ram Rajagopal (Stanford University) |
Power and energy Transportation Generative modeling Unsupervised and semi-supervised learning |
ICML 2021 |
Reconstruction of Long-Term Historical Electricity Demand Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Long-term planning of a robust power system requires the understanding of changing demand patterns. Electricity demand is highly weather sensitive. Thus, the supply side variation from introducing intermittent renewable sources, juxtaposed with variable demand, will introduce additional challenges in the grid planning process. By understanding the spatial and temporal variability of temperature over the US, the response of demand to natural variability and climate change-related effects on temperature can be separated, especially because the effects due to the former factor are not known. Through this project, we aim to better support the technology& policy development process for power systems by developing machine and deep learning ’back-forecasting’ models to reconstruct multidecadaldemand records and study the natural variabilityof temperature and its influence on demand. Authors: Reshmi Ghosh (Carnegie Mellon University); Michael Craig (University of Michigan); H.Scott Matthews (Carnegie Mellon University); Laure Berti-Equille (IRD) |
Classification, regression, and supervised learning Deep Learning |
ICML 2021 |
A Set-Theoretic Approach to Safe Reinforcement Learning in Power Systems
(Papers Track)
Abstract and authors: (click to expand)Abstract: Reducing the carbon footprint of the energy sector will be a vital part of the fight against climate change, and doing so will require the widespread adoption of renewable energy resources. Optimally integrating a large number of these resources requires new control techniques that can both compensate for the variability of renewables and satisfy hard engineering constraints. Reinforcement learning (RL) is a promising approach to data-driven control, but it is difficult to verify that the policies derived from data will be safe. In this paper, we combine RL with set-theoretic control to propose a computationally efficient approach to safe RL. We demonstrate the method on a simplified power system model and compare it with other RL techniques. Authors: Daniel Tabas (University of Washington); Baosen Zhang (University of Washington) |
Reinforcement learning and control Power and energy |
ICML 2021 |
A study of battery SoC scheduling using machine learning with renewable sources
(Papers Track)
Abstract and authors: (click to expand)Abstract: An open energy system (OES) enables the shared distribution of energy resources within a community autonomously and efficiently. For this distributed system a rooftop solar panel and a battery are installed in each house of the community. The OES system monitors the State of Charge (SoC) of each battery independently, arbitrates energy-exchange requests from each house, and physically controls peer-to-peer energy exchanges. In this study, our goal is to optimize those energy exchanges to maximize the renewable energy penetration within the community using machine learning techniques. Future household electricity consumption is predicted using machine learning from the past time series. The predicted consumption is used to determine the next energy-exchange strategy, i.e. when and how much energy should be exchanged to minimize the surplus of solar energy. The simulation results show that the proposed method can increase the amount of renewable energy penetration within the community. Authors: Daisuke Kawamoto (Sony Computer Science Laboratories, Inc.); Gopinath Rajendiran (CSIR Central Scientific Instruments Organisation, Chennai) |
Power and energy Time-series analysis |
ICML 2021 |
Multivariate climate downscaling with latent neural processes
(Papers Track)
Abstract and authors: (click to expand)Abstract: Statistical downscaling is a vital tool in generating high resolution projections for climate impact studies. This study applies convolutional latent neural processes to multivariate downscaling of maximum temperature and precipitation. In contrast to existing downscaling methods, this model is shown to produce spatially coherent predictions at arbitrary locations specified at test time, regardless of whether training data are available at these points. Authors: Anna Vaughan (Univeristy of Cambridge); Nic Lane (University of Cambridge); Michael Herzog (University of Cambridge) |
Climate and Earth science Causal and Bayesian methods |
ICML 2021 |
FIRE-ML: A Remotely-sensed Daily Wildfire Forecasting Dataset for the Contiguous United States
(Papers Track)
Abstract and authors: (click to expand)Abstract: Wildfires are natural phenomena that can have devastating effects on ecosystems, urban developments, and the environment. Improving the scientific understanding of these events and the ability to forecast how they will evolve in the short- and long-term are ongoing multi-decadal challenges. We present a large-scale dataset, well-suited to machine learning, that aggregates and aligns multiple remotely-sensed and forecasted data products to provide a holistic set of features for forecasting wildfires on daily timescales. This dataset includes 4.2 million unique active fire detections, covers the majority of the contiguous United States from 2012 to 2020, and includes active fire detections, land cover, topography, and meteorology. Authors: Casey A Graff (UC Irvine) |
Computer vision and remote sensing Agriculture, forestry and other land use Classification, regression, and supervised learning |
ICML 2021 |
IowaRain: A Statewide Rain Event Dataset Based on Weather Radars and Quantitative Precipitation Estimation
(Papers Track)
Abstract and authors: (click to expand)Abstract: Effective environmental planning and management to address climate change could be achieved through extensive environmental modeling with machine learning and conventional physical models. In order to develop and improve these models, practitioners and researchers need comprehensive benchmark datasets that are prepared and processed with environmental expertise that they can rely on. This study presents an extensive dataset of rainfall events for the state of Iowa (2016-2019) acquired from the National Weather Service Next Generation Weather Radar (NEXRAD) system and processed by a quantitative precipitation estimation system. The dataset presented in this study could be used for better disaster monitoring, response and recovery by paving the way for both predictive and prescriptive modeling. Authors: Muhammed A Sit (The University of Iowa); Bongchul Seo (IIHR—Hydroscience & Engineering, The University of Iowa); Ibrahim Demir (The University of Iowa) |
Data mining Benchmark Datasets |
ICML 2021 |
Short-term Hourly Streamflow Prediction with Graph Convolutional GRU Networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: The frequency and impact of floods are expected to increase due to climate change. It is crucial to predict streamflow, consequently flooding, in order to prepare and mitigate its consequences in terms of property damage and fatalities. This paper presents a Graph Convolutional GRUs based model to predict the next 36 hours of streamflow for a sensor location using the upstream river network. As shown in experiment results, the model presented in this study provides better performance than the persistence baseline and a Convolutional Bidirectional GRU network for the selected study area in short-term streamflow prediction. Authors: Muhammed A Sit (The University of Iowa); Bekir Demiray (The University of Iowa); Ibrahim Demir (The University of Iowa) |
Disaster prediction, management, and relief Climate and Earth science Computer vision and remote sensing Interpretable ML Time-series analysis |
ICML 2021 |
Wildfire Smoke Plume Segmentation Using Geostationary Satellite Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Wildfires have increased in frequency and severity over the past two decades, especially in the Western United States. Beyond physical infrastructure damage caused by these wildfire events, researchers have increasingly identified harmful impacts of particulate matter generated by wildfire smoke on respiratory, cardiovascular, and cognitive health. This inference is difficult due to the spatial and temporal uncertainty regarding how much particulate matter is specifically attributable to wildfire smoke. One factor contributing to this challenge is the reliance on manually drawn smoke plume annotations, which are often noisy representations limited to the United States. This work uses deep convolutional neural networks to segment smoke plumes from geostationary satellite imagery. We compare the performance of predicted plume segmentations versus the noisy annotations using causal inference methods to estimate the amount of variation each explains in Environmental Protection Agency (EPA) measured surface level particulate matter <2.5μm in diameter (PM2.5). Authors: Jeffrey L Wen (Stanford University); Marshall Burke (Stanford University) |
Computer vision and remote sensing Behavioral and social science Climate and Earth science Disaster prediction, management, and relief Societal adaptation Classification, regression, and supervised learning |
ICML 2021 |
Deep Spatial Temporal Forecasting of Electrical Vehicle Charging Demand
(Papers Track)
Abstract and authors: (click to expand)Abstract: Electric vehicles can offer a low carbon emission solution to reverse rising emission trends. However, this requires that the energy used to meet the demand is green. To meet this requirement, accurate forecasting of the charging demand is vital. Short and long-term charging demand forecasting will allow for better optimisation of the power grid and future infrastructure expansions. In this paper, we propose to use publicly available data to forecast the electric vehicle charging demand. To model the complex spatial-temporal correlations between charging stations, we argue that Temporal Graph Convolution Models are the most suitable to capture the correlations. The proposed Temporal Graph Convolutional Networks provide the most accurate forecasts for short and long-term forecasting compared with other forecasting methods. Authors: Frederik B Hüttel (Technical University of Denmark (DTU)); Filipe Rodrigues (Technical University of Denmark (DTU)); Inon Peled (Technical University of Denmark (DTU)); Francisco Pereira (DTU) |
Transportation Power and energy Classification, regression, and supervised learning Time-series analysis |
ICML 2021 |
Powering Effective Climate Communication with a Climate Knowledge Base
(Proposals Track)
Abstract and authors: (click to expand)Abstract: While many accept climate change and its growing impacts, few converse about it well, limiting the adoption speed of societal changes necessary to address it. In order to make effective climate communication easier, we aim to build a system that presents to any individual the climate information predicted to best motivate and inspire them to take action given their unique set of personal values. To alleviate the cold-start problem, the system relies on a knowledge base (ClimateKB) of causes and effects of climate change, and their associations to personal values. Since no such comprehensive ClimateKB exists, we revisit knowledge base construction techniques and build a ClimateKB from free text. We plan to open source the ClimateKB and associated code to encourage future research and applications. Authors: Kameron B. Rodrigues (Stanford University); Shweta Khushu (SkySpecs Inc); Mukut Mukherjee (ClimateMind); Andrew Banister (Climate Mind); Anthony Hevia (ClimateMind); Sampath Duddu (ClimateMind); Nikita Bhutani (Megagon Labs) |
Natural language processing Knowledge engineering |
ICML 2021 |
Solar PV Maps for Estimation and Forecasting of Distributed Solar Generation
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Rapid uptake of distributed solar PV is starting to make the operation of grids and energy markets more challenging, and better methods are needed for measuring and forecasting distributed solar PV generation across entire regions. We propose a method for converting time series data from a number of point sources (power measurements at individual sites) into 2-dimensional maps that estimate total solar PV generation across large areas. These maps may be used on their own, or in conjunction with additional data sources (such as satellite imagery) in a deep learning framework that enables improved regional solar PV estimation and forecasting. We provide some early validation and results, discuss anticipated benefits of this approach, and argue that this method has the potential to further enable significant uptake of solar PV, assisting a shift away from fossil fuel-based generation. Authors: Julian de Hoog (The University of Melbourne); Maneesha Perera (The University of Melbourne); Kasun Bandara (The University of Melbourne); Damith Senanayake (The University of Melbourne); Saman Halgamuge (University of Melbourne) |
Power and energy Classification, regression, and supervised learning Time-series analysis |
ICML 2021 |
An Iterative Approach to Finding Global Solutions of AC Optimal Power Flow Problems
(Proposals Track)
Abstract and authors: (click to expand)Abstract: To achieve a cleaner energy system, a diverse set of energy resources such as solar PV, battery storage and electric vehicles are entering the electric grid. Their operation is typically controlled by solving a resource allocation problem, called the AC optimal power flow (ACOPF) problem. This problem minimizes the cost of generation subject to supply/demand balance and various other engineering constraints. It is nonlinear and nonconvex, and existing solvers are generally successful in finding local solutions. As the share of renewable energy resources increases, it is becoming increasingly important to find globally optimal solutions to utilize these resources to the full extent. In this paper, we propose a simple iterative approach to find globally optimal solutions to ACOPF problems. First, we call an existing solver for the ACOPF problem and we form a partial Lagrangian from the associated dual variables. This partial Lagrangian has a much better optimization landscape and we use its solution as a warm start for the ACOPF problem. By repeating this process, we can iteratively improve the solution quality, moving from local solutions to global ones. We demonstrate the effectiveness of our algorithm on standard benchmarks. We also show how the dual variables could be found by using a neural network to further speed up the algorithm. Authors: Ling Zhang (University of Washington); Baosen Zhang (University of Washington) |
Power and energy |
ICML 2021 |
Deep learning applied to sea surface semantic segmentation: Filtering sunglint from aerial imagery
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Water waves are an ubiquitous feature of the oceans, which serve as a pathway for interactions with the atmosphere. Wave breaking in particular is crucial in developing better understanding of the exchange of momentum, heat, and gas fluxes between the ocean and the atmosphere. Characterizing the properties of wave breaking using orbital or suborbital imagery of the surface of the ocean can be challenging, due to contamination from sunglint, a persistent feature in certain lighting conditions. Here we propose a supervised learning approach to accurately detect whitecaps from airborne imagery obtained under a broad range of lighting conditions. Finally, we discuss potential applications for improving ocean and climate models. Authors: Teodor Vrecica (UCSD); Quentin Paletta (University of Cambridge); Luc Lenain (UCSD) |
Climate and Earth science Classification, regression, and supervised learning Computer vision and remote sensing |
ICML 2021 |
Technical support project and analysis of the dissemination of carbon dioxide and methane from Lake Kivu in nature and its impact on biodiversity in the Great Lakes region since 2012
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Straddling the Democratic Republic of the Congo and Rwanda, at an altitude of 1,460 m, Lake Kivu is one of the ten great lakes in Africa, alongside the main ones that are Victoria and Tanganyika. Kivu contains very high concentrations of gases (carbon dioxide and methane in particular), produced by volcanic activity in the region and the decomposition of organic matter. It has 2,700 km2 of this body of water, a depth that approaches 500 meters in places. It is estimated to contain 60 billion cubic meters of dissolved methane and about 300 billion cubic meters of carbon dioxide accumulated over time. Lake Kivu, located north of Lake Tanganyika and contains a very high amount of carbon dioxide and methane. Carbon dioxide (CO2) and methane (CH4) are both greenhouse gases that affect how well the planet works. The first stays in the atmosphere for a hundred years while the second stays there only for a dozen years. The effect of the dissemination of these in nature prompts me to collect as much data as possible on their circulation and to suggest possible solutions that are consistent with the Paris Agreement. In addition, many wastes come from households and/or small industries in the towns of Bukavu, Goma for the DRC and those of Gyangugu and Gisenyi for Rwanda constitute a high source of CH4 emissions which also contribute to global warming. The exploitation of methane expected in the near future is an additional threat to the sustainable development of ecosystem resources. For various reasons, Lake Kivu constitutes an adequate model for studying the responses of large tropical lakes to changes linked to human activity: indeed, despite its physical and biogeochemical peculiarities, the limnological and ecological processes of its pelagic waters are subject to the same forcings as in other large lakes in the same region, as shown by recent studies. Authors: Bulonze Chibaderhe (FEMAC Asbl) |
Reinforcement learning and control Carbon capture and sequestration |
ICML 2021 |
Virtual Screening for Perovskites Discovery
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Re-inventing the global energy harvesting system from fossil fuels to renewables is essential for reducing greenhouse gas emissions in line with current climate targets. Perovskite photovoltaics (PVs), the class of materials with relatively unexplored material configurations, play key role in solar energy generation due to their low manufacturing cost and exceptional optoelectronic properties. Without the efficient utilisation of machine learning, the discovery and manufacturing process could take a dangerously long time. We present a high-throughput computational design that leverages machine learning algorithms at various steps in order to assess the suitability of the organic molecules for the perovskite crystals. Authors: Andrea Karlova (UCL); Cameron C.L. Underwood (University of Surrey); Ravi Silva (University of Surrey) |
Power and energy Photovoltaics |
ICML 2021 |
Leveraging Machine Learning for Equitable Transition of Energy Systems
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Our planet is facing overlapping crises of climate change, global pandemic, and systemic inequality. To respond climate change, the energy system is in the midst of its most foundational transition since its inception, from traditional fuel-based energy sources to clean renewable sources. While the transition to a low-carbon energy system is in progress, there is an opportunity to make the new system more just and equitable than the current one that is inequitable in many forms. Measuring inequity in the energy system is a formidable task since it is large scale and the data is coming from abundant data sources. In this work, we lay out a plan to leverage and develop scalable machine learning (ML) tools to measure the equity of the current energy system and to facilitate a just transition to a clean energy system. We focus on two concrete examples. First, we explore how ML can help to measure the inequity in the energy inefficiency of residential houses in the scale of a town or a country. Second, we explore how deep learning techniques can help to estimate the solar potential of residential buildings to facilitate a just installation and incentive allocation of solar panels. The application of ML for energy equity is much broader than the above two examples and we highlight some others as well. The result of this research could be used by policymakers to efficiently allocate energy assistance subsidies in the current energy systems and to ensure justice in their energy transition plans. Authors: Enea Dodi (UMass Amherst); Anupama A Sitaraman (University of Massachusetts Amherst); Mohammad Hajiesmaili (UMass Amherst); Prashant Shenoy (University of Massachusetts Amherst) |
Classification, regression, and supervised learning Climate justice Time-series analysis |
ICML 2021 |
Long-term Burned Area Reconstruction through Deep Learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Wildfire impact studies are significantly hampered by the absence of a global long-term burned area dataset. This prevents conclusive statements on the role of anthropogenic activity on wildfire impacts over the last century. Here, we propose a workflow to construct a 1901-2014 reanalysis of monthly global burned area at a 0.5° by 0.5° scale. A neural network will be trained with weather-related, vegetational, societal and economic input parameters, and burned area as output label for the 1982-2014 time period. This model can then be applied to the whole 1901-2014 time period to create a data-driven, long-term burned area reanalysis. This reconstruction will allow to investigate the long-term effect of anthropogenic activity on wildfire impacts, will be used as basis for detection and attribution studies and could help to reduce the uncertainties in future predictions. Authors: Seppe Lampe (Vrije Universiteit Brussel); Bertrand Le Saux (European Space Agency (ESA)); Inne Vanderkelen (Vrije Universiteit Brussel); Wim Thiery (Vrije Universiteit Brussel) |
Disaster prediction, management, and relief Classification, regression, and supervised learning Computer vision and remote sensing |
ICML 2021 |
Preserving the integrity of the Canadian northern ecosystems through insights provided by reinforcement learning-based Arctic fox movement models
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Realistic modeling of the movement of the Arctic fox, one of the main predators of the circumpolar world, is crucial to understand the processes governing the distribution of the Canadian Arctic biodiversity. Current methods, however, are unable to adequately account for complex behaviors as well as intra- and interspecific relationships. We propose to harness the potential of reinforcement learning to develop innovative models that will address these shortcomings and provide the backbone to predict how vertebrate communities may be affected by environmental changes in the Arctic, an essential step towards the elaboration of rational conservation actions. Authors: Catherine Villeneuve (Université Laval); Frédéric Dulude-De Broin (Université Laval); Pierre Legagneux (Université Laval); Dominique Berteaux (Université du Québec à Rimouski); Audrey Durand (Université Laval) |
Ecosystems and natural systems Reinforcement learning and control |
ICML 2021 |
Street2Sat: A Machine Learning Pipeline for Generating Ground-truth Geo-referenced Labeled Datasets from Street-Level Images
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Ground-truth labels on crop type and other variables are critically needed to develop machine learning methods that use satellite observations to combat climate change and food insecurity. These labels difficult and costly to obtain over large areas, particularly in Sub-Saharan Africa where they are most scarce. We propose Street2Sat, a new framework for obtaining large data sets of geo-referenced crop type labels obtained from vehicle-mounted cameras that can be extended to other applications. Using preliminary data from Kenya, we present promising results from this approach and identify future to improve the method before operational use in 5 countries. Authors: Madhava Paliyam (University of Maryland); Catherine L Nakalembe (University of Maryland); Kevin Liu (University of Maryland); Richard Nyiawung (University of Guelph); Hannah R Kerner (University of Maryland) |
Agriculture, forestry and other land use Classification, regression, and supervised learning Computer vision and remote sensing |
ICML 2021 |
From Talk to Action with Accountability: Monitoring the Public Discussion of Policy Makers with Deep Neural Networks and Topic Modelling
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Decades of research on climate have provided a consensus that human activity has changed the climate and we are currently heading into a climate crisis. While public discussion and research efforts on climate change mitigation have increased, potential solutions need to not only be discussed but also effectively deployed. For preventing mismanagement and holding policy makers accountable, transparency and degree of information about government processes have been shown to be crucial. However, currently the quantity of information about climate change discussions and the range of sources make it increasingly difficult for the public and civil society to maintain an overview to hold politicians accountable. In response, we propose a multi-source topic aggregation system (MuSTAS) which processes policy makers speech and rhetoric from several publicly available sources into an easily digestible topic summary. MuSTAS uses novel multi-source hybrid latent Dirichlet allocation to model topics from a variety of documents. This topic digest will serve the general public and civil society in assessing where, how, and when politicians talk about climate and climate policies, enabling them to hold politicians accountable for their actions to mitigate climate change and lack thereof. Authors: Vili Hätönen (Emblica); Fiona Melzer (University of Edinburgh) |
Natural language processing Behavioral and social science Climate policy Unsupervised and semi-supervised learning |
ICML 2021 |
NeuralNERE: Neural Named Entity Relationship Extraction for End-to-End Climate Change Knowledge Graph Construction
(Proposals Track)
Abstract and authors: (click to expand)Abstract: This paper proposes an end-to-end Neural Named Entity Relationship Extraction model (called NeuralNERE) for climate change knowledge graph (KG) construction, directly from the raw text of relevant news articles. The proposed model will not only remove the need for any kind of human supervision for building knowledge bases for climate change KG construction (used in the case of supervised or dictionary-based KG construction methods), but will also prove to be highly valuable for analyzing climate change by summarising relationships between different factors responsible for climate change, extracting useful insights & reasoning on pivotal events, and helping industry leaders in making more informed future decisions. Additionally, we also introduce the Science Daily Climate Change dataset (called SciDCC) that contains over 11k climate change news articles scraped from the Science Daily website, which could be used for extracting prior knowledge for constructing climate change KGs. Authors: Prakamya Mishra (Independent Researcher); Rohan Mittal (Independent Researcher) |
Natural language processing Behavioral and social science Climate policy Unsupervised and semi-supervised learning |
ICML 2021 |
A multi-task learning approach to enhance sustainable biomolecule production in engineered microorganisms
(Proposals Track)
Abstract and authors: (click to expand)Abstract: A sustainable alternative to sourcing many materials humans need is metabolic engineering: a field that aims to engineer microorganisms into biological factories that convert renewable feedstocks into valuable biomolecules (i.e., jet fuel, medicine). Microorganism factories must be genetically optimized using predictable DNA sequence tools, however, for many organisms, the exact DNA sequence signals defining their genetic control systems are poorly understood. To better decipher these DNA signals, we propose a multi-task learning approach that uses deep learning and feature attribution methods to identify DNA sequence signals that control gene expression in the methanotroph M. buryatense. This bacterium consumes methane, a potent greenhouse gas. If successful, this work would enhance our ability to build gene expression tools to more effectively engineer M. buryatense into an efficient biomolecule factory that can divert methane pollution into valuable, everyday materials. Authors: Erin Wilson (University of Washington); Mary Lidstrom (University of Washington); David Beck (University of Washington) |
Other Sustainable molecule production |
ICML 2021 |
MethaNet - an AI-driven approach to quantifying methane point-source emission from high-resolution 2-D plume imagery
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Methane (CH4) is one of the most powerful anthropogenic greenhouse gases with a significant impact on global warming trajectory and tropospheric air quality. Quantifying an emission rate of observed CH4 plumes from aerial or satellite images is a critical step for understanding the local distributions and subsequently prioritizing mitigation target sites. However, there exists no method that can reliably predict emission rates from detected plumes in real-time without ancillary data. Here, we trained a convolutional neural network model, called MethaNet, to predict methane point-source emission directly from high-resolution 2-D plume images without relying on other local measurements such as background wind speeds. Our results support the basis for the applicability of using deep learning techniques to quantify CH4 point sources in an automated manner over large geographical areas. MethaNet opens the way for real-time monitoring systems, not only for present and future airborne field campaigns but also for upcoming space-based observations in this decade. Authors: Siraput Jongaramrungruang (Caltech) |
Computer vision and remote sensing Carbon capture and sequestration Climate and Earth science Classification, regression, and supervised learning |
ICML 2021 |
Tackling the Overestimation of Forest Carbon with Deep Learning and Aerial Imagery
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Forest carbon offsets are increasingly popular and can play a significant role in financing climate mitigation, forest conservation, and reforestation. Measuring how much carbon is stored in forests is, however, still largely done via expensive, time-consuming, and sometimes unaccountable field measurements. To overcome these limitations, many verification bodies are leveraging machine learning (ML) algorithms to estimate forest carbon from satellite or aerial imagery. Aerial imagery allows for tree species or family classification, which improves on the satellite imagery-based forest type classification. However, aerial imagery is significantly more expensive to collect and it is unclear by how much the higher resolution improves the forest carbon estimation. This proposal paper describes the first systematic comparison of forest carbon estimation from aerial imagery, satellite imagery, and “ground-truth“ field measurements via deep learning-based algorithms for a tropical reforestation project. Our initial results show that forest carbon estimates from satellite imagery can overestimate above-ground biomass by more than 10-times for tropical reforestation projects. The significant difference between aerial and satellite-derived forest carbon measurements shows the potential for aerial imagery-based ML algorithms and raises the importance to extend this study to a global benchmark between options for carbon measurements. Authors: Gyri Reiersen (TUM); David Dao (ETH Zurich); Björn Lütjens (MIT); Konstantin Klemmer (University of Warwick); Xiaoxiang Zhu (Technical University of Munich,Germany); Ce Zhang (ETH) |
Carbon capture and sequestration Computer vision and remote sensing |
ICML 2021 |
Learning Why: Data-Driven Causal Evaluations of Climate Models
(Proposals Track)
Abstract and authors: (click to expand)Abstract: We plan to use nascent data-driven causal discovery methods to find and compare causal relationships in observed data and climate model output. We will look at ten different features in the Arctic climate collected from public databases and from the Energy Exascale Earth System Model (E3SM). In identifying and comparing the resulting causal networks, we hope to find important differences between observed causal relationships and those in climate models. With these, climate modeling experts will be able to improve the coupling and parameterization of E3SM and other climate models. Authors: Jeffrey J Nichol (University of New Mexico); Matthew Peterson (Sandia National Laboratories); George M Fricke (UNM); Kara Peterson (Sandia National Laboratories) |
Climate and Earth science Causal and Bayesian methods |
ICML 2021 |
Enhancing Laboratory-scale Flow Imaging of Fractured Geological Media with Deep Learning Super Resolution
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Injection into deep geological formations is a promising approach for the utilization, sequestration, and removal from the atmosphere of CO2 emissions. Laboratory experiments are essential to characterize how CO2 flows and reacts in various types of geological media. We reproduce such dynamic injection processes while imaging using Computed Tomography (CT) at sufficient temporal resolution to visualize changes in the flow field. The resolution of CT, however, is on the order of 100's of micrometers and insufficient to characterize fine-scale reaction-induced alterations to micro-fractures. Super resolution deep learning is, therefore, an essential tool to improve spatial resolution of dynamic CT images. We acquired and processed pairs of multi-scale low- and high-resolution CT rock images. We also show the performance of our baseline model on fractured rock images using peak signal to noise ratio and structural similarity index. Coupling dynamic CT imaging with deep learning results in visualization with enhanced spatial resolution of about a factor of 4 thereby enabling improved interpretation. Authors: Manju Pharkavi Murugesu (Stanford University); Timothy Anderson (Stanford University); Niccolo Dal Santo (MathWorks, Inc.); Vignesh Krishnan (The MathWorks Ltd); Anthony Kovscek (Stanford University) |
Carbon capture and sequestration Climate and Earth science Computer vision and remote sensing Generative modeling Unsupervised and semi-supervised learning |
ICML 2021 |
Forecasting emissions through Kaya identity using Neural ODEs
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Starting from Kaya identity, we used a Neural ODE model to predict the evolution of several indicators related to carbon emissions, on a country-level : population, GDP per capita, energy intensity of GDP, carbon intensity of energy. We compared the model with a baseline statistical model - VAR - and obtained good performances. We conclude that this machine-learning approach can be used to produce a wide range of results and give relevant insight to policymakers. Authors: Pierre Browne (Imperial College London) |
Time-series analysis Climate policy Power and energy |
ICML 2021 |
On the Role of Spatial Clustering Algorithms in Building Species Distribution Models from Community Science Data
(Proposals Track)
Best Paper: Proposals
Abstract and authors: (click to expand)Abstract: This paper discusses opportunities for developments in spatial clustering methods to help leverage broad scale community science data for building species distribution models (SDMs). SDMs are tools that inform the science and policy needed to mitigate the impacts of climate change on biodiversity. Community science data span spatial and temporal scales unachievable by expert surveys alone, but they lack the structure imposed in smaller scale studies to allow adjustments for observational biases. Spatial clustering approaches can construct the necessary structure after surveys have occurred, but more work is needed to ensure that they are effective for this purpose. In this proposal, we describe the role of spatial clustering for realizing the potential of large biodiversity datasets, how existing methods approach this problem, and ideas for future work. Authors: Mark Roth (Oregon State University); Tyler Hallman (Swiss Ornithological Institute); W. Douglas Robinson (Oregon State University); Rebecca Hutchinson (Oregon State University) |
Ecosystems and natural systems Unsupervised and semi-supervised learning |
ICML 2021 |
Machine Learning for Climate Change: Guiding Discovery of Sorbent Materials for Direct Air Capture of CO2
(Proposals Track)
Abstract and authors: (click to expand)Abstract: The global climate crisis requires interdisciplinary collaboration. The same is true for making significant strides in materials discovery for direct air capture (DAC) of carbon dioxide (CO2). DAC is an emerging technology that captures CO2 directly from the atmosphere and it is part of the solution to achieving carbon neutrality by 2050. The proposed project is a collaborative effort that tackles climate change by using machine learning to guide scientists to novel, optimized, advanced sorbent materials for direct air capture of CO2. Immediate impacts will include high throughput machine learning tools for developing new, cost-effective CO2 sorption materials, and continued, expanded collaborations with potential domestic and international stakeholders. Authors: Diana L Ortiz-Montalvo (NIST); Aaron Gilad Kusne (NIST); Austin McDannald (NIST); Daniel Siderius (NIST); Kamal Choudhary (NIST); Taner Yildirim (NIST) |
Carbon capture and sequestration Active learning |
ICML 2021 |
Reducing greenhouse gas emissions by optimizing room temperature set-points
(Proposals Track)
Abstract and authors: (click to expand)Abstract: We design a learning and optimization framework to mitigate greenhouse gas emissions associated with heating and cooling buildings. The framework optimizes room temperature set-points based on forecasts of weather, occupancy, and the greenhouse gas intensity of electricity. We compare two approaches: the first one combines a linear load forecasting model with convex optimization that offers a globally optimal solution, whereas the second one combines a nonlinear load forecasting model with nonconvex optimization that offers a locally optimal solution. The project explores the two approaches with a simulation testbed in EnergyPlus and experiments in university-campus buildings. Authors: Yuan Cai (MIT); Subhro Das (MIT-IBM Watson AI Lab, IBM Research); Leslie Norford (Massachusetts Institute of Technology); Jeremy Gregory (Massachusetts Institute of Technology); Julia Wang (Massachusetts Institute of Technology); Kevin J Kircher (MIT); Jasmina Burek (Massachusetts Institute of Technology) |
Buildings and cities Power and energy Classification, regression, and supervised learning Time-series analysis |
ICML 2021 |
Deep learning network to project future Arctic ocean waves
(Proposals Track)
Abstract and authors: (click to expand)Abstract: The Arctic Ocean is warming at an alarming rate and will likely become ice-free in summer by mid-century. This will be accompanied by higher ocean surface waves, which pose a risk to coastal communities and marine operations. In order to develop climate change adaptation strategies, it is imperative to robustly assess the future changes in the Arctic ocean wave climate. This requires a large ensemble of regional ocean wave projections to properly capture the range of climate modeling uncertainty in the Arctic region. This has been proven challenging, as ocean wave information is typically not provided by climate models, ocean wave numerical modeling is computationally expensive, and most global wave climate ensembles exclude the Arctic region. Here we present a framework to develop a deep learning network based on CNN and LSTM which could be potentially used to obtain such a large ensemble of Arctic wave projections with an affordable cost. Authors: Merce Casas Prat (Environment and Climate Change Canada); Lluis Castrejon (Mila, Université de Montréal, Facebook AI Research); Shady Moahmmed (University of Ottawa) |
Climate and Earth science Computer vision and remote sensing |
ICML 2021 |
Deep Learning for Spatiotemporal Anomaly Forecasting: A Case Study of Marine Heatwaves
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Spatiotemporal data have unique properties and require specific considerations. Forecasting spatiotemporal processes is a difficult task because the data are high-dimensional, often are limited in extent, and temporally correlated. Hence, we propose to evaluate several deep learning-based approaches that are relevant to spatiotemporal anomaly forecasting. We will use marine heatwaves as a case study. Those are observed around the world and have strong impacts on marine ecosystems. The evaluated deep learning methods will be integrated for the task of marine heatwave prediction in order to overcome the limitations of spatiotemporal data and improve data-driven seasonal marine heatwave forecasts. Authors: Ding Ning (University of Canterbury); Varvara Vetrova (University of Canterbury); Karin Bryan (University of Waikato); Sébastien Delaux (Meteorological Service of New Zealand) |
Classification, regression, and supervised learning Climate and Earth science |
ICML 2021 |
Leveraging Domain Adaptation for Low-Resource Geospatial Machine Learning
(Proposals Track)
Abstract and authors: (click to expand)Abstract: Machine learning in remote sensing has matured alongside a proliferation in availability and resolution of geospatial imagery, but its utility is bottlenecked by the need for labeled data. What's more, many labeled geospatial datasets are specific to certain regions, instruments, or extreme weather events. We investigate the application of modern domain-adaptation to multiple proposed geospatial benchmarks, uncovering unique challenges and proposing solutions to them. Authors: John M Lynch (NC State University); Sam Wookey (Masterful AI) |
Unsupervised and semi-supervised learning Ecosystems and natural systems Computer vision and remote sensing |
ICML 2021 |
APPLYING TRANSFORMER TO IMPUTATION OF MULTI-VARIATE ENERGY TIME SERIES DATA
(Proposals Track)
Abstract and authors: (click to expand)Abstract: To reduce the greenhouse gas emissions from electricity production, it is necessaryto switch to an energy system based on renewable energy sources (RES). However,intermittent electricity generation from RES poses challenges for energy systems.The primary input for data-driven solutions is data on electricity generation fromRES, which usually contain many missing values. This proposal studies the useof attention-based algorithms to impute missing values of electricity production,electricity demand and electricity prices. Since attention mechanisms allow us totake into account dependencies between time series across multiple dimensionsefficiently, our approach goes beyond classic statistical methods and incorporatesmany related variables, such as electricity price, demand and production by othersources. Our preliminary results show that while transformers can come at highercomputational costs, they are more precise than classical imputation methods. Authors: Hasan Ümitcan Yilmaz (Karlsruhe Institute of Technology); Max Kleinebrahm (Karlsruhe Institut für Technologie); Christopher Bülte (Karlsruhe Institute of Technology); Juan Gómez-Romero (Universidad de Granada) |
Power and energy Time-series analysis Unsupervised and semi-supervised learning |
NeurIPS 2020 |
Electric Vehicle Range Improvement by Utilizing Deep Learning to Optimize Occupant Thermal Comfort
(Papers Track)
Abstract and authors: (click to expand)Abstract: Heating, ventilation and air-conditioning (HVAC) systems can have a significant impact on the driving range of battery electric vehicles (EV’s). Predicting thermal comfort in an automotive vehicle cabin’s highly asymmetric and dynamic thermal environment is critical for developing energy-efficient HVAC systems. In this study we have coupled high-fidelity Computational Fluid Dynamics (CFD) simulations and Artificial Neural Networks (ANN) to predict vehicle occupant thermal comfort for any combination of steady-state boundary conditions. A vehicle cabin CFD model, validated against climatic wind tunnel measurements, was used to systematically generate training and test data that spanned the entire range of boundary conditions which impact occupant thermal comfort in an electric vehicle. Artificial neural networks (ANN) were applied to the simulation data to predict the overall Equivalent Homogeneous Temperature (EHT) comfort index for each occupant. An ensemble of five neural network models was able to achieve a mean absolute error of 2 ºC or less in predicting the overall EHT for all occupants in the vehicle on unseen or test data, which is acceptable for rapid evaluation and optimization of thermal comfort energy demand. The deep learning model developed in this work enables predictions of thermal comfort for any combination of steady-state boundary conditions in real-time without being limited by time-consuming and expensive CFD simulations or climatic wind tunnel tests. This model has been deployed as an easy-to-use web application within the organization for HVAC engineers to optimize thermal comfort energy demand and, thereby, driving range of electric vehicle programs. Authors: Alok Warey (General Motors Global Research and Development); Shailendra Kaushik (General Motors Global Research and Development); Bahram Khalighi (General Motors Global Research and Development); Michael Cruse (Siemens Digital Industries Software); Ganesh Venkatesan (Siemens Digital Industries Software) |
Transportation Classification, regression, and supervised learning |
NeurIPS 2020 |
Is Africa leapfrogging to renewables or heading for carbon lock-in? A machine-learning-based approach to predicting success of power-generation projects
(Papers Track)
Abstract and authors: (click to expand)Abstract: Several extant energy-planning studies, comprising wide-ranging assumptions about the future, feature projections of Africa’s rapid transition in the next decade towards renewables-based power generation. Here, we develop a novel empirical approach to predicting medium-term generation mix that can complement traditional energy planning. Relying on the largest dataset on historic and planned power plants available for Africa, combined with country-level characteristics, we build a machine-learning-based model, using gradient boosted trees, that demonstrates high predictive performance. Training our model on past successful and failed projects, we find that the most relevant factors for commissioning are plant-level: capacity, fuel, ownership and grid connection type. We then apply the trained model to predict the realisation of the current project pipeline. Contrary to the rapid transition scenarios, our results show that the share of non-hydro renewables in generation is likely to remain below 10% in 2030, despite total generation more than doubling. These findings point to high carbon lock-in risks in Africa, highlighting the urgency to shift its pipeline of projects towards low-carbon energy and improve the realisation chances of renewable energy plants. Authors: Galina Alova (University of Oxford); Philipp Trotter (University of Oxford); Alex Money (University of Oxford) |
Power and energy Climate finance and economics Classification, regression, and supervised learning Interpretable ML |
NeurIPS 2020 |
pymgrid: An Open-Source Python Microgrid Simulator for Applied Artificial Intelligence Research
(Papers Track)
Abstract and authors: (click to expand)Abstract: Microgrids – self-contained electrical grids that are capable of disconnecting from the main grid – hold potential in both tackling climate change mitigation via reducing CO$_2$ emissions and adaptation by increasing infrastructure resiliency. Due to their distributed nature, microgrids are often idiosyncratic; as a result, control of these systems is nontrivial. While microgrid simulators exist, many are limited in scope and in the variety of microgrids they can simulate. We propose \HL{pymgrid}, an open-source Python package to generate and simulate a large number of microgrids, and the first open-source tool that can generate more than 600 different microgrids. \HL{pymgrid} abstracts most of the domain expertise, allowing users to focus on control algorithms. In particular, \HL{pymgrid} is built to be a reinforcement learning (RL) platform, and includes the ability to model microgrids as Markov decision processes. \HL{pymgrid} also introduces two pre-computed list of microgrids, intended to allow for research reproducibility in the microgrid setting. Authors: Gonzague Henri (Total); Tanguy Levent (Ecole Polytechnique); Avishai Halev (Total, UC Davis); Reda ALAMI (Total R&D); Philippe Cordier (Total S.A.) |
Power and energy Reinforcement learning and control |
NeurIPS 2020 |
Towards Optimal District Heating Temperature Control in China with Deep Reinforcement Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: Achieving efficiency gains in Chinese district heating networks, thereby reducing their carbon footprint, requires new optimal control methods going beyond current industry tools. Focusing on the secondary network, we propose a data-driven deep reinforcement learning (DRL) approach to address this task. We build a recurrent neural network, trained on simulated data, to predict the indoor temperatures. This model is then used to train two DRL agents, with or without expert guidance, for the optimal control of the supply water temperature. Our tests in a multi-apartment setting show that both agents can ensure a higher thermal comfort and at the same time a smaller energy cost, compared to an optimized baseline strategy. Authors: Adrien Le Coz (EDF); Tahar Nabil (EDF); Francois Courtot (EDF) |
Buildings and cities Reinforcement learning and control |
NeurIPS 2020 |
Deep Reinforcement Learning in Electricity Generation Investment for the Minimization of Long-Term Carbon Emissions and Electricity Costs
(Papers Track)
Abstract and authors: (click to expand)Abstract: A change from a high-carbon emitting electricity power system to one based on renewables would aid in the mitigation of climate change. Decarbonization of the electricity grid would allow for low-carbon heating, cooling and transport. Investments in renewable energy must be made over a long time horizon to maximise return of investment of these long life power generators. Over these long time horizons, there exist multiple uncertainties, for example in future electricity demand and costs to consumers and investors. To mitigate for imperfect information of the future, we use the deep deterministic policy gradient (DDPG) deep reinforcement learning approach to optimize for a low-cost, low-carbon electricity supply using a modified version of the FTT:Power model. In this work, we model the UK and Ireland electricity markets. The DDPG algorithm is able to learn the optimum electricity mix through experience and achieves this between the years of 2017 and 2050. We find that a change from fossil fuels and nuclear power to renewables, based upon wind, solar and wave would provide a cheap and low-carbon alternative to fossil fuels. Authors: Alexander J. M. Kell (Newcastle University); Pablo Salas (University of Cambridge); Jean-Francois Mercure (University of Exeter); Matthew Forshaw (Newcastle University); A. Stephen McGough (Newcastle University) |
Power and energy Reinforcement learning and control |
NeurIPS 2020 |
Short-Term Solar Irradiance Forecasting Using Calibrated Probabilistic Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Advancing probabilistic solar forecasting methods is essential to supporting the integration of solar energy into the electricity grid. In this work, we develop a variety of state-of-the-art probabilistic models for forecasting solar irradiance. We investigate the use of post-hoc calibration techniques for ensuring well-calibrated probabilistic predictions. We train and evaluate the models using public data from seven stations in the SURFRAD network, and demonstrate that the best model, NGBoost, achieves higher performance at an intra-hourly resolution than the best benchmark solar irradiance forecasting model across all stations. Further, we show that NGBoost with CRUDE post-hoc calibration achieves comparable performance to a numerical weather prediction model on hourly-resolution forecasting. Authors: Eric Zelikman (Stanford University); Sharon Zhou (Stanford University); Jeremy A Irvin (Stanford); Cooper Raterink (Stanford University); Hao Sheng (Stanford University); Avati Anand (Stanford University); Jack Kelly (Open Climate Fix); Ram Rajagopal (Stanford University); Andrew Ng (Stanford University); David J Gagne (National Center for Atmospheric Research) |
Power and energy Climate and Earth science Causal and Bayesian methods Uncertainty quantification and robustness |
NeurIPS 2020 |
Deep Learning for Climate Model Output Statistics
(Papers Track)
Best ML Innovation
Abstract and authors: (click to expand)Abstract: Climate models are an important tool for the assessment of prospective climate change effects but they suffer from systematic and representation errors, especially for precipitation. Model output statistics (MOS) reduce these errors by fitting the model output to observational data with machine learning. In this work, we explore the feasibility and potential of deep learning with convolutional neural networks (CNNs) for MOS. We propose the CNN architecture ConvMOS specifically designed for reducing errors in climate model outputs and apply it to the climate model REMO. Our results show a considerable reduction of errors and mostly improved performance compared to three commonly used MOS approaches. Authors: Michael Steininger (University of Würzburg); Daniel Abel (University of Würzburg); Katrin Ziegler (University of Würzburg); Anna Krause (Universität Würzburg, Department of Computer Science, CHair X Data Science); Heiko Paeth (University of Würzburg); Andreas Hotho (Universitat Wurzburg) |
Climate and Earth science Classification, regression, and supervised learning |
NeurIPS 2020 |
A Temporally Consistent Image-based Sun Tracking Algorithm for Solar Energy Forecasting Applications
(Papers Track)
Abstract and authors: (click to expand)Abstract: Improving irradiance forecasting is critical to further increase the share of solar in the energy mix. On a short time scale, fish-eye cameras on the ground are used to capture cloud displacements causing the local variability of the electricity production. As most of the solar radiation comes directly from the Sun, current forecasting approaches use its position in the image as a reference to interpret the cloud cover dynamics. However, existing Sun tracking methods rely on external data and a calibration of the camera, which requires access to the device. To address these limitations, this study introduces an image-based Sun tracking algorithm to localise the Sun in the image when it is visible and interpolate its daily trajectory from past observations. We validate the method on a set of sky images collected over a year at SIRTA's lab. Experimental results show that the proposed method provides robust smooth Sun trajectories with a mean absolute error below 1% of the image size. Authors: Quentin Paletta (University of Cambridge); Joan Lasenby (University of Cambridge) |
Power and energy Classification, regression, and supervised learning Computer vision and remote sensing Time-series analysis |
NeurIPS 2020 |
Characterization of Industrial Smoke Plumes from Remote Sensing Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: The major driver of global warming has been identified as the anthropogenic release of greenhouse gas (GHG) emissions from industrial activities. The quantitative monitoring of these emissions is mandatory to fully understand their effect on the Earth’s climate and to enforce emission regulations on a large scale. In this work, we investigate the possibility to detect and quantify industrial smoke plumes from globally and freely available multi-band image data from ESA’s Sentinel-2 satellites. Using a modified ResNet-50, we can detect smoke plumes of different sizes with an accuracy of 94.3%. The model correctly ignores natural clouds and focuses on those imaging channels that are related to the spectral absorption from aerosols and water vapor, enabling the localization of smoke. We exploit this localization ability and train a U-Net segmentation model on a labeled sub-sample of our data, resulting in an Intersection-over-Union (IoU) metric of 0.608 and an overall accuracy for the detection of any smoke plume of 94.0%; on average, our model can reproduce the area covered by smoke in an image to within 5.6%. The performance of our model is mostly limited by occasional confusion with surface objects, the inability to identify semi-transparent smoke, and human limitations to properly identify smoke based on RGB-only images. Nevertheless, our results enable us to reliably detect and qualitatively estimate the level of smoke activity in order to monitor activity in industrial plants across the globe. Our data set and code base are publicly available. Authors: Michael Mommert (University of St. Gallen); Mario Sigel (Sociovestix Labs Ltd.); Marcel Neuhausler (ISS Technology Innovation Lab); Linus M. Scheibenreif (University of St. Gallen); Damian Borth (University of St. Gallen) |
Computer vision and remote sensing Industry |
NeurIPS 2020 |
Learning the distribution of extreme precipitation from atmospheric general circulation model variables
(Papers Track)
Abstract and authors: (click to expand)Abstract: Precipitation extremes are projected to become more frequent and severe in a warming atmosphere over the coming decades. However, the accurate prediction of precipitation, in particular of extremes, remains a challenge for numerical weather prediction models. A large source of error are subgrid-scale parameterizations of processes that play a crucial role in the complex, multi-scale dynamics of precipitation, but are not explicitly resolved in the model formulation. Here we follow a hybrid, data-driven approach, in which atmospheric variables such as wind fields are forecast in time by a general circulation model (GCM) ensemble and then mapped to precipitation using a deep convolutional autoencoder. A frequency-based weighting of the loss function is introduced to improve the learning with regard to extreme values. Our results show an improved representation of extreme precipitation frequencies, as well as better error and correlation statistics compared to a state-of-the-art GCM ensemble. Authors: Philipp Hess (Free University Berlin); Niklas Boers (Free University Berlin) |
Climate and Earth science Hybrid physical models |
NeurIPS 2020 |
Towards Tracking the Emissions of Every Power Plant on the Planet
(Papers Track)
Best Pathway to Impact
Abstract and authors: (click to expand)Abstract: Greenhouse gases emitted from fossil-fuel-burning power plants are a major contributor to climate change. Current methods to track emissions from individual sources are expensive and only used in a few countries. While carbon dioxide concentrations can be measured globally using remote sensing, background fluctuations and low spatial resolution make it difficult to attribute emissions to individual sources. We use machine learning to infer power generation and emissions from visible and thermal power plant signatures in satellite images. By training on a data set of power plants for which we know the generation or emissions, we are able to apply our models globally. This paper demonstrates initial progress on this project by predicting whether a power plant is on or off from a single satellite image. Authors: Heather D Couture (Pixel Scientia Labs); Joseph O'Connor (Carbon Tracker); Grace Mitchell (WattTime); Isabella Söldner-Rembold (Carbon Tracker); Durand D'souza (Carbon Tracker); Krishna Karra (WattTime); Keto Zhang (WattTime); Ali Rouzbeh Kargar (WattTime); Thomas Kassel (WattTime); Brian Goldman (Google); Daniel Tyrrell (Google); Wanda Czerwinski (Google); Alok Talekar (Google); Colin McCormick (Georgetown University) |
Power and energy Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2020 |
Spatio-Temporal Learning for Feature Extraction inTime-Series Images
(Papers Track)
Abstract and authors: (click to expand)Abstract: Earth observation programs have provided highly useful information in global climate change research over the past few decades and greatly promoted its development, especially through providing biological, physical, and chemical parameters on a global scale. Programs such as Landsat, Sentinel, SPOT, and Pleiades can be used to acquire huge volume of medium to high resolution images every day. In this work, we organize these data in time series and we exploit both temporal and spatial information they provide to generate accurate and up-to-date land cover maps that can be used to monitor vulnerable areas threatened by the ongoing climatic and anthropogenic global changes. For this purpose, we combine a fully convolutional neural network with a convolutional long short-term memory. Implementation details of the proposed spatio-temporal neural network architecture are described. Examples are provided for the monitoring of roads and mangrove forests on the West African coast. Authors: Gael Kamdem De Teyou (Huawei) |
Agriculture, forestry and other land use Ecosystems and natural systems |
NeurIPS 2020 |
Meta-modeling strategy for data-driven forecasting
(Papers Track)
Abstract and authors: (click to expand)Abstract: Accurately forecasting the weather is a key requirement for climate change mitigation. Data-driven methods offer the ability to make more accurate forecasts, but lack interpretability and can be expensive to train and deploy if models are not carefully developed. Here, we make use of two historical climate data sets and tools from machine learning, to accurately predict temperature fields. Furthermore, we are able to use low fidelity models that are cheap to train and evaluate, to selectively avoid expensive high fidelity function evaluations, as well as uncover seasonal variations in predictive power. This allows for an adaptive training strategy for computationally efficient geophysical emulation. Authors: Dominic J Skinner (MIT); Romit Maulik (Argonne National Laboratory) |
Climate and Earth science Interpretable ML Time-series analysis |
NeurIPS 2020 |
Short-term prediction of photovoltaic power generation using Gaussian process regression
(Papers Track)
Abstract and authors: (click to expand)Abstract: Photovoltaic (PV) power is affected by weather conditions, making the power generated from the PV systems uncertain. Solving this problem would help improve the reliability and cost effectiveness of the grid, and could help reduce reliance on fossil fuel plants. The present paper focuses on evaluating predictions of the energy generated by PV systems in the United Kingdom using Gaussian process regression (GPR). Gaussian process regression is a Bayesian non-parametric model that can provide predictions along with the uncertainty in the predicted value, which can be very useful in applications with a high degree of uncertainty. The model is evaluated for short-term forecasts of 48 hours against three main factors – training period, sky area coverage and kernel model selection – and for very short-term forecasts of four hours against sky area. We also compare very short-term forecasts in terms of cloud coverage within the prediction period and only initial cloud coverage as a predictor. Authors: Yahya Hasan Al Lawati (Queen Mary University of London); Jack Kelly (Open Climate Fix); Dan Stowell (Queen Mary University of London) |
Classification, regression, and supervised learning Causal and Bayesian methods |
NeurIPS 2020 |
Formatting the Landscape: Spatial conditional GAN for varying population in satellite imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is expected to reshuffle the settlement landscape: forcing people in affected areas to migrate, to change their lifeways, and continuing to affect demographic change throughout the world. Changes to the geographic distribution of population will have dramatic impacts on land use and land cover and thus constitute one of the major challenges of planning for climate change scenarios. In this paper, we explore a generative model framework for generating satellite imagery conditional on gridded population distributions. We make additions to the existing ALAE [30] architecture, creating a spatially conditional version: SCALAE. This method allows us to explicitly disentangle population from the model’s latent space and thus input custom population forecasts into the generated imagery. We postulate that such imagery could then be directly used for land cover and land use change estimation using existing frameworks, as well as for realistic visualisation of expected local change. We evaluate the model by comparing pixel and semantic reconstructions, as well as calculate the standard FID metric. The results suggest the model captures population distributions accurately and delivers a controllable method to generate realistic satellite imagery. Authors: Tomas Langer (Intuition Machines); Natalia Fedorova (Max Planck Institute for Evolutionary Anthropology); Ron Hagensieker (Osir.io) |
Generative modeling Behavioral and social science Buildings and cities Societal adaptation Computer vision and remote sensing |
NeurIPS 2020 |
Storing Energy with Organic Molecules: Towards a Metric for Improving Molecular Performance for Redox Flow Batteries
(Papers Track)
Abstract and authors: (click to expand)Abstract: Energy storage is an important tool in the decarbonization of energy systems, particularly when coupled with intermittent forms of energy. However, storage technologies are still not commercially competitive to garner mainstream adoption. This work focuses on the cost reduction of organic redox flow batteries (ORFBs) via materials discovery. We identify macroscopic metrics of interest to optimize for lowering their cost and relate them to the molecular properties of the materials involved. Furthermore, we consolidate a benchmark set of experimental data for building predictive models for these materials properties. Building more accurate models will aid practitioners in the rational design of new ORFB. Authors: Luis M Mejia Mendoza (University of Toronto); Alan Aspuru-Guzik (Harvard University); Martha Flores Leonar (University of Toronto) |
Classification, regression, and supervised learning Other Meta- and transfer learning |
NeurIPS 2020 |
Predicting Landsat Reflectance with Deep Generative Fusion
(Papers Track)
Abstract and authors: (click to expand)Abstract: Public satellite missions are commonly bound to a trade-off between spatial and temporal resolution as no single sensor provides fine-grained acquisitions with frequent coverage. This hinders their potential to assist vegetation monitoring or humanitarian actions, which require detecting rapid and detailed terrestrial surface changes. In this work, we probe the potential of deep generative models to produce high-resolution optical imagery by fusing products with different spatial and temporal characteristics. We introduce a dataset of co-registered Moderate Resolution Imaging Spectroradiometer (MODIS) and Landsat surface reflectance time series and demonstrate the ability of our generative model to blend coarse daily reflectance information into low-paced finer acquisitions. We benchmark our proposed model against state-of-the-art reflectance fusion algorithms. Authors: Shahine Bouabid (University of Oxford); Jevgenij Gamper (Cervest Ltd.) |
Generative modeling Agriculture, forestry and other land use Computer vision and remote sensing |
NeurIPS 2020 |
Quantitative Assessment of Drought Impacts Using XGBoost based on the Drought Impact Reporter
(Papers Track)
Abstract and authors: (click to expand)Abstract: Under climate change, the increasing frequency, intensity, and spatial extent of drought events lead to higher socio-economic costs. However, the relationships between the hydro-meteorological indicators and drought impacts are not identified well yet because of the complexity and data scarcity. In this paper, we proposed a framework based on the extreme gradient model (XGBoost) for Texas to predict multi-category drought impacts and connected a typical drought indicator, Standardized Precipitation Index (SPI), to the text-based impacts from the Drought Impact Reporter (DIR). The preliminary results of this study showed an outstanding performance of the well-trained models to assess drought impacts on agriculture, fire, society & public health, plants & wildlife, as well as relief, response & restrictions in Texas. It also provided a possibility to appraise drought impacts using hydro-meteorological indicators with the proposed framework in the United States, which could help drought risk management by giving additional information and improving the updating frequency of drought impacts. Our interpretation results using the Shapley additive explanation (SHAP) interpretability technique revealed that the rules guiding the predictions of XGBoost comply with domain expertise knowledge around the role that SPI indicators play around drought impacts. Authors: Beichen Zhang (University of Nebraska-Lincoln); Fatima K Abu Salem (American University of Beirut); Michael Hayes (University of Nebraska-Lincoln); Tsegaye Tadesse (University of Nebraska-Lincoln) |
Disaster prediction, management, and relief Interpretable ML |
NeurIPS 2020 |
Estimating Forest Ground Vegetation Cover From Nadir Photographs Using Deep Convolutional Neural Networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: Forest fires, such as those on the US west coast in September 2020, are an important factor in climate change. Wildfire modeling and mitigation require mapping vegetation ground cover over large plots of land. The current forestry practice is to send out human ground crews to collect photos of the forest floor at precisely determined locations, then manually calculate the percent cover of ground fuel types. In this work, we propose automating this process using a supervised learning-based deep convolutional neural network to perform image segmentation. Experimental results on a real dataset show this approach delivers very promising performance. Authors: Pranoy Panda (Indian Institute of Technology, Hyderabad); Martin Barczyk (University of Alberta); Jen Beverly (University of Alberta) |
Agriculture, forestry and other land use Classification, regression, and supervised learning |
NeurIPS 2020 |
A Machine Learning Approach to Methane Emissions Mitigation in the Oil and Gas Industry
(Papers Track)
Overall Best Paper
Abstract and authors: (click to expand)Abstract: Reducing methane emissions from the oil and gas sector is a key component of climate policy in the United States. Methane leaks across the supply chain are stochastic and intermittent, with a small number of sites (‘super-emitters’) responsible for a majority of emissions. Thus, cost-effective emissions reduction critically relies on effectively identifying the super-emitters from thousands of well-sites and millions of miles of pipelines. Conventional approaches such as walking surveys using optical gas imaging technology are slow and time-consuming. In addition, several variables contribute to the formation of leaks such as infrastructure age, production, weather conditions, and maintenance practices. Here, we develop a machine learning algorithm to predict high-emitting sites that can be prioritized for follow-up repair. Such prioritization can significantly reduce the cost of surveys and increase emissions reductions compared to conventional approaches. Our results show that the algorithm using logistic regression performs the best out of several algorithms. The model achieved a 70% accuracy rate with a 57% recall and a 66% balanced accuracy rate. Compared to the conventional approach, the machine learning model reduced the time to achieve a 50% emissions mitigation target by 42%. Correspondingly, the mitigation cost reduced from $85/t CO2e to $49/t CO2e. Authors: Jiayang Wang (Harrisburg University); Selvaprabu Nadarajah (University of Illinois at Chicago); Jingfan Wang (Stanford University); Arvind Ravikumar (Harrisburg University) |
Power and energy Classification, regression, and supervised learning |
NeurIPS 2020 |
Monitoring the Impact of Wildfires on Tree Species with Deep Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: One of the impacts of climate change is the difficulty of tree regrowth after wildfires over areas that traditionally were covered by certain tree species. Here a deep learning model is customized to classify land covers from four-band aerial imagery before and after wildfires to study the prolonged consequences of wildfires on tree species. The tree species labels are generated from manually delineated maps for five land cover classes: Conifer, Hardwood, Shrub, ReforestedTree, and Barren land. With an accuracy of 92% on the test split, the model is applied to three wildfires on data from 2009 to 2018. The model accurately delineates areas damaged by wildfires, changes in tree species, and regrowth in burned areas. The result shows clear evidence of wildfires impacting the local ecosystem and the outlined approach can help monitor reforested areas, observe changes in forest composition, and track wildfire impact on tree species. Authors: Wang Zhou (IBM Research); Levente Klein (IBM Research) |
Agriculture, forestry and other land use Disaster prediction, management, and relief Computer vision and remote sensing |
NeurIPS 2020 |
ForestNet: Classifying Drivers of Deforestation in Indonesia using Deep Learning on Satellite Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Characterizing the processes leading to deforestation is critical to the development and implementation of targeted forest conservation and management policies. In this work, we develop a deep learning model called ForestNet to classify the drivers of primary forest loss in Indonesia, a country with one of the highest deforestation rates in the world. Using satellite imagery, ForestNet identifies the direct drivers of deforestation in forest loss patches of any size. We curate a dataset of Landsat 8 satellite images of known forest loss events paired with driver annotations from expert interpreters. We use the dataset to train and validate the models and demonstrate that ForestNet substantially outperforms other standard driver classification approaches. In order to support future research on automated approaches to deforestation driver classification, the dataset curated in this study is publicly available at https://stanfordmlgroup.github.io/projects/forestnet . Authors: Jeremy A Irvin (Stanford); Hao Sheng (Stanford University); Neel Ramachandran (Stanford University); Sonja Johnson-Yu (Stanford University); Sharon Zhou (Stanford University); Kyle Story (Descartes Labs); Rose Rustowicz (Descartes Labs); Cooper Elsworth (Descartes Labs); Kemen Austin (RTI International); Andrew Ng (Stanford University) |
Agriculture, forestry and other land use Computer vision and remote sensing |
NeurIPS 2020 |
Mangrove Ecosystem Detection using Mixed-Resolution Imagery with a Hybrid-Convolutional Neural Network
(Papers Track)
Abstract and authors: (click to expand)Abstract: Mangrove forests are rich in biodiversity and are a large contributor to carbon sequestration critical in the fight against climate change. However, they are currently under threat from anthropogenic activities, so monitoring their health, extent, and productivity is vital to our ability to protect these important ecosystems. Traditionally, lower resolution satellite imagery or high resolution unmanned air vehicle (UAV) imagery has been used independently to monitor mangrove extent, both offering helpful features to predict mangrove extent. To take advantage of both of these data sources, we propose the use of a hybrid neural network, which combines a Convolutional Neural Network (CNN) feature extractor with a Multilayer-Perceptron (MLP), to accurately detect mangrove areas using both medium resolution satellite and high resolution drone imagery. We present a comparison of our novel Hybrid CNN with algorithms previously applied to mangrove image classification on a data set we collected of dwarf mangroves from consumer UAVs in Baja California Sur, Mexico, and show a 95\% intersection over union (IOU) score for mangrove image classification, outperforming all our baselines. Authors: Dillon Hicks (Engineers for Exploration); Ryan Kastner (University of California San Diego); Curt Schurgers (University of California San Diego); Astrid Hsu (University of California San Diego); Octavio Aburto (University of California San Diego) |
Computer vision and remote sensing Carbon capture and sequestration Ecosystems and natural systems Agriculture, forestry and other land use Classification, regression, and supervised learning |
NeurIPS 2020 |
Context-Aware Urban Energy Efficiency Optimization Using Hybrid Physical Models
(Papers Track)
Abstract and authors: (click to expand)Abstract: Buildings produce more U.S. greenhouse gas emissions through electricity generation than any other economic sector. To improve the energy efficiency of buildings, engineers often rely on physics-based building simulations to predict the impacts of retrofits in individual buildings. In dense urban areas, these models suffer from inaccuracy due to imprecise parameterization or external, unmodeled urban context factors such as inter-building effects and urban microclimates. In a case study of approximately 30 buildings in Sacramento, California, we demonstrate how our hybrid physics-driven deep learning framework can use these external factors advantageously to identify a more optimal energy efficiency retrofit installation strategy and achieve significant savings in both energy and cost. Authors: Benjamin Choi (Stanford University); Alex Nutkiewicz (Stanford University); Rishee Jain (Stanford University) |
Hybrid physical models Buildings and cities |
NeurIPS 2020 |
Deep learning architectures for inference of AC-OPF solutions
(Papers Track)
Abstract and authors: (click to expand)Abstract: We present a systematic comparison between neural network (NN) architectures for inference of AC-OPF solutions. Using fully connected NNs as a baseline we demonstrate the efficacy of leveraging network topology in the models by constructing abstract representations of electrical grids in the graph domain, for both convolutional and graph NNs. The performance of the NN architectures is compared for regression (predicting optimal generator set-points) and classification (predicting the active set of constraints) settings. Computational gains for obtaining optimal solutions are also presented. Authors: Thomas Falconer (University College London); Letif Mones (Invenia Labs) |
Classification, regression, and supervised learning Power and energy |
NeurIPS 2020 |
Predicting the Solar Potential of Rooftops using Image Segmentation and Structured Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: Estimating the amount of electricity that can be produced by rooftop photovoltaic systems is a time-consuming process that requires on-site measurements, a difficult task to achieve on a large scale. In this paper, we present an approach to estimate the solar potential of rooftops based on their location and architectural characteristics, as well as the amount of solar radiation they receive annually. Our technique uses computer vision to achieve semantic segmentation of roof sections and roof objects on the one hand, and a machine learning model based on structured building features to predict roof pitch on the other hand. We then compute the azimuth and maximum number of solar panels that can be installed on a rooftop with geometric approaches. Finally, we compute precise shading masks and combine them with solar irradiation data that enables us to estimate the yearly solar potential of a rooftop. Authors: Daniel de Barros Soares (nam.R); François ANDRIEUX (nam.R); Bastien HELL (nam.R); Julien LENHARDT (nam.R; ENSTA); JORDI BADOSA (Ecole Polytechnique); Sylvain GAVOILLE (nam.R); Stéphane GAIFFAS (nam.R; LPSM (Université de Paris)); Emmanuel BACRY (nam.R; CEREMADE (Université Paris Dauphine, PSL)) |
Buildings and cities Power and energy Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2020 |
Revealing the Oil Majors' Adaptive Capacity to the Energy Transition with Deep Multi-Agent Reinforcement Learning
(Papers Track)
Abstract and authors: (click to expand)Abstract: A low-carbon energy transition is transpiring to combat climate change, posing an existential threat to oil and gas companies, particularly the Majors. Though Majors yield the resources and expertise to adapt to low-carbon business models, meaningful climate-aligned strategies have yet to be enacted. A 2-degrees pathways (2DP) wargame was developed to assess climate-compatible pathways for the oil Majors. Recent advances in deep multi-agent reinforcement learning (MARL) have achieved superhuman-level performance in solving high-dimensional continuous control problems. Modeling within a Markovian framework, we present the novel 2DP-MARL model which applies deep MARL methods to solve the 2DP wargame across a multitude of transition scenarios. Designed to best mimic Majors in real- life competition, the model reveals all Majors quickly adapt to low-carbon business models to remain robust amidst energy transition uncertainty. The purpose of this work is provide tangible metrics to support the call for oil Majors to diversify into low-carbon business models and, thus, accelerate the energy transition. Authors: Dylan Radovic (Imperial College London); Lucas Kruitwagen (University of Oxford); Christian Schroeder de Witt (University of Oxford) |
Industry Climate finance and economics Reinforcement learning and control Uncertainty quantification and robustness |
NeurIPS 2020 |
Quantifying the presence of air pollutants over a road network in high spatio-temporal resolution
(Papers Track)
Abstract and authors: (click to expand)Abstract: Monitoring air pollution plays a key role when trying to reduce its impact on the environment and on human health. Traditionally, two main sources of information about the quantity of pollutants over a city are used: monitoring stations at ground-level (when available), and satellites' remote sensing. In addition to these two, other methods have been developed in the last years that aim at understanding how traffic emissions behave in space and time at a finer scale, taking into account the human mobility patterns. We present a simple and versatile framework for estimating the quantity of four air pollutants (CO2, NOx, PM, VOC) emitted by private vehicles moving on a road network, starting from raw GPS traces and information about vehicles' fuel type, and use this framework for analyses on how such pollutants distribute over the road networks of different cities. Authors: Matteo Bohm (Sapienza University of Rome); Mirco Nanni (ISTI-CNR Pisa, Italy); Luca Pappalardo (ISTI) |
Transportation Data mining |
NeurIPS 2020 |
Annual and in-season mapping of cropland at field scale with sparse labels
(Papers Track)
Abstract and authors: (click to expand)Abstract: Spatial information about where crops are being grown, known as cropland maps, are critical inputs for analyses and decision-making related to food security and climate change. Despite a widespread need for readily-updated annual and in-season cropland maps at the management (field) scale, these maps are unavailable for most regions at risk of food insecurity. This is largely due to lack of in-situ labels for training and validating machine learning classifiers. Previously, we developed a method for binary classification of cropland that learns from sparse local labels and abundant global labels using a multi-headed LSTM and time-series multispectral satellite inputs over one year. In this work, we present a new method that uses an autoregressive LSTM to classify cropland during the growing season (i.e., partially-observed time series). We used these methods to produce publicly-available 10m-resolution cropland maps in Kenya for the 2019-2020 and 2020-2021 growing seasons. These are the highest-resolution and most recent cropland maps publicly available for Kenya. These methods and associated maps are critical for scientific studies and decision-making at the intersection of food security and climate change. Authors: Gabriel Tseng (NASA Harvest); Hannah R Kerner (University of Maryland); Catherine L Nakalembe (University of Maryland); Inbal Becker-Reshef (University of Maryland) |
Agriculture, forestry and other land use Computer vision and remote sensing |
NeurIPS 2020 |
NightVision: Generating Nighttime Satellite Imagery from Infra-Red Observations
(Papers Track)
Abstract and authors: (click to expand)Abstract: The recent explosion in applications of machine learning to satellite imagery often rely on visible images and therefore suffer from a lack of data during the night. The gap can be filled by employing available infra-red observations to generate visible images. This work presents how Deep Learning can be applied successfully to create those images by using U-Net based architectures. The proposed methods show promising results, achieving a structural similarity index (SSIM) up to 86\% on an independent test set and providing visually convincing output images, generated from infra-red observations. Authors: Paula Harder (Fraunhofer ITWM); William Jones (University of Oxford); Redouane Lguensat (LSCE-IPSL); Shahine Bouabid (University of Oxford); James Fulton (University of Edinburgh); Dánnell Quesada-Chacón (Technische Universität Dresden); Aris Marcolongo (University of Bern); Sofija Stefanovic (University of Oxford); Yuhan Rao (North Carolina Institute for Climate Studies); Peter Manshausen (University of Oxford); Duncan Watson-Parris (University of Oxford) |
Computer vision and remote sensing Climate and Earth science Generative modeling |
NeurIPS 2020 |
Analyzing Sustainability Reports Using Natural Language Processing
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is a far-reaching, global phenomenon that will impact many aspects of our society, including the global stock market. In recent years, companies have increasingly been aiming to both mitigate their environmental impact and adapt their practices the changing climate context. This is reported via increasingly exhaustive reports, which cover many types of sustainability measures, often under the umbrella of Environmental, Social, and Governance (ESG) disclosures. However, given this abundance of data, sustainability analysts are obliged to comb through hundreds of pages of reports in order to find relevant information. We have leveraged recent progress in Natural Language Processing (NLP) to create a custom model, ClimateQA, which allows the analysis of financial reports in order to identify climate-relevant sections using a question answering approach. We present this tool and the methodology that we used to develop it in the present article. Authors: Sasha Luccioni (Mila); Emi Baylor (McGill); Nicolas Duchene (Universite de Montreal) |
Climate finance and economics Natural language processing |
NeurIPS 2020 |
Automated Identification of Oil Field Features using CNNs
(Papers Track)
Abstract and authors: (click to expand)Abstract: Oil and gas production sites have been identified as a major source of anthropogenic methane emissions. Emissions studies utilize counts of equipment to estimate emissions from production facilities. However these counts are poorly documented, including both information about well pad locations and major equipment on each well pad. While these data can be found by manually reviewing satellite imagery, it is prohibitively difficult and time consuming. This work, part of a larger study of methane emission studies in Colorado, US, adapted a machine learning (ML) algorithm to detect well pads and associated equipment. Our initial model showed an average well pad detection accuracy of 95% on the Denver-Julesburg (DJ) basin in northeastern Colorado. Our example demonstrates the potential for this type of automated detection from satellite imagery, leading to more accurate and complete models of production emissions. Authors: SONU DILEEP (Colorado State University); Daniel Zimmerle (Colorado State University); Ross Beveridge (CSU); Timothy Vaughn (Colorado State University) |
Climate and Earth science Computer vision and remote sensing |
NeurIPS 2020 |
Using attention to model long-term dependencies in occupancy behavior
(Papers Track)
Abstract and authors: (click to expand)Abstract: Over the past years, more and more models have been published that aim to capture relationships in human residential behavior. Most of these models are different Markov variants or regression models that have a strong assumption bias and are therefore unable to capture complex long-term dependencies and the diversity in occupant behavior. This work shows that attention based models are able to capture complex long-term dependencies in occupancy behavior and at the same time adequately depict the diversity in behavior across the entire population and different socio-demographic groups. By combining an autoregressive generative model with an imputation model, the advantages of two data sets are combined and new data are generated which are beneficial for multiple use cases (e.g. generation of consistent household energy demand profiles). The two step approach generates synthetic activity schedules that have similar statistical properties as the empirical collected schedules and do not contain direct information about single individuals. Therefore, the presented approach forms the basis to make data on occupant behavior freely available, so that further investigations based on the synthetic data can be carried out without a large data application effort. In future work it is planned to take interpersonal dependencies into account in order to be able to generate entire household behavior profiles. Authors: Max Kleinebrahm (Karlsruhe Institut für Technologie); Jacopo Torriti (University Reading); Russell McKenna (University of Aberdeen); Armin Ardone (Karlsruhe Institut für Technologie); Wolf Fichtner (Karlsruhe Institute of Technology) |
Behavioral and social science Buildings and cities Power and energy Transportation Natural language processing Time-series analysis |
NeurIPS 2020 |
Street to Cloud: Improving Flood Maps With Crowdsourcing and Semantic Segmentation
(Papers Track)
Abstract and authors: (click to expand)Abstract: To address the mounting destruction caused by floods in climate-vulnerable regions, we propose Street to Cloud, a machine learning pipeline for incorporating crowdsourced ground truth data into the segmentation of satellite imagery of floods. We propose this approach as a solution to the labor-intensive task of generating high-quality, hand-labeled training data, and demonstrate successes and failures of different plausible crowdsourcing approaches in our model. Street to Cloud leverages community reporting and machine learning to generate novel, near-real time insights into the extent of floods to be used for emergency response. Authors: Veda Sunkara (Cloud to Street); Matthew Purri (Rutgers University); Bertrand Le Saux (European Space Agency (ESA)); Jennifer Adams (European Space Agency (ESA)) |
Disaster prediction, management, and relief Computer vision and remote sensing |
NeurIPS 2020 |
Accurate river level predictions using a Wavenet-like model
(Papers Track)
Abstract and authors: (click to expand)Abstract: The effects of climate change on river levels are noticeable through a higher occurrence of floods with disastrous social and economic impacts. As such, river level forecasting is essential in flood mitigation, infrastructure management and secure shipping. Historical records of river levels and influencing factors such as rainfall or soil conditions are used for predicting future river levels. The current state-of-the-art time-series prediction model is the LSTM network, a recurrent neural network. In this work we study the efficiency of convolutional models, and specifically the WaveNet model in forecasting one-day ahead river levels. We show that the additional benefit of the WaveNet model is the computational ease with which other input features can be included in the predictions of river stage and river flow. The conditional WaveNet models outperformed conditional LSTM models for river level prediction by capturing short-term, non-linear dependencies between input data. Furthermore, the Wavenet model offers a faster computation time, stable results and more possibilities for fine-tuning. Authors: Shannon Doyle (UvA); Anastasia Borovykh (Imperial College London) |
Disaster prediction, management, and relief Other Classification, regression, and supervised learning |
NeurIPS 2020 |
Movement Tracks for the Automatic Detection of Fish Behavior in Videos
(Papers Track)
Abstract and authors: (click to expand)Abstract: Global warming is predicted to profoundly impact ocean ecosystems. Fish behavior is an important indicator of changes in such marine environments. Thus, the automatic identification of key fish behavior in videos represents a much needed tool for marine researchers, enabling them to study climate change-related phenomena. We offer a dataset of sablefish (Anoplopoma fimbria) startle behaviors in underwater videos, and investigate the use of deep learning (DL) methods for behavior detection on it. Our proposed detection system identifies fish instances using DL-based frameworks, determines trajectory tracks, derives novel behavior-specific features, and employs Long Short-Term Memory (LSTM) networks to identify startle behavior in sablefish. Its performance is studied by comparing it with a state-of-the-art DL-based video event detector. Authors: Declan GD McIntosh (University Of Victoria); Tunai Porto Marques (University of Victoria); Alexandra Branzan Albu (University of Victoria); Rodney Rountree (University of Victoria); Fabio De Leo Cabrera (Ocean Networks Canada) |
Computer vision and remote sensing Ecosystems and natural systems Other Classification, regression, and supervised learning Time-series analysis |
NeurIPS 2020 |
Counting Cows: Tracking Illegal Cattle Ranching From High-Resolution Satellite Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: Cattle farming is responsible for 8.8\% of greenhouse gas emissions worldwide. In addition to the methane emitted due to their digestive process, the growing need for grazing areas is an important driver of deforestation. While some regulations are in place for preserving the Amazon against deforestation, these are being flouted in various ways. Hence the need to scale and automate the monitoring of cattle ranching activities. Through a partnership with \textit{Anonymous under review}, we explore the feasibility of tracking and counting cattle at the continental scale from satellite imagery. With a license from Maxar Technologies, we obtained satellite imagery of the Amazon at 40cm resolution, and compiled a dataset of 903 images containing a total of 28498 cattle. Our experiments show promising results and highlight important directions for the next steps on both counting algorithms and the data collection processes for solving such challenges. Authors: Issam Hadj Laradji (Element AI); Pau Rodriguez (Element AI); Alfredo Kalaitzis (University of Oxford); David Vazquez (Element AI); Ross Young (Element AI); Ed Davey (Global Witness); Alexandre Lacoste (Element AI) |
Computer vision and remote sensing Agriculture, forestry and other land use Climate and Earth science Ecosystems and natural systems |
NeurIPS 2020 |
RainBench: Enabling Data-Driven Precipitation Forecasting on a Global Scale
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is expected to aggravate extreme precipitation events, directly impacting the livelihood of millions. Without a global precipitation forecasting system in place, many regions -- especially those constrained in resources to collect expensive groundstation data -- are left behind. To mitigate such unequal reach of climate change, a solution is to alleviate the reliance on numerical models (and by extension groundstation data) by enabling machine-learning-based global forecasts from satellite imagery. Though prior works exist in regional precipitation nowcasting, there lacks work in global, medium-term precipitation forecasting. Importantly, a common, accessible baseline for meaningful comparison is absent. In this work, we present RainBench, a multi-modal benchmark dataset dedicated to advancing global precipitation forecasting. We establish baseline tasks and release PyRain, a data-handling pipeline to enable efficient processing of decades-worth of data by any modeling framework. Whilst our work serves as a basis for a new chapter on global precipitation forecast from satellite imagery, the greater promise lies in the community joining forces to use our released datasets and tools in developing machine learning approaches to tackle this important challenge. Authors: Catherine Tong (University of Oxford); Christian A Schroeder de Witt (University of Oxford); Valentina Zantedeschi (GE Global Research); Daniele De Martini (University of Oxford); Alfredo Kalaitzis (University of Oxford); Matthew Chantry (University of Oxford); Duncan Watson-Parris (University of Oxford); Piotr Bilinski (University of Warsaw / University of Oxford) |
Climate and Earth science Classification, regression, and supervised learning |
NeurIPS 2020 |
Machine learning for advanced solar cell production: adversarial denoising, sub-pixel alignment and the digital twin
(Papers Track)
Abstract and authors: (click to expand)Abstract: Photovoltaic is a main pillar to achieve the transition towards a renewable energy supply. In order to continue the tremendous cost decrease of the last decades, novel cell techologies and production processes are implemented into mass production to improve cell efficiency. Raising their full potential requires novel techniques of quality assurance and data analysis. We present three use-cases along the value chain where machine learning techniques are investigated for quality inspection and process optimization: Adversarial learning to denoise wafer images, alignment of surface structuring processes via sub-pixel coordinate regression, and the development of a digital twin for wafers and solar cells for material and process analysis. Authors: Matthias Demant (Fraunhofer ISE); Leslie Kurumundayil (Fraunhofer ISE); Philipp Kunze (Fraunhofer ISE); Aditya Kovvali (Fraunhofer ISE); Alexandra Woernhoer (Fraunhofer ISE); Stefan Rein (Fraunhofer ISE) |
Power and energy Industry Classification, regression, and supervised learning Computer vision and remote sensing Unsupervised and semi-supervised learning |
NeurIPS 2020 |
Climate Change Driven Crop Yield Failures
(Papers Track)
Abstract and authors: (click to expand)Abstract: The effect of extreme temperatures, precipitation and variations in other meteorological factors affect crop yields, and hence climate change jeopardizes the entire food supply chain and dependent economic activities. We utilize Deep Neural Networks and Gaussian Processes for understanding crop yields as functions of climatological variables, and use change detection techniques to identify climatological thresholds where yield drops significantly. Authors: Somya Sharma (U. Minnesota); Deepak Ray (University of Minnesota); Snigdhansu Chatterjee (University of Minnesota) |
Agriculture, forestry and other land use Spatial deep learning |
NeurIPS 2020 |
Physics-constrained Deep Recurrent Neural Models of Building Thermal Dynamics
(Papers Track)
Abstract and authors: (click to expand)Abstract: We develop physics-constrained and control-oriented predictive deep learning models for the thermal dynamics of a real-world commercial office building. The proposed method is based on the systematic encoding of physics-based prior knowledge into a structured recurrent neural architecture. Specifically, our model mimics the structure of the building thermal dynamics model and leverages penalty methods to model inequality constraints. Additionally, we use constrained matrix parameterization based on the Perron-Frobenius theorem to bound the eigenvalues of the learned network weights. We interpret the stable eigenvalues as dissipativeness of the learned building thermal model. We demonstrate the effectiveness of the proposed approach on a dataset obtained from an office building with 20 thermal zones. Authors: Jan Drgona (Pacific Northwest National Laboratory); Aaron R Tuor (Pacific Northwest National Laboratory); Vikas Chandan (PNNL); Draguna Vrabie (PNNL) |
Hybrid physical models Buildings and cities |
NeurIPS 2020 |
Narratives and Needs: Analyzing Experiences of Cyclone Amphan Using Twitter Discourse
(Papers Track)
Abstract and authors: (click to expand)Abstract: People often turn to social media to comment upon and share information about major global events. Accordingly, social media is receiving increasing attention as a rich data source for understanding people's social, political and economic experiences of extreme weather events. In this paper, we contribute two novel methodologies that leverage Twitter discourse to characterize narratives and identify unmet needs in response to Cyclone Amphan, which affected 18 million people in May 2020. Authors: Ancil S Crayton (Booz Allen Hamilton); Joao Fonseca (NOVA Information Management School); Kanav Mehra (Independent Researcher); Jared Ross (Booz Allen Hamilton); Marcelo Sandoval-Castañeda (New York University Abu Dhabi); Michelle Ng (International Water Management Institute); Rachel von Gnechten (International Water Management Institute) |
Disaster prediction, management, and relief Natural language processing Other Unsupervised and semi-supervised learning |
NeurIPS 2020 |
FlowDB: A new large scale river flow, flash flood, and precipitation dataset
(Papers Track)
Abstract and authors: (click to expand)Abstract: Flooding results in 8 billion dollars of damage annually in the US and causes the most deaths of any weather related event. Due to climate change scientists expect more heavy precipitation events in the future. However, no current datasets exist that contain both hourly precipitation and river flow data. We introduce a novel hourly river flow and precipitation dataset and a second subset of flash flood events with damage estimates and injury counts. Using these datasets we create two challenges (1) general stream flow forecasting and (2) flash flood damage estimation. We also create a public benchmark and an Python package to enable easy adding of new models . Additionally, in the future we aim to augment our dataset with snow pack data and soil index moisture data to improve predictions Authors: Isaac Godfried (CoronaWhy) |
Time-series analysis Climate and Earth science Data mining |
NeurIPS 2020 |
Can Federated Learning Save The Planet ?
(Papers Track)
Abstract and authors: (click to expand)Abstract: Despite impressive results, deep learning-based technologies also raise severe privacy and environmental concerns induced by the training procedure often conducted in data centers. In response, alternatives to centralized training such as Federated Learning (FL) have emerged. Perhaps unexpectedly, FL in particular is starting to be deployed at a global scale by companies that must adhere to new legal demands and policies originating from governments and the civil society for privacy protection. \textit{However, the potential environmental impact related to FL remains unclear and unexplored. This paper offers the first-ever systematic study of the carbon footprint of FL.} First, we propose a rigorous model to quantify the carbon footprint, hence facilitating the investigation of the relationship between FL design and carbon emissions. Then, we compare the carbon footprint of FL to traditional centralized learning. Our findings show FL, despite being slower to converge, can be a greener technology than data center GPUs. Finally, we highlight and connect the reported results to the future challenges and trends in FL to reduce its environmental impact, including algorithms efficiency, hardware capabilities, and stronger industry transparency. Authors: Xinchi Qiu (University of Cambridge); Titouan Parcollet (University of Oxford); Daniel J Beutel (Adap GmbH / University of Cambridge); Taner Topal (Adap GmbH); Akhil Mathur (Nokia Bell Labs); Nicholas Lane (University of Cambridge and Samsung AI) |
Power and energy Federated Learning |
NeurIPS 2020 |
FireSRnet: Geoscience-driven super-resolution of future fire risk from climate change
(Papers Track)
Abstract and authors: (click to expand)Abstract: With fires becoming increasingly frequent and severe across the globe in recent years, understanding climate change’s role in fire behavior is critical for quantifying current and future fire risk. However, global climate models typically simulate fire behavior at spatial scales too coarse for local risk assessments. Therefore, we propose a novel approach towards super-resolution (SR) enhancement of fire risk exposure maps that incorporates not only 2000 to 2020 monthly satellite observations of active fires but also local information on land cover and temperature. Inspired by SR architectures, we propose an efficient deep learning model trained for SR on fire risk exposure maps. We evaluate this model on resolution enhancement and find it outperforms standard image interpolation techniques at both 4x and 8x enhancement while having comparable performance at 2x enhancement. We then demonstrate the generalizability of this SR model over northern California and New South Wales, Australia. We conclude with a discussion and application of our proposed model to climate model simulations of fire risk in 2040 and 2100, illustrating the potential for SR enhancement of fire risk maps from the latest state-of-the-art climate models. Authors: Tristan C Ballard (Sust Global, Stanford University); Gopal Erinjippurath (Sust Global) |
Computer vision and remote sensing Climate and Earth science Societal adaptation |
NeurIPS 2020 |
An Enriched Automated PV Registry: Combining Image Recognition and 3D Building Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: While photovoltaic (PV) systems are installed at an unprecedented rate, reliable information on an installation level remains scarce. As a result, automatically created PV registries are a timely contribution to optimize grid planning and operations. This paper demonstrates how aerial imagery and three-dimensional building data can be combined to create an address-level PV registry, specifying area, tilt, and orientation angles. We demonstrate the benefits of this approach for PV capacity estimation. In addition, this work presents, for the first time, a comparison between automated and officially-created PV registries. Our results indicate that our enriched automated registry proves to be useful to validate, update, and complement official registries. Authors: Benjamin Rausch (Stanford); Kevin Mayer (Stanford); Marie-Louise Arlt (Stanford); Gunther Gust (University of Freiburg); Philipp Staudt (KIT); Christof Weinhardt (Karlsruhe Institute of Technology); Dirk Neumann (Universität Freiburg); Ram Rajagopal (Stanford University) |
Power and energy Buildings and cities Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2020 |
Satellite imagery analysis for Land Use, Land Use Change and Forestry: A pilot study in Kigali, Rwanda
(Papers Track)
Abstract and authors: (click to expand)Abstract: Estimating greenhouse gases from the Agriculture, Forestry and Other land Use (AFOLU) sector is very challenging partly due to the unavailability of data (particularly for land use and land use change sectors) and inadequate experts to analyze this data in case it is available. We used Collect Earth together with Machine Learning techniques to be able to predict and classify all the land use types based on some few points collected using Collect Earth. We investigated the adoption of this tool and technology in Rwanda to help its national and sub-national inventories. The use of Collect Earth and the Machine Learning (ML) implementation will help Rwanda monitor and predict its Land Use, Land Use Change, and Forestry in a cost-effective manner whiles enhancing the quality of reports submitted to national and international bodies whiles introducing a new approach. Among the classification algorithms we tested, we had an overall classification accuracy of 97% using the Classification and Regression Trees (CART) algorithm to to predict the six land Use classes across the country. Authors: Bright Aboh (African Institute for Mathematical Sciences); Alphonse Mutabazi (UN Environment Program) |
Agriculture, forestry and other land use Classification, regression, and supervised learning |
NeurIPS 2020 |
EarthNet2021: A novel large-scale dataset and challenge for forecasting localized climate impacts
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate change is global, yet its concrete impacts can strongly vary between different locations in the same region. Seasonal weather forecasts currently operate at the mesoscale (> 1 km). For more targeted mitigation and adaptation, modelling impacts to < 100 m is needed. Yet, the relationship between driving variables and Earth’s surface at such local scales remains unresolved by current physical models. Large Earth observation datasets now enable us to create machine learning models capable of translating coarse weather information into high-resolution Earth surface forecasts encompassing localized climate impacts. Here, we define high-resolution Earth surface forecasting as video prediction of satellite imagery conditional on mesoscale weather forecasts. Video prediction has been tackled with deep learning models. Developing such models requires analysis-ready datasets. We introduce EarthNet2021, a new, curated dataset containing target spatio-temporal Sentinel 2 satellite imagery at 20 m resolution, matched with high-resolution topography and mesoscale (1.28 km) weather variables. With over 32000 samples it is suitable for training deep neural networks. Comparing multiple Earth surface forecasts is not trivial. Hence, we define the EarthNetScore, a novel ranking criterion for models forecasting Earth surface reflectance. For model intercomparison we frame EarthNet2021 as a challenge with four tracks based on different test sets. These allow evaluation of model validity and robustness as well as model applicability to extreme events and the complete annual vegetation cycle. In addition to forecasting directly observable weather impacts through satellite-derived vegetation indices, capable Earth surface models will enable downstream applications such as crop yield prediction, forest health assessments, coastline management, or biodiversity monitoring. Find data, code, and how to participate at www.earthnet.tech . Authors: Christian Requena-Mesa (Computer Vision Group, Friedrich Schiller University Jena; DLR Institute of Data Science, Jena; Max Planck Institute for Biogeochemistry, Jena); Vitus Benson (Max-Planck-Institute for Biogeochemistry); Jakob Runge (Institute of Data Science, German Aerospace Center (DLR)); Joachim Denzler (Computer Vision Group, Friedrich Schiller University Jena, Germany); Markus Reichstein (Max Planck Institute for Biogeochemistry, Jena; Michael Stifel Center Jena for Data-Driven and Simulation Science, Jena) |
Climate and Earth science Agriculture, forestry and other land use Disaster prediction, management, and relief Ecosystems and natural systems Computer vision and remote sensing Data mining Generative modeling Hybrid physical models |
NeurIPS 2020 |
DeepWaste: Applying Deep Learning to Waste Classification for a Sustainable Planet
(Papers Track)
Abstract and authors: (click to expand)Abstract: Accurate waste disposal, at the point of disposal, is crucial to fighting climate change. When materials that could be recycled or composted get diverted into landfills, they cause the emission of potent greenhouse gases such as methane. Current attempts to reduce erroneous waste disposal are expensive, inaccurate, and confusing. In this work, we propose DeepWaste, an easy-to-use mobile app, that utilizes highly optimized deep learning techniques to provide users instantaneous waste classification into trash, recycling, and compost. We experiment with several convolution neural network architectures to detect and classify waste items. Our best model, a deep learning residual neural network with 50 layers, achieves an average precision of 0.881 on the test set. We demonstrate the performance and efficiency of our app on a set of real-world images. Authors: Yash Narayan (The Nueva School) |
Other Classification, regression, and supervised learning |
NeurIPS 2020 |
Machine Learning Climate Model Dynamics: Offline versus Online Performance
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate models are complicated software systems that approximate atmospheric and oceanic fluid mechanics at a coarse spatial resolution. Typical climate forecasts only explicitly resolve processes larger than 100 km and approximate any process occurring below this scale (e.g. thunderstorms) using so-called parametrizations. Machine learning could improve upon the accuracy of some traditional physical parametrizations by learning from so-called global cloud-resolving models. We compare the performance of two machine learning models, random forests (RF) and neural networks (NNs), at parametrizing the aggregate effect of moist physics in a 3 km resolution global simulation with an atmospheric model. The NN outperforms the RF when evaluated offline on a testing dataset. However, when the ML models are coupled to an atmospheric model run at 200 km resolution, the NN-assisted simulation crashes with 7 days, while the RF-assisted simulations remain stable. Both runs produce more accurate weather forecasts than a baseline configuration, but globally averaged climate variables drift over longer timescales. Authors: Noah D Brenowitz (Vulcan Inc.); Brian Henn (Vulcan, Inc.); Spencer Clark (Vulcan, Inc.); Anna Kwa (Vulcan, Inc.); Jeremy McGibbon (Vulcan, Inc.); W. Andre Perkins (Vulcan, Inc.); Oliver Watt-Meyer (Vulcan, Inc.); Christopher S. Bretherton (Vulcan, Inc.) |
Climate and Earth science Hybrid physical models |
NeurIPS 2020 |
OGNet: Towards a Global Oil and Gas Infrastructure Database using Deep Learning on Remotely Sensed Imagery
(Papers Track)
Abstract and authors: (click to expand)Abstract: At least a quarter of the warming that the Earth is experiencing today is due to anthropogenic methane emissions. There are multiple satellites in orbit and planned for launch in the next few years which can detect and quantify these emissions; however, to attribute methane emissions to their sources on the ground, a comprehensive database of the locations and characteristics of emission sources worldwide is essential. In this work, we develop deep learning algorithms that leverage freely available high-resolution aerial imagery to automatically detect oil and gas infrastructure, one of the largest contributors to global methane emissions. We use the best algorithm, which we call OGNet, together with expert review to identify the locations of oil refineries and petroleum terminals in the U.S. We show that OGNet detects many facilities which are not present in four standard public datasets of oil and gas infrastructure. All detected facilities are associated with characteristics critical to quantifying and attributing methane emissions, including the types of infrastructure and number of storage tanks. The data curated and produced in this study is freely available at https://link/provided/in/camera/ready/version. Authors: Hao Sheng (Stanford University); Jeremy A Irvin (Stanford); Sasankh Munukutla (Stanford University); Shawn Zhang (Stanford University); Christopher Cross (Stanford University); Zutao Yang (Stanford University); Kyle Story (Descartes Labs); Rose Rustowicz (Descartes Labs); Cooper Elsworth (Descartes Labs); Mark Omara (Environmental Defense Fund); Ritesh Gautam (Environmental Defense Fund); Rob Jackson (Stanford University); Andrew Ng (Stanford University) |
Power and energy Computer vision and remote sensing |
NeurIPS 2020 |
VConstruct: Filling Gaps in Chl-a Data Using a Variational Autoencoder
(Papers Track)
Abstract and authors: (click to expand)Abstract: Remote sensing of Chlorophyll-a is vital in monitoring climate change. Chlorphylla measurements give us an idea of the algae concentrations in the ocean, which lets us monitor ocean health. However, a common problem is that the satellites used to gather the data are commonly obstructed by clouds and other artifacts. This means that time series data from satellites can suffer from spatial data loss. There are a number of algorithms that are able to reconstruct the missing parts of these images to varying degrees of accuracy, with Data INterpolating Empirical Orthogonal Functions (DINEOF) being the current standard. However, DINEOF is slow, suffers from accuracy loss in temporally homogenous waters, reliant on temporal data, and only able to generate a single potential reconstruction. We propose a machine learning approach to reconstruction of Chlorophyll-a data using a Variational Autoencoder (VAE). Our accuracy results to date are competitive with but slightly less accurate than DINEOF. We show the benefits of our method including vastly decreased computation time and ability to generate multiple potential reconstructions. Lastly, we outline our planned improvements and future work. Authors: Matthew Ehrler (University of Victoria); Neil Ernst (University of Victoria) |
Computer vision and remote sensing Climate and Earth science Ecosystems and natural systems Generative modeling |
NeurIPS 2020 |
A Comparison of Data-Driven Models for Predicting Stream Water Temperature
(Papers Track)
Abstract and authors: (click to expand)Abstract: Changes to the Earth's climate are expected to negatively impact water resources in the future. It is important to have accurate modelling of river flow and water quality to make optimal decisions for water management. Machine learning and deep learning models have become promising methods for making such hydrological predictions. Using these models, however, requires careful consideration both of data constraints and of model complexity for a given problem. Here, we use machine learning (ML) models to predict monthly stream water temperature records at three monitoring locations in the Northwestern United States with long-term datasets, using meteorological data as predictors. We fit three ML models: a Multiple Linear Regression, a Random Forest Regression, and a Support Vector Regression, and compare them against two baseline models: a persistence model and historical model. We show that all three ML models are reasonably able to predict mean monthly stream temperatures with root mean-squared errors (RMSE) ranging from 0.63-0.91 degrees Celsius. Of the three ML models, Support Vector Regression performs the best with an error of 0.63-0.75 degrees Celsius. However, all models perform poorly on extreme values of water temperature. We identify the need for machine learning approaches for predicting extreme values for variables such as water temperature, since it has significant implications for stream ecosystems and biota. Authors: Helen Weierbach (Lawrence Berkeley); Aranildo Lima (Aquatic Informatics); Danielle Christianson (Lawrence Berkeley National Lab); Boris Faybishenko (Lawrence Berkeley National Lab); Val Hendrix (Lawrence Berkeley National Lab); Charuleka Varadharajan (Lawrence Berkeley National Lab) |
Ecosystems and natural systems Climate and Earth science Classification, regression, and supervised learning Time-series analysis |
NeurIPS 2020 |
Automated Salmonid Counting in Sonar Data
(Papers Track)
Abstract and authors: (click to expand)Abstract: The prosperity of salmonids is crucial for several ecological and economic functions. Accurately counting spawning salmonids during their seasonal migration is essential in monitoring threatened populations, assessing the efficacy of recovery strategies, guiding fishing season regulations, and supporting the management of commercial and recreational fisheries. While several different methods exist for counting river fish, they all rely heavily on human involvement, introducing a hefty financial and time burden. In this paper we present an automated fish counting method that utilizes data captured from ARIS sonar cameras to detect and track salmonids migrating in rivers. Our results show that our fully automated system has a 19.3% per-clip error when compared to human counting performance. There is room to improve, but our system can already decrease the amount of time field biologists and fishery managers need to spend manually watching ARIS clips. Authors: Peter Kulits (Caltech); Angelina Pan (Caltech); Sara M Beery (Caltech); Erik Young (Trout Unlimited); Pietro Perona (California Institute of Technology); Grant Van Horn (Cornell University) |
Ecosystems and natural systems Agriculture, forestry and other land use Climate and Earth science Industry Computer vision and remote sensing Time-series analysis |
NeurIPS 2020 |
Short-term PV output prediction using convolutional neural network: learning from an imbalanced sky images dataset via sampling and data augmentation
(Papers Track)
Abstract and authors: (click to expand)Abstract: Integrating photovoltaics (PV) into electricity grids is one of the major pathways towards a low-carbon energy system. However, the biggest challenge is the strong fluctuation in PV power generation. In recent years, sky-image-based PV output prediction using deep neural networks has emerged as a promising approach to alleviate the uncertainty. Despite the research surge in exploring different model architectures, there is currently no study addressing the issue of learning from an imbalanced sky images dataset, the outcome of which would be highly beneficial for improving the reliability of existing and new solar forecasting models. In this study, we train convolutional neural network (CNN) models from an imbalanced sky images dataset for two disparate PV output prediction tasks, i.e., nowcast and forecast. We empirically examine the efficacy of using different sampling and data augmentation approaches to create synthesized dataset for model development. We further apply a three-stage selection approach to determine the optimal sampling approach, data augmentation technique and oversampling rate. Authors: Yuhao Nie (Stanford University); Ahmed S Zamzam (The National Renewable Energy Laboratory); Adam Brandt (Stanford University) |
Computer vision and remote sensing Climate and Earth science Classification, regression, and supervised learning |
NeurIPS 2020 |
OfficeLearn: An OpenAI Gym Environment for Building Level Energy Demand Response
(Papers Track)
Abstract and authors: (click to expand)Abstract: Energy Demand Response (DR) will play a crucial role in balancing renewable energy generation with demand as grids decarbonize. There is growing interest in developing Reinforcement Learning (RL) techniques to optimize DR pricing, as pricing set by electric utilities often cannot take behavioral irrationality into account. However, so far, attempts to standardize RL efforts in this area do not exist. In this paper, we present a first of the kind OpenAI gym environment for testing DR with occupant level building dynamics. We demonstrate the variety of parameters built into our office environment allowing the researcher to customize a building to meet their specifications of interest. We hope that this work enables future work in DR in buildings. Authors: Lucas Spangher (U.C. Berkeley); Akash Gokul (University of California at Berkeley); Utkarsha Agwan (U.C. Berkeley); Joseph Palakapilly (UC Berkeley); Manan Khattar (University of California at Berkeley); Akaash Tawade (University of California at Berkeley); Costas J. Spanos (University of California at Berkeley) |
Buildings and cities Power and energy Reinforcement learning and control Time-series analysis |
NeurIPS 2020 |
Machine Learning for Glacier Monitoring in the Hindu Kush Himalaya
(Papers Track)
Abstract and authors: (click to expand)Abstract: Glacier mapping is key to ecological monitoring in the Hindu Kush Himalaya region. Climate change poses a risk to individuals whose livelihoods depend on the health of glacier ecosystems. In this work, we present a machine learning based approach to support ecological monitoring, with a focus on glaciers. Our approach is based on semi-automated mapping from satellite images. We utilize readily available remote sensing data to create a model to identify and outline both clean ice and debris-covered glaciers from satellite imagery. We also release data and develop a web tool that allows experts to visualize and correct model predictions, with the ultimate aim of accelerating the glacier mapping process. Authors: Shimaa Baraka (Mila); Benjamin Akera (Makerere University); Bibek Aryal (The University of Texas at El Paso); Tenzing Sherpa (International Centre for Integrated Mountain Development); Finu Shrestha (International Centre for Integrated Mountain Development); Anthony Ortiz (Microsoft); Kris Sankaran (University of Wisconsin-Madison); Juan M Lavista Ferres (Microsoft); Mir A Matin (International Center for Integrated Mountain Development); Yoshua Bengio (Mila) |
Ecosystems and natural systems Computer vision and remote sensing |
NeurIPS 2020 |
Data-driven modeling of cooling demand in a commercial building
(Papers Track)
Abstract and authors: (click to expand)Abstract: Heating, ventilation, and air conditioning (HVAC) systems account for 30% of the total energy consumption in buildings. Design and implementation of energy-efficient schemes can play a pivotal role in minimizing energy usage. As an important first step towards improved HVAC system controls, this study proposes a new framework for modeling the thermal response of buildings by leveraging data measurements and formulating a data-driven system identification model. The proposed method combines principal component analysis (PCA) to identify the most significant predictors that influence the cooling demand of a building with an auto-regressive integrated moving average with exogenous variables (ARIMAX) model. The performance of the developed model was evaluated both analytically and visually. It was found that our PCA-based ARIMAX (2-0-5) model was able to accurately forecast the cooling demand for the prediction horizon of 7 days. In this work, the actual measurements from a university campus building are used for model development and validation. Authors: Aqsa Naeem (Stanford University); Sally Benson (Stanford University); Jacques de Chalendar (Stanford University) |
Buildings and cities Power and energy Hybrid physical models |
NeurIPS 2020 |
Investigating two super-resolution methods for downscaling precipitation: ESRGAN and CAR
(Papers Track)
Abstract and authors: (click to expand)Abstract: In an effort to provide optimal inputs to downstream modeling systems (e.g., a hydrodynamics model that simulates the water circulation of a lake), we hereby strive to enhance resolution of precipitation fields from a weather model by up to 9x. We test two super-resolution models: the enhanced super-resolution generative adversarial networks (ESRGAN) proposed in 2017, and the content adaptive resampler (CAR) proposed in 2020. Both models outperform simple bicubic interpolation, with the ESRGAN exceeding expectations for accuracy. We make several proposals for extending the work to ensure it can be useful tool for quantifying the impact of climate change on local ecosystems while removing reliance on energy-intensive, high-resolution weather model simulations. Authors: Campbell Watson (IBM); Chulin Wang (Northwestern University); Tim Lynar (University of New South Wales); Komminist Weldemariam (IBM Research) |
Classification, regression, and supervised learning Ecosystems and natural systems |
NeurIPS 2020 |
Emerging Trends of Sustainability Reporting in the ICT Industry: Insights from Discriminative Topic Mining
(Papers Track)
Abstract and authors: (click to expand)Abstract: The Information and Communication Technologies (ICT) industry has a considerable climate change impact and accounts for approximately 3 percent of global carbon emissions. Despite the increasing availability of sustainability reports provided by ICT companies, we still lack a systematic understanding of what has been disclosed at an industry level. In this paper, we make the first major effort to use modern unsupervised learning methods to investigate the sustainability reporting themes and trends of the ICT industry over the past two decades. We build a cross-sector dataset containing 22,534 environmental reports from 1999 to 2019, of which 2,187 are ICT specific. We then apply CatE, a text embedding based topic modeling method, to mine specific keywords that ICT companies use to report on climate change and energy. As a result, we identify (1) important shifts in ICT companies' climate change narratives from physical metrics towards climate-related disasters, (2) key organizations with large influence on ICT companies, and (3) ICT companies' increasing focus on data center and server energy efficiency. Authors: Lin Shi (Stanford University); Nhi Truong Vu (Stanford University) |
Industry Climate policy Data mining Natural language processing Unsupervised and semi-supervised learning |
NeurIPS 2020 |
Loosely Conditioned Emulation of Global Climate Models With Generative Adversarial Networks
(Papers Track)
Abstract and authors: (click to expand)Abstract: Climate models encapsulate our best understanding of the Earth system, allowing research to be conducted on its future under alternative assumptions of how human-driven climate forces are going to evolve. An important application of climate models is to provide metrics of mean and extreme climate changes, particularly under these alternative future scenarios, as these quantities drive the impacts of climate on society and natural systems. Because of the need to explore a wide range of alternative scenarios and other sources of uncertainties in a computationally efficient manner, climate models can only take us so far, as they require significant computational resources, especially when attempting to characterize extreme events, which are rare and thus demand long and numerous simulations in order to accurately represent their changing statistics. Here we use deep learning in a proof of concept that lays the foundation for emulating global climate model output for different scenarios. We train two "loosely conditioned" Generative Adversarial Networks (GANs) that emulate daily precipitation output from a fully coupled Earth system model: one GAN modeling Fall-Winter behavior and the other Spring-Summer. Our GANs are trained to produce spatiotemporal samples: 32 days of precipitation over a 64x128 regular grid discretizing the globe. We evaluate the generator with a set of related performance metrics based upon KL divergence, and find the generated samples to be nearly as well matched to the test data as the validation data is to test. We also find the generated samples to accurately estimate the mean number of dry days and mean longest dry spell in the 32 day samples. Our trained GANs can rapidly generate numerous realizations at a vastly reduced computational expense, compared to large ensembles of climate models, which greatly aids in estimating the statistics of extreme events. Authors: Alex Ayala (Western Washington University); Chris Drazic (Western Washington University); Brian Hutchinson (Western Washington University); Ben Kravitz (Indiana University); Claudia Tebaldi (Joint Global Change Research Institute) |
Generative modeling Climate and Earth science |
NeurIPS 2020 |
The Peruvian Amazon Forestry Dataset: A Leaf Image Classification Corpus
(Papers Track)
Abstract and authors: (click to expand)Abstract: This paper introduces the Peruvian Amazon Forestry Dataset, which includes 59,441 leaves samples from ten of the most profitable and endangered Amazon timber-tree species. Besides, the proposal includes a background removal algorithm to feed a fine-tuned CNN. We evaluate the quantitative (accuracy metric) and qualitative (visual interpretation) impacts of each stage by ablation experiments. The results show a 96.64 % training accuracy and 96.52 % testing accuracy on the VGG-19 model. Furthermore, the visual interpretation of the model evidences that leaf venations have the highest correlation in the plant recognition task. Authors: Gerson Waldyr Vizcarra Aguilar (San Pablo Catholic University); Danitza Bermejo (Universidad Nacional del Altiplano); Manasses A. Mauricio (Universidad Católica San Pablo); Ricardo Zarate (Instituto de Investigaciones de la Amazonía Peruana); Erwin Dianderas (Instituto de Investigaciones de la Amazonía Peruana) |
Agriculture, forestry and other land use Meta- and transfer learning |
NeurIPS 2020 |
Spatiotemporal Features Improve Fine-Grained Butterfly Image Classification
(Papers Track)
Abstract and authors: (click to expand)Abstract: Understanding the changing distributions of butterflies gives insight into the impacts of climate change across ecosystems and is a prerequisite for conservation efforts. eButterfly is a citizen science website created to allow people to track the butterfly species around them and use these observations to contribute to research. However, correctly identifying butterfly species is a challenging task for non-specialists and currently requires the involvement of entomologists to verify the labels of novice users on the website. We have developed a computer vision model to label butterfly images from eButterfly automatically, decreasing the need for human experts. We employ a model that incorporates geographic and temporal information of where and when the image was taken, in addition to the image itself. We show that we can successfully apply this spatiotemporal model for fine-grained image recognition, significantly improving the accuracy of our classification model compared to a baseline image recognition system trained on the same dataset. Authors: Marta Skreta (University of Toronto); Sasha Luccioni (Mila); David Rolnick (McGill University, Mila) |
Ecosystems and natural systems Classification, regression, and supervised learning Computer vision and remote sensing |
NeurIPS 2020 |
High-resolution global irrigation prediction with Sentinel-2 30m data
(Papers Track)
Abstract and authors: (click to expand)Abstract: An accurate and precise understanding of global irrigation usage is crucial for a variety of climate science efforts. Irrigation is highly energy-intensive, and as population growth continues at its current pace, increases in crop need and water usage will have an impact on climate change. Precise irrigation data can help with monitoring water usage and optimizing agricultural yield, particularly in developing countries. Irrigation data, in tandem with precipitation data, can be used to predict water budgets as well as climate and weather modeling. With our research, we produce an irrigation prediction model that combines unsupervised clustering of Normalized Difference Vegetation Index (NDVI) temporal signatures with a precipitation heuristic to label the months that irrigation peaks for each cropland cluster in a given year. We have developed a novel irrigation model and Python package ("Irrigation30") to generate 30m resolution irrigation predictions of cropland worldwide. With a small crowdsourced test set of cropland coordinates and irrigation labels, using a fraction of the resources used by the state-of-the-art NASA-funded GFSAD30 project with irrigation data limited to India and Australia, our model was able to achieve consistency scores in excess of 97% and an accuracy of 92% in a small geo-diverse randomly sampled test set. Authors: Will Hawkins (UC Berkeley); Weixin Wu (UC Berkeley); Sonal Thakkar (UC Berkeley); Puya Vahabi (UC Berkeley); Alberto Todeschini (UC Berkeley) |
Agriculture, forestry and other land use Unsupervised and semi-supervised learning |
NeurIPS 2020 |
Do Occupants in a Building exhibit patterns in Energy Consumption? Analyzing Clusters in Energy Social Games
(Papers Track)
Abstract and authors: (click to expand)Abstract: Energy use in buildings account for approximately half of global electricity consumption and a significant amount of CO2 emissions. To encourage energy efficient behavior among occupants in a building, energy social games have emerged to be a successful strategy leveraging human-in-the-loop strategy and engaging users in a competitive game with incentives for energy efficient behavior. Prior works involve an incentive design mechanism which is dependent on knowledge of utility functions (energy use behavior) for the users, which is hard to compute when the number of users is high, common in buildings. We propose that the utilities can be grouped to a relatively small number of clusters, which can then be targeted with tailored incentives. Proposed work performs the above segmentation by learning the features leading to human decision making towards energy usage in competitive environment. We propose a graphical lasso based approach with explainable nature for such segmentation, by studying the feature correlations in a real-world energy social game dataset. Authors: Hari Prasanna Das (UC Berkeley); Ioannis C. Konstantakopoulos (UC Berkeley); Aummul Baneen Manasawala (UC Berkeley); Tanya Veeravalli (UC Berkeley); Huihan Liu (UC Berkeley); Costas J. Spanos (University of California at Berkeley) |
Buildings and cities Other |
NeurIPS 2020 | In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness (Pap |