Minisymposia
A minisymposium is a two-hour session on a topic of current importance in computational science that showcases research related to domain science, applied mathematics, computer science and software engineering, and is an ideal platform for promoting interdisciplinary communication.
Forty-eight minisymposia have been accepted for PASC18. Typically, the format of a minisymposium is four 30-minute presentations, although in some cases the final 30-minute slot may consist of a panel discussion or open discussion forum.
Organizer(s):
Andreas Vitalis (University of Zurich, Switzerland)
, Marco Bacci (University of Zurich, Switzerland)
, Amedeo Caflisch (University of Zurich, Switzerland)
Track(s):
Life Sciences, Emerging Application Domains, Computer Science and Applied Mathematics
A common problem in numerical optimization and sampling is the detection of relevant states. These could be, for instance, the local minima on a rugged parameter surface or the transition state of a chemical reaction. For most cases, an exhaustive search for the optimal solution is intractable. Here, we focus on parallel sampling and optimization strategies relying on multiple replicas, most prominently, adaptive methods where all simulated replicas use the same propagator and sample the same underlying surface. In these methods, replica intercommunication is used to provide a global assessment as to which replicas are most interesting. This implies, in general, periodic data mining steps across replicas. Furthermore, in order to extract and utilize the gained information in post-processing, data must often be stored, which poses stringent data management and analysis challenges in particular for high-dimensional cases. The minisymposium wishes to discuss the following questions: What are meaningful and easily generalizable tools, strategies, and algorithms to guide the sampling/exploration? How can we maintain scalability and load balance? What types of post-processing algorithms can be applied to the generated data, and are those scalable to provide on-the-fly solutions to direct the exploration?
13:30 - 14:00
Applications and Advancements of the Progress-Index Guided Sampling Method in Molecular Dynamics Simulations
, Marco Bacci (University of Zurich, Switzerland)
+ Abstract { "session": {"id":"sess153","title":"MS01 - Adaptive Parallel Strategies for the Exploration of Challenging Search Spaces with Applications in Particle Simulations and Optimization, Part I","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Samarkand Room","contributors":[{"type":"Session Chair","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Life Sciences","Emerging Application Domains","Computer Science and Applied Mathematics"],"slots":[{"id":"symp112","type":"minisymposia","title":"MS01 - Adaptive Parallel Strategies for the Exploration of Challenging Search Spaces with Applications in Particle Simulations and Optimization, Part I","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"A common problem in numerical optimization and sampling is the detection of relevant states. These could be, for instance, the local minima on a rugged parameter surface or the transition state of a chemical reaction. For most cases, an exhaustive search for the optimal solution is intractable. Here, we focus on parallel sampling and optimization strategies relying on multiple replicas, most prominently, adaptive methods where all simulated replicas use the same propagator and sample the same underlying surface. In these methods, replica intercommunication is used to provide a global assessment as to which replicas are most interesting. This implies, in general, periodic data mining steps across replicas. Furthermore, in order to extract and utilize the gained information in post-processing, data must often be stored, which poses stringent data management and analysis challenges in particular for high-dimensional cases. The minisymposium wishes to discuss the following questions: What are meaningful and easily generalizable tools, strategies, and algorithms to guide the sampling\/exploration? How can we maintain scalability and load balance? What types of post-processing algorithms can be applied to the generated data, and are those scalable to provide on-the-fly solutions to direct the exploration?","bio":"","contributors":[{"type":"Organizer","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Marco","last_name":"Bacci","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Amedeo","last_name":"Caflisch","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa159","type":"child","title":"FAST - Goal-Oriented Adaptive Sampling of Protein Dynamics","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Molecular dynamics simulations are a powerful \u2028means of understanding conformational changes. However, it \u2028is still difficult to simulate biologically relevant time scales\u2028 without the use of specialized supercomputers. Here, we \u2028introduce a goal-oriented sampling method, called fluctuation \u2028amplification of specific traits (FAST), for extending the\u2028 capabilities of commodity hardware. FAST works by iteratively running a batch of simulations, building a Markov state model (MSM), and then using the last MSM to decide what subset of the states that have been discovered so far it would be most valuable to run the next set of simulations from. Importantly, the ranking function we use to choose starting points for each batch of simulations includes an exploitation term that favors states with desirable geometric properties and an exploration term that favors poorly sampled states. FAST outperforms conventional simulations and other MSM-based adaptive sampling algorithms by at least an order of magnitude. Furthermore, FAST yields both the proper thermodynamics and kinetics because, in contrast to many other enhanced sampling algorithms, the Hamiltonian used during individual simulations is unperturbed. Therefore, we expect FAST to be of great utility for a wide range of applications.","bio":"","contributors":[{"type":"Author","first_name":"Gregory","last_name":"Bowman","affiliation":"Washington University School of Medicine","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Gregory","last_name":"Bowman","affiliation":"Washington University School of Medicine","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa166","type":"child","title":"Applications and Advancements of the Progress-Index Guided Sampling Method in Molecular Dynamics Simulations","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Computer simulations of molecules offer unparalleled spatial and temporal resolution to the characterization of atomistic processes. However, the complexity and ruggedness of the free energy landscape often hamper the usefulness of brute-force molecular dynamics as most of the simulation time is spent in a few metastable states. To help in overcoming these limitations, we have recently developed the Progress Index Guided Sampling (PIGS) method. PIGS is a multi-replica unsupervised adaptive sampling protocol that aims to maximize phase space coverage by reseeding redundant replicas with interesting ones. Interesting replicas are detected on-the-fly by using a heuristic, which is informed by scalable data-mining algorithms that take as input a user-defined representation of the simulated system. Therefore, PIGS allows focusing the sampling enhancement on selected regions of interest without the need for reaction coordinates or external potentials. This also means that it is a straightforward task to retrieve in post-processing the thermodynamics and kinetics of the system within a Markovian approximation of the true dynamics. Here we show results from real-life simulations of biomolecules in explicit solvent performed with sampling enhancement on segments of different length. Additionally, we present algorithmic advancements, especially a fully scalable implementation of PIGS in the simulation engine GROMACS.","filename":"msa166s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Marco","last_name":"Bacci","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Cassiano","last_name":"Langini","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Amedeo","last_name":"Caflisch","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Marco","last_name":"Bacci","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa286","type":"child","title":"iMapD: Intrinsic Map Dynamics Exploration for Uncharted Effective Free Energy Surfaces","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Molecular dynamics (MD) simulations explore the configurational space of physical systems at their natural pace. Simulations extensively revisit typical configurations until rare and interesting transition events occur. Biasing the simulator away from the region already explored can, therefore, drastically accelerate the discovery of new regions, and is often the only way to gain access to all relevant states. We propose iMapD, an enhanced exploration simulation framework, where MD and machine learning adaptively bootstrap each other. Machine learning guides the search for important configurations by processing information from previous explorations. This search proceeds iteratively in an algorithmically orchestrated fashion without advance knowledge of suitable collective variables. The enhanced exploration occurs through strategically initialized short unbiased simulations, and does not rely on any unphysical force steering the dynamics of the system. Applied to a molecular sensor of lipid saturation in membranes, a dimer dissociation pathway not seen in millisecond long equilibrium simulations is discovered at the second iteration. In combination with path sampling techniques, iMapD enables us to characterize even the slowest dynamics of the system.","bio":"","contributors":[{"type":"Author","first_name":"Roberto","last_name":"Covino","affiliation":"Max Planck Institute of Biophysics","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Hendrik","last_name":"Jung","affiliation":"Max Planck Institute of Biophysics","country":"Germany","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Eliodoro","last_name":"Chiavazzo","affiliation":"Politecnico di Torino","country":"Italy","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Gerhard","last_name":"Hummer","affiliation":"Max Planck Institute of Biophysics","country":"Germany","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Ioannis","last_name":"Kevrekidis","affiliation":"Johns Hopkins University","country":"United States of America","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Roberto","last_name":"Covino","affiliation":"Max Planck Institute of Biophysics","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa141","type":"child","title":"Exploiting Task-Based Parallelism in Bayesian Uncertainty Quantification and Stochastic Optimization","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The mapping of Uncertainty Quantification (UQ) to computing architectures is a very challenging process and at the same time an essential aspect for all fields of simulation science. In UQ the aggregate scientific knowledge is obtained by ensembles of simulation runs, created dynamically by the employed UQ algorithm and scheduled on the available compute nodes. \u03a04U is a computational framework that exploits the capabilities of massively parallel and hybrid computer architectures for large scale Bayesian uncertainty quantification, reliability analysis and stochastic optimization. At the core of the framework, a platform-agnostic task-parallel library supports nested parallelism and provides automatic load balancing on computing architectures that range from multicore systems to hybrid GPU clusters. The software is open-source and includes HPC implementations of algorithms such as Transitional Markov Chain Monte Carlo and Approximate Bayesian Computation. Experimental results using representative applications demonstrate the flexibility and excellent scalability of the proposed framework.","bio":"","contributors":[{"type":"Author","first_name":"Panagiotis","last_name":"Hadjidoukas","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Panagiotis","last_name":"Hadjidoukas","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa166","type":"child","title":"Applications and Advancements of the Progress-Index Guided Sampling Method in Molecular Dynamics Simulations","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Computer simulations of molecules offer unparalleled spatial and temporal resolution to the characterization of atomistic processes. However, the complexity and ruggedness of the free energy landscape often hamper the usefulness of brute-force molecular dynamics as most of the simulation time is spent in a few metastable states. To help in overcoming these limitations, we have recently developed the Progress Index Guided Sampling (PIGS) method. PIGS is a multi-replica unsupervised adaptive sampling protocol that aims to maximize phase space coverage by reseeding redundant replicas with interesting ones. Interesting replicas are detected on-the-fly by using a heuristic, which is informed by scalable data-mining algorithms that take as input a user-defined representation of the simulated system. Therefore, PIGS allows focusing the sampling enhancement on selected regions of interest without the need for reaction coordinates or external potentials. This also means that it is a straightforward task to retrieve in post-processing the thermodynamics and kinetics of the system within a Markovian approximation of the true dynamics. Here we show results from real-life simulations of biomolecules in explicit solvent performed with sampling enhancement on segments of different length. Additionally, we present algorithmic advancements, especially a fully scalable implementation of PIGS in the simulation engine GROMACS.","filename":"msa166s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Marco","last_name":"Bacci","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Cassiano","last_name":"Langini","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Amedeo","last_name":"Caflisch","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Marco","last_name":"Bacci","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Marco","last_name":"Bacci","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Cassiano","last_name":"Langini","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Amedeo","last_name":"Caflisch","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"4","is_presenter":false}] } Presentation
MS02 - Capability Computing, Performance Portability, and Co-Design in the PASC Projects
Sydney Room
Organizer(s):
Joost VandeVondele (ETH Zurich / CSCS, Switzerland)
Track(s):
Life Sciences, Computer Science and Applied Mathematics, Climate and Weather, Chemistry and Materials, Solid Earth Dynamics
Selected PASC projects will present the scientific challenge they aim to solve by using high-end supercomputers, and in particular the computational approach adopted. The topics include astrophysics (smooth particle hydrodynamics), numerical weather and climate (stencils and grids), linear algebra for electronic structure (sparse matrix and tensor operations), and biomedical applications (fluid-structure interaction, machine learning).
The focus will be on aspects concerning (i) capability computing: how to scale to several hundreds/thousands of compute nodes, including the use of communication optimal algorithms and asynchronous communication; (ii) performance portability: how to address the growing diversity in hardware on a compute node, including generic software design, and auto-tuning; and (iii) co-design in these projects: how to engage with vendors to optimally exploit current hardware, and to provide feedback that has or will influence next-generation hardware.
Topics include: side by side comparisons of multi-core, many core, and GPU compute nodes; optimization techniques for flops, memory bandwidth, or network performance; JIT compilation of machine specific kernels; programming approaches such as the use of domain specific languages (DSLs), remote memory access (RMA), task based programming.
The focus will be on aspects concerning (i) capability computing: how to scale to several hundreds/thousands of compute nodes, including the use of communication optimal algorithms and asynchronous communication; (ii) performance portability: how to address the growing diversity in hardware on a compute node, including generic software design, and auto-tuning; and (iii) co-design in these projects: how to engage with vendors to optimally exploit current hardware, and to provide feedback that has or will influence next-generation hardware.
Topics include: side by side comparisons of multi-core, many core, and GPU compute nodes; optimization techniques for flops, memory bandwidth, or network performance; JIT compilation of machine specific kernels; programming approaches such as the use of domain specific languages (DSLs), remote memory access (RMA), task based programming.
13:30 - 14:00
Portability and Scalability of the COSMO Weather and Climate Model on Heterogeneous Architectures
, Carlos E. Osuna (MeteoSwiss, Switzerland)
+ Abstract { "session": {"id":"sess159","title":"MS02 - Capability Computing, Performance Portability, and Co-Design in the PASC Projects","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Sydney Room","contributors":[{"type":"Session Chair","first_name":"Joost","last_name":"VandeVondele","affiliation":"ETH Zurich \/ CSCS","country":"Switzerland"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Life Sciences","Computer Science and Applied Mathematics","Climate and Weather","Chemistry and Materials","Solid Earth Dynamics"],"slots":[{"id":"symp108","type":"minisymposia","title":"MS02 - Capability Computing, Performance Portability, and Co-Design in the PASC Projects","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Selected PASC projects will present the scientific challenge they aim to solve by using high-end supercomputers, and in particular the computational approach adopted. The topics include astrophysics (smooth particle hydrodynamics), numerical weather and climate (stencils and grids), linear algebra for electronic structure (sparse matrix and tensor operations), and biomedical applications (fluid-structure interaction, machine learning).\u003Cbr \/\u003E\u003Cbr \/\u003EThe focus will be on aspects concerning (i) capability computing: how to scale to several hundreds\/thousands of compute nodes, including the use of communication optimal algorithms and asynchronous communication; (ii) performance portability: how to address the growing diversity in hardware on a compute node, including generic software design, and auto-tuning; and (iii) co-design in these projects: how to engage with vendors to optimally exploit current hardware, and to provide feedback that has or will influence next-generation hardware.\u003Cbr \/\u003E\u003Cbr \/\u003ETopics include: side by side comparisons of multi-core, many core, and GPU compute nodes; optimization techniques for flops, memory bandwidth, or network performance; JIT compilation of machine specific kernels; programming approaches such as the use of domain specific languages (DSLs), remote memory access (RMA), task based programming.","bio":"","contributors":[{"type":"Organizer","first_name":"Joost","last_name":"VandeVondele","affiliation":"ETH Zurich \/ CSCS","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Joost","last_name":"VandeVondele","affiliation":"ETH Zurich \/ CSCS","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa123","type":"child","title":"SPH-EXA: Optimizing Smooth Particle Hydrodynamics for Exascale Computing","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Understanding fluid and plasma behavior under complex physical conditions forms the basis of highly\u00a0important research questions. Numerical simulations of fluids in\u00a0astrophysics and computational fluid dynamics are among the most computationally demanding calculations in\u00a0terms of sustained floating point operations per second, which are expected to benefit from the upcoming Exascale high-performance computers. A well-known hydrodynamics solver is\u00a0Smooth Particle Hydrodynamics (SPH). The parallelization of codes implementing the SPH method is not trivial\u00a0due to the nature of the\u00a0physics and the algorithms involved. The SPH-EXA project targets the design of a\u00a0scalable\u00a0and\u00a0fault tolerant\u00a0SPH-EXA mini-app. The scientific insights from the optimized executions of the SPH-EXA mini-app will be\u00a0incorporated into current SPH-based production codes in the fields of astrophysics\u00a0(SPHYNX, ChaNGa), and CFD (SPH-flow), resulting in, what we call, the SPH-EXA\u00a0version of those codes. The SPH-EXA mini-app will employ advanced parallelization methods, scalable dynamic load balancing within single compute nodes and across massive numbers of nodes, and fault-tolerance mechanisms to sustain its scalable execution. An essential outcome of this project is a repository of experiments to enable verification, reproducibility, and portability of the execution and simulation results to other SPH-EXA codes.","bio":"","contributors":[{"type":"Author","first_name":"Florina","last_name":"Ciorba","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Rub\u00e9n","last_name":"Cabezon","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"2","is_presenter":true},{"type":"Author","first_name":"Lucio","last_name":"Mayer","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"David","last_name":"Imbert","affiliation":"NEXTFLOW Software","country":"France","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Florina","last_name":"Ciorba","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Rub\u00e9n","last_name":"Cabezon","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"2","is_presenter":true}]},{"id":"msa287","type":"child","title":"Portability and Scalability of the COSMO Weather and Climate Model on Heterogeneous Architectures","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The clear evidence of the importance of high horizontal resolutions in the quality and accuracy of weather and climate simulations is demanding unprecedented computational capacity. Previous developments from PASC projects resulted in a GPU capable version of the COSMO model that provides significant speedup in the time to solution on NVIDIA GPUs, which allowed the first operational GPU enabled weather forecast system at MeteoSwiss as well as European-scale decadal climate simulations at unprecedented resolutions of 2 km. In order to improve the performance portability of COSMO in heterogeneous systems, recent efforts are supporting and optimizing COSMO for Xeon Phi KNL architecture and further improving the performance on accelerators exploiting advanced optimizations like task parallelism on the model that improve the performance on strong scalability regimes on massively parallel accelerators. Additionally, recent developments of a toolchain allow to combine all these advanced optimizations with a performance model specific to the domain and configuration of the model. We present results and performance comparisons for the COSMO 1km resolution configuration on Xeon Phi KNL and NVIDIA P100 systems.","filename":"msa287s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Stefan","last_name":"Moosbrugger","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Felix","last_name":"Thaler","affiliation":"ETH Zurich \/ CSCS","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Oliver","last_name":"Fuhrer","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Torsten","last_name":"Hoefler","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa155","type":"child","title":"Implementing a Sparse Tensor Linear Algebra Library for Electronic Structure Calculations","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Sparse matrix-matrix multiplication is an essential building block for a wide range of algorithms in various scientific fields. For this task, the sparse matrix library DBCSR (Distributed Block Compressed Sparse Row) has been developed. Its multi-layered structure automatically takes care of and optimizes several computational aspects like parallelism (MPI, OpenMP, CUDA), data (cache) locality and on-the-fly filtering. As part of the PASC project, we are extending the library to include tensor algebra, based on the realization that most tensor operations can be mapped on matrix multiplications. First, we introduce the library, describing the repository on GitHub, how to compile it, the test methods, the tutorial, and the API in Fortran and C\/C++. Then we give details on the implemented solutions to tackle scalability on large node-counts, based on a communication optimal algorithm with dynamically distributed load-balancing, implemented with remote memory access MPI communications. At the node level, we present a novel approach for the generation of optimal kernels based on autotuning and JIT compilation. Finally, we report the performance results, in terms of time-to-solution and energy-to-solution, of DBCSR on systems with Intel Xeon CPUs, Intel Xeon Phi Knights Landing (KNL) processors, and systems with NVIDIA GPUs.","bio":"","contributors":[{"type":"Author","first_name":"Juerg","last_name":"Hutter","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Alfio","last_name":"Lazzaro","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Ilia","last_name":"Sivkov","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Patrick","last_name":"Seewald","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Juerg","last_name":"Hutter","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa214","type":"child","title":"AV-FLOW: A High-Performance Library for Fluid-Structure Interaction with Complex Materials and Transitional Flow","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The flow systems of the heart and the great blood vessels comprise complex materials (soft tissue) and flows at moderately high Reynolds numbers which may undergo transition from laminar to turbulent flow. Computational modelling of such fluid-structure interaction (FSI) problems requires efficient high-fidelity solvers for structure and flow as well as a robust scheme for coupling the two phases. We present a new FSI framework based on the immersed boundary method which has been developed for modelling cardiovascular flow systems. This high-performance library is optimized for parallel execution on the Cray XC40\/50 system at CSCS. The structural and flow solvers use geometric domain decomposition for parallelization on multi-core multi-node platforms. The coupling between the structure and flow uses a parallel transfer library to minimize communication between the different computing cores. Compute intensive kernels were written in CUDA to make use of the GPGPUs on the nodes of the Cray XC40\/50. We show performance benchmarks and different FSI test cases including a benchmark for solid inertia and a problem with transitional flow past an obstacle made of a complex material with fibers.","bio":"","contributors":[{"type":"Author","first_name":"Dominik","last_name":"Obrist","affiliation":"University of Bern","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Dominik","last_name":"Obrist","affiliation":"University of Bern","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa287","type":"child","title":"Portability and Scalability of the COSMO Weather and Climate Model on Heterogeneous Architectures","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The clear evidence of the importance of high horizontal resolutions in the quality and accuracy of weather and climate simulations is demanding unprecedented computational capacity. Previous developments from PASC projects resulted in a GPU capable version of the COSMO model that provides significant speedup in the time to solution on NVIDIA GPUs, which allowed the first operational GPU enabled weather forecast system at MeteoSwiss as well as European-scale decadal climate simulations at unprecedented resolutions of 2 km. In order to improve the performance portability of COSMO in heterogeneous systems, recent efforts are supporting and optimizing COSMO for Xeon Phi KNL architecture and further improving the performance on accelerators exploiting advanced optimizations like task parallelism on the model that improve the performance on strong scalability regimes on massively parallel accelerators. Additionally, recent developments of a toolchain allow to combine all these advanced optimizations with a performance model specific to the domain and configuration of the model. We present results and performance comparisons for the COSMO 1km resolution configuration on Xeon Phi KNL and NVIDIA P100 systems.","filename":"msa287s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Stefan","last_name":"Moosbrugger","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Felix","last_name":"Thaler","affiliation":"ETH Zurich \/ CSCS","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Oliver","last_name":"Fuhrer","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Torsten","last_name":"Hoefler","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Stefan","last_name":"Moosbrugger","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Felix","last_name":"Thaler","affiliation":"ETH Zurich \/ CSCS","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Oliver","last_name":"Fuhrer","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Torsten","last_name":"Hoefler","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"5","is_presenter":false}] } Presentation
Organizer(s):
Felix Kubler (University of Zurich, Switzerland)
Track(s):
Emerging Application Domains
Discrete-time, infinite-horizon, general equilibrium models are routinely used in macroeconomics and in public finance for exploring the quantitative features of model economies and for counterfactual policy analysis. One important question concerns the importance of household heterogeneity for the amplification and propagation of macroeconomic shocks.
In this session we bring together leading young researchers in the field to present alternative approaches to the computation of equilibria in dynamic stochastic models with heterogeneous agents and/or with financial frictions. Three of the papers directly propose new methods for the solution of models with a continuum of ex post heterogeneous agents.
In this session we bring together leading young researchers in the field to present alternative approaches to the computation of equilibria in dynamic stochastic models with heterogeneous agents and/or with financial frictions. Three of the papers directly propose new methods for the solution of models with a continuum of ex post heterogeneous agents.
13:30 - 14:00
Solving Heterogeneous Agent Models with Nonconvex Optimization Problems: Linearization and Beyond
, Michael Reiter (Institute for Advanced Studies, Austria)
+ Abstract { "session": {"id":"sess161","title":"MS03 - Computational Aspects of Heterogeneous Agents Macro","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Nairobi Room","contributors":[{"type":"Session Chair","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains"],"slots":[{"id":"symp125","type":"minisymposia","title":"MS03 - Computational Aspects of Heterogeneous Agents Macro","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Discrete-time, infinite-horizon, general equilibrium models are routinely used in macroeconomics and in public finance for exploring the quantitative features of model economies and for counterfactual policy analysis.\u00a0One important question concerns the importance of household heterogeneity for the amplification and propagation of macroeconomic shocks.\u003Cbr \/\u003E\u003Cbr \/\u003EIn this session we bring together leading young researchers in the field to present alternative approaches to the computation of equilibria in dynamic stochastic models with heterogeneous agents and\/or with financial frictions.\u00a0Three of the papers directly propose new methods for the solution of models with a continuum of ex post heterogeneous agents.","bio":"","contributors":[{"type":"Organizer","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa124","type":"child","title":"Exploiting MIT Shocks in Heterogeneous-Agent Economies: The Impulse Response as a Numerical Derivative","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We propose a new method for computing equilibria in heterogeneous-agent models with aggregate uncertainty. The idea relies on an assumption that linearization offers a good approximation; we share this assumption with existing linearization methods. However, unlike those methods, the approach here does not rely on direct derivation of first-order Taylor terms. It also does not use recursive methods, whereby aggregates and prices would be expressed as linear functions of the state, usually a very high-dimensional object (such as the wealth distribution). Rather, we rely merely on solving nonlinearly for a deterministic transition path: we study the equilibrium response to a single, small \u0022MIT shock\u0027\u0027 carefully. We then regard this impulse response path as a numerical derivative in sequence space and hence provide our linearized solution directly using this path. The method can easily be extended to the case of many shocks and computation time rises linearly in the number of shocks. We also propose a set of checks on whether linearization is a good approximation. We assert that our method is the simplest, most transparent linearization technique among currently known methods. The key numerical tool required to implement it is value-function iteration, using a very limited set of state variables.","bio":"","contributors":[{"type":"Author","first_name":"Timo","last_name":"Boppart","affiliation":"Stockholm University","country":"Sweden","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Per","last_name":"Krusell","affiliation":"Stockholm University","country":"Sweden","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Kurt","last_name":"Mitman","affiliation":"Stockholm University","country":"Sweden","bio":"","order":"3","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Kurt","last_name":"Mitman","affiliation":"Stockholm University","country":"Sweden","bio":"","order":"3","is_presenter":true}]},{"id":"msa151","type":"child","title":"Solving Heterogeneous Agent Models with Nonconvex Optimization Problems: Linearization and Beyond","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this talk I present a methodology for the solution of dynamic stochastic general equilibrium models with heterogeneous agents, with an emphasis on models with nonconvex decision problems. First I present an implementation of a linearization method that makes the solution of large models feasible, using dimension reduction methods both on the states and on the equilibrium variables of the model. The linearized solution serves as a starting point to compute global approximation solutions, by providing a guess of the value functions and a suitable collocation grid. The method is applied to a two-asset model where households hold a financial asset and face a discrete housing choice.","filename":"msa151s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Michael","last_name":"Reiter","affiliation":"Institute for Advanced Studies","country":"Austria","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Michael","last_name":"Reiter","affiliation":"Institute for Advanced Studies","country":"Austria","bio":"","order":"1","is_presenter":true}]},{"id":"msa109","type":"child","title":"Comparative Valuation Dynamics in Models with Financing Restrictions","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This contribution develops a theoretical framework to nest many recent dynamic stochastic general equilibrium economies with financial frictions into one common generic model. Our goal is to study the macroeconomic and asset pricing properties of this class of models, identify common features and highlight areas where these models depart from each other. In order to characterize the asset pricing implications of this family of models, we study their term structure of risk prices and risk exposures, the natural extension of impulse response functions in economic environments exhibiting non-linear dynamics. Given our continuous time setup with a Brownian information structure, our study requires us to solve systems of non-linear partial differential equations of up to 4 state variables; the occasionally binding nature of our financial frictions give rise to a free boundary problem in the 4-dimensional state space. We use finite difference schemes coded in C++ and an iterative procedure to compute the equilibrium dynamics, the stationary distribution, the shock exposure and cost elasticities, and rho-mixing coefficients of our model.","filename":"msa109s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Fabrice","last_name":"Tourre","affiliation":"Northwestern University","country":"United States of America","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Paymon","last_name":"Khorrami","affiliation":"University of Chicago","country":"United States of America","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Paymon","last_name":"Khorrami","affiliation":"University of Chicago","country":"United States of America","bio":"","order":"2","is_presenter":true}]},{"id":"msa143","type":"child","title":"Self-Justified Equilibria: Existence and Computation","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this talk we introduce \u0022self-justified\u0022 equilibrium as a solution concept in stochastic general equilibrium models with a large number of heterogeneous agents. In each period agents trade in assets to maximize the sum of current utility and forecasted future utility. Current prices ensure that markets clear and agents forecast the probability distribution of future prices and consumption on the basis of current endogenous variables and the current exogenous shock. The forecasts are self-justified in the sense that agents use forecasting functions that are optimal within a given class of functions and that can be viewed as optimally trading off the accuracy of the forecast and its complexity. We show that self-justified equilibria always exist and we develop a computational method to approximate them numerically. By restricting the complexity of agents\u0027 forecasts we can solve models with a very large number of heterogeneous agents. Errors can be directly interpreted.","bio":"","contributors":[{"type":"Author","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Simon","last_name":"Scheidegger","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa151","type":"child","title":"Solving Heterogeneous Agent Models with Nonconvex Optimization Problems: Linearization and Beyond","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this talk I present a methodology for the solution of dynamic stochastic general equilibrium models with heterogeneous agents, with an emphasis on models with nonconvex decision problems. First I present an implementation of a linearization method that makes the solution of large models feasible, using dimension reduction methods both on the states and on the equilibrium variables of the model. The linearized solution serves as a starting point to compute global approximation solutions, by providing a guess of the value functions and a suitable collocation grid. The method is applied to a two-asset model where households hold a financial asset and face a discrete housing choice.","filename":"msa151s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Michael","last_name":"Reiter","affiliation":"Institute for Advanced Studies","country":"Austria","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Michael","last_name":"Reiter","affiliation":"Institute for Advanced Studies","country":"Austria","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Michael","last_name":"Reiter","affiliation":"Institute for Advanced Studies","country":"Austria","bio":"","order":"1","is_presenter":true}] } Presentation
14:00 - 14:30
Comparative Valuation Dynamics in Models with Financing Restrictions
, Paymon Khorrami (University of Chicago, United States of America)
+ Abstract { "session": {"id":"sess161","title":"MS03 - Computational Aspects of Heterogeneous Agents Macro","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Nairobi Room","contributors":[{"type":"Session Chair","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains"],"slots":[{"id":"symp125","type":"minisymposia","title":"MS03 - Computational Aspects of Heterogeneous Agents Macro","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Discrete-time, infinite-horizon, general equilibrium models are routinely used in macroeconomics and in public finance for exploring the quantitative features of model economies and for counterfactual policy analysis.\u00a0One important question concerns the importance of household heterogeneity for the amplification and propagation of macroeconomic shocks.\u003Cbr \/\u003E\u003Cbr \/\u003EIn this session we bring together leading young researchers in the field to present alternative approaches to the computation of equilibria in dynamic stochastic models with heterogeneous agents and\/or with financial frictions.\u00a0Three of the papers directly propose new methods for the solution of models with a continuum of ex post heterogeneous agents.","bio":"","contributors":[{"type":"Organizer","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa124","type":"child","title":"Exploiting MIT Shocks in Heterogeneous-Agent Economies: The Impulse Response as a Numerical Derivative","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We propose a new method for computing equilibria in heterogeneous-agent models with aggregate uncertainty. The idea relies on an assumption that linearization offers a good approximation; we share this assumption with existing linearization methods. However, unlike those methods, the approach here does not rely on direct derivation of first-order Taylor terms. It also does not use recursive methods, whereby aggregates and prices would be expressed as linear functions of the state, usually a very high-dimensional object (such as the wealth distribution). Rather, we rely merely on solving nonlinearly for a deterministic transition path: we study the equilibrium response to a single, small \u0022MIT shock\u0027\u0027 carefully. We then regard this impulse response path as a numerical derivative in sequence space and hence provide our linearized solution directly using this path. The method can easily be extended to the case of many shocks and computation time rises linearly in the number of shocks. We also propose a set of checks on whether linearization is a good approximation. We assert that our method is the simplest, most transparent linearization technique among currently known methods. The key numerical tool required to implement it is value-function iteration, using a very limited set of state variables.","bio":"","contributors":[{"type":"Author","first_name":"Timo","last_name":"Boppart","affiliation":"Stockholm University","country":"Sweden","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Per","last_name":"Krusell","affiliation":"Stockholm University","country":"Sweden","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Kurt","last_name":"Mitman","affiliation":"Stockholm University","country":"Sweden","bio":"","order":"3","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Kurt","last_name":"Mitman","affiliation":"Stockholm University","country":"Sweden","bio":"","order":"3","is_presenter":true}]},{"id":"msa151","type":"child","title":"Solving Heterogeneous Agent Models with Nonconvex Optimization Problems: Linearization and Beyond","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this talk I present a methodology for the solution of dynamic stochastic general equilibrium models with heterogeneous agents, with an emphasis on models with nonconvex decision problems. First I present an implementation of a linearization method that makes the solution of large models feasible, using dimension reduction methods both on the states and on the equilibrium variables of the model. The linearized solution serves as a starting point to compute global approximation solutions, by providing a guess of the value functions and a suitable collocation grid. The method is applied to a two-asset model where households hold a financial asset and face a discrete housing choice.","filename":"msa151s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Michael","last_name":"Reiter","affiliation":"Institute for Advanced Studies","country":"Austria","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Michael","last_name":"Reiter","affiliation":"Institute for Advanced Studies","country":"Austria","bio":"","order":"1","is_presenter":true}]},{"id":"msa109","type":"child","title":"Comparative Valuation Dynamics in Models with Financing Restrictions","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This contribution develops a theoretical framework to nest many recent dynamic stochastic general equilibrium economies with financial frictions into one common generic model. Our goal is to study the macroeconomic and asset pricing properties of this class of models, identify common features and highlight areas where these models depart from each other. In order to characterize the asset pricing implications of this family of models, we study their term structure of risk prices and risk exposures, the natural extension of impulse response functions in economic environments exhibiting non-linear dynamics. Given our continuous time setup with a Brownian information structure, our study requires us to solve systems of non-linear partial differential equations of up to 4 state variables; the occasionally binding nature of our financial frictions give rise to a free boundary problem in the 4-dimensional state space. We use finite difference schemes coded in C++ and an iterative procedure to compute the equilibrium dynamics, the stationary distribution, the shock exposure and cost elasticities, and rho-mixing coefficients of our model.","filename":"msa109s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Fabrice","last_name":"Tourre","affiliation":"Northwestern University","country":"United States of America","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Paymon","last_name":"Khorrami","affiliation":"University of Chicago","country":"United States of America","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Paymon","last_name":"Khorrami","affiliation":"University of Chicago","country":"United States of America","bio":"","order":"2","is_presenter":true}]},{"id":"msa143","type":"child","title":"Self-Justified Equilibria: Existence and Computation","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this talk we introduce \u0022self-justified\u0022 equilibrium as a solution concept in stochastic general equilibrium models with a large number of heterogeneous agents. In each period agents trade in assets to maximize the sum of current utility and forecasted future utility. Current prices ensure that markets clear and agents forecast the probability distribution of future prices and consumption on the basis of current endogenous variables and the current exogenous shock. The forecasts are self-justified in the sense that agents use forecasting functions that are optimal within a given class of functions and that can be viewed as optimally trading off the accuracy of the forecast and its complexity. We show that self-justified equilibria always exist and we develop a computational method to approximate them numerically. By restricting the complexity of agents\u0027 forecasts we can solve models with a very large number of heterogeneous agents. Errors can be directly interpreted.","bio":"","contributors":[{"type":"Author","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Simon","last_name":"Scheidegger","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Felix","last_name":"Kubler","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa109","type":"child","title":"Comparative Valuation Dynamics in Models with Financing Restrictions","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This contribution develops a theoretical framework to nest many recent dynamic stochastic general equilibrium economies with financial frictions into one common generic model. Our goal is to study the macroeconomic and asset pricing properties of this class of models, identify common features and highlight areas where these models depart from each other. In order to characterize the asset pricing implications of this family of models, we study their term structure of risk prices and risk exposures, the natural extension of impulse response functions in economic environments exhibiting non-linear dynamics. Given our continuous time setup with a Brownian information structure, our study requires us to solve systems of non-linear partial differential equations of up to 4 state variables; the occasionally binding nature of our financial frictions give rise to a free boundary problem in the 4-dimensional state space. We use finite difference schemes coded in C++ and an iterative procedure to compute the equilibrium dynamics, the stationary distribution, the shock exposure and cost elasticities, and rho-mixing coefficients of our model.","filename":"msa109s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Fabrice","last_name":"Tourre","affiliation":"Northwestern University","country":"United States of America","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Paymon","last_name":"Khorrami","affiliation":"University of Chicago","country":"United States of America","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Paymon","last_name":"Khorrami","affiliation":"University of Chicago","country":"United States of America","bio":"","order":"2","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Fabrice","last_name":"Tourre","affiliation":"Northwestern University","country":"United States of America","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Paymon","last_name":"Khorrami","affiliation":"University of Chicago","country":"United States of America","bio":"","order":"2","is_presenter":true}] } Presentation
Organizer(s):
Jean-Roch Vlimant (California Institute of Technology, United States of America)
, Sofia Vallecorsa (CERN, Switzerland)
, Wahid Bhimji (Lawrence Berkeley National Laboratory, United States of America)
Track(s):
Computer Science and Applied Mathematics, Physics
The power of artificial neural nets at executing challenging tasks learned from data is very attractive to fields of physical science and especially to High Energy Physics. In recent years, there have been a significant number of articles reporting promising results with applying deep learning to HEP challenges. By virtue of the very large number of parameters of artificial neural nets, with deep and wide architecture, trained with stochastic gradient descent, it is mandatory to process a lot of representative data in order to obtain accurate models. Training of such models requires a tremendous amount of computing, and commonly takes days, if not weeks, to converge. GP-GPU technology has enabled a lot of this computation, thanks to the high level of parallelisation of the formalism of artificial neural net, but more however can be gained in parallelised calculation of the stochastic gradient descent. Supercomputing facilities are particularly suited for distributed training of deep neural nets, thanks to their large computation power and excellent connectivity. This minisymposium will address the current state-of-the-art and present performances in training models for high energy physics, with a particular view to software availability and to foster further utilization of supercomputers.
14:00 - 14:30
Extreme Scale Deep Learning at NERSC
, Thorsten Kurth (Lawrence Berkeley National Laboratory, United States of America)
+ Abstract { "session": {"id":"sess166","title":"MS04 - Distributed Training of Deep Neural Net Models for High Energy Physics","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Osaka Room","contributors":[{"type":"Session Chair","first_name":"Jean-Roch","last_name":"Vlimant","affiliation":"California Institute of Technology","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Computer Science and Applied Mathematics","Physics"],"slots":[{"id":"symp142","type":"minisymposia","title":"MS04 - Distributed Training of Deep Neural Net Models for High Energy Physics","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"The power of artificial neural nets at executing challenging tasks learned from data is very attractive to fields of physical science and especially to High Energy Physics. In recent years, there have been a significant number of articles reporting promising results with applying deep learning to HEP challenges. By virtue of the very large number of parameters of artificial neural nets, with deep and wide architecture, trained with stochastic gradient descent, it is mandatory to process a lot of representative data in order to obtain accurate models. Training of such models requires a tremendous amount of computing, and commonly takes days, if not weeks, to converge. GP-GPU technology has enabled a lot of this computation, thanks to the high level of parallelisation of the formalism of artificial neural net, but more however can be gained in parallelised calculation of the stochastic gradient descent. Supercomputing facilities are particularly suited for distributed training of deep neural nets, thanks to their large computation power and excellent connectivity. This minisymposium will address the current state-of-the-art and present performances in training models for high energy physics, with a particular view to software availability and to foster further utilization of supercomputers.","bio":"","contributors":[{"type":"Organizer","first_name":"Jean-Roch","last_name":"Vlimant","affiliation":"California Institute of Technology","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Wahid","last_name":"Bhimji","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Jean-Roch","last_name":"Vlimant","affiliation":"California Institute of Technology","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa193","type":"child","title":"Large Scale Training for Model Optimization","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In the recent years, several studies have demonstrated the benefit of using deep learning to solve typical tasks related to high energy physics data taking and analysis. The computational need for inference of a model once trained is rather modest and does not usually need specific treatment. The training of neural net models requires a lot of data, especially for deep models with numerous parameters. Training of such models has been made tractable with the improvement of optimization methods and the advent of GPUs well adapted to tackle the task of training neural nets. It is important to scale up the available network-training resources and to provide tools for optimal large-scale trainings. One of the avenues to further accelerate the training is via data parallelism, in which the computation of the gradients is computed on multiple subsets of the data in parallel and used collectively to update the model toward the optimum parameters. Several frameworks exist for performing distributed training, all with their strengths and limitations. In this context, our development of a new training workflow, which scales on multi-node\/multi-GPU architectures with an eye to deployment on high performance computing machines is described.","bio":"","contributors":[{"type":"Author","first_name":"Felice","last_name":"Pantaleo","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Maurizio","last_name":"Pierini","affiliation":"CERN","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Jean-Roch","last_name":"Vlimant","affiliation":"California Institute of Technology","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Thong","last_name":"Nguyen","affiliation":"California Institute of Technology","country":"United States of America","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Felice","last_name":"Pantaleo","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa191","type":"child","title":"Training Generative Adversarial Models over Distributed Computing System","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In the High Energy Physics field, simulation of the interaction of particles in detectors material is a computing intensive task, even more so with complex and fined grained detectors. The complete and most accurate simulation of particle\/matter interaction is primordial while calibrating and understanding the detector, but is seldomly required at physics analysis level, once several detector effects can hide slight imperfection in simulation. Some level of approximation is therefore acceptable and less computationally intensive approaches can be implemented. We present a fast simulation based on conditional generative adversarial networks. We use a dataset composed of the energy deposition from electron, photons, charged and neutral hadrons in a fine grained digital calorimeter. The training of these models is quite computing intensive, even with the help of GPGPU, and we propose a method to train them over multiple nodes and GPGPU using a standard message passing interface. We report on the scalings of time-to-solution. Further tuning of hyper-parameter of the models are rendered tractable and we present the physics performance of the best model obtained via a Bayesian optimization using gaussian processes. We demonstrate how a high performance computing center can be utilized to globally optimize these kinds of models.","bio":"","contributors":[{"type":"Author","first_name":"Gul Rukh","last_name":"Khattak","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Jean-Roch","last_name":"Vlimant","affiliation":"California Institute of Technology","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Federico","last_name":"Carminati","affiliation":"CERN","country":"Switzerland","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Gul Rukh","last_name":"Khattak","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa182","type":"child","title":"Extreme Scale Deep Learning at NERSC","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We present various studies on very large scale distributed deep learning on HPC systems including the ~10k node Intel Xeon-Phi-based Cori system at NERSC. We explore CNN classification architectures and generative adversarial networks for HEP problems using large images corresponding to full LHC detectors and high-resolution cosmology convergence maps. We have explored distributed scaling in different deep-learning frameworks, including Caffe, TensorFlow and PyTorch with different communication layers, i.e. Google RPC or MPI-based approaches such as Intel MLSL, Uber Horovod and Cray\u2019s CPE ML Plugin. We describe various approaches for scaling out the training of single models up to the full Cori system. We further discuss recent work contrasting performance with different frameworks, systems and system architectures.","filename":"msa182s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Thorsten","last_name":"Kurth","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Wahid","last_name":"Bhimji","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Thorsten","last_name":"Kurth","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa260","type":"child","title":"Practical Scaling Techniques","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The need for large scale training of neural networks is stemming from the advent of ever growing labeled datasets in data science combined with the successes of deep learning at achieving super-human performance at pattern recognition tasks and others. Fast and powerful GP-GPU have enabled such trainings thanks to an impressive level of parallelisation of computation. There remain however large problems which may take days to weeks to converge. To this end, additional level of parallelisation across computing units are used for additional speed up. We present an overview of the practical techniques which can be used for scaling throughput of model training.","bio":"","contributors":[{"type":"Author","first_name":"Peter","last_name":"Messmer","affiliation":"NVIDIA Inc.","country":"Switzerland","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Fernanda","last_name":"Foertter","affiliation":"NVIDIA Inc.","country":"Switzerland","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Fernanda","last_name":"Foertter","affiliation":"NVIDIA Inc.","country":"Switzerland","bio":"","order":"2","is_presenter":true}]}]}, "slot": {"id":"msa182","type":"child","title":"Extreme Scale Deep Learning at NERSC","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We present various studies on very large scale distributed deep learning on HPC systems including the ~10k node Intel Xeon-Phi-based Cori system at NERSC. We explore CNN classification architectures and generative adversarial networks for HEP problems using large images corresponding to full LHC detectors and high-resolution cosmology convergence maps. We have explored distributed scaling in different deep-learning frameworks, including Caffe, TensorFlow and PyTorch with different communication layers, i.e. Google RPC or MPI-based approaches such as Intel MLSL, Uber Horovod and Cray\u2019s CPE ML Plugin. We describe various approaches for scaling out the training of single models up to the full Cori system. We further discuss recent work contrasting performance with different frameworks, systems and system architectures.","filename":"msa182s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Thorsten","last_name":"Kurth","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Wahid","last_name":"Bhimji","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Thorsten","last_name":"Kurth","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Thorsten","last_name":"Kurth","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Wahid","last_name":"Bhimji","affiliation":"Lawrence Berkeley National Laboratory","country":"United States of America","bio":"","order":"2","is_presenter":false}] } Presentation
Organizer(s):
Gerhard Wellein (Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany)
, Georg Hager (Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany)
, Helmar Burkhart (University of Basel, Switzerland)
Track(s):
Solid Earth Dynamics, Physics, Life Sciences, Engineering, Emerging Application Domains, Computer Science and Applied Mathematics, Climate and Weather, Chemistry and Materials
Achieving hardware and energy efficiency is important for current large-scale numerical simulations and will be a key component in the exascale era. In a world of heterogeneous, highly parallel computer architectures with deep memory hierarchies, complex application scenarios, and a broad spectrum of algorithms, a thorough analysis and understanding of the complex interaction of software, data structures, algorithms, and hardware features, a.k.a. performance engineering, is required for implementing codes that allow for portable performance on the computer generations to come. The minisymposium addresses a broad range of topics in performance engineering for modern HPC architectures, ranging from recent advances in performance models and tools supporting a "white-box" performance engineering approach to application performance tuning cases studies and "black-box" solutions. The presentations will point out the potentials and limitations of performance engineering activities and demonstrate the wide spectrum of performance models used in the performance engineering, including simple performance expectations, automatic model parameter selections, and analytic models.
13:00 - 13:30
Performance Engineering - Why and How?
, Georg Hager (Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany)
+ Abstract { "session": {"id":"sess174","title":"MS05 - Foundations and Applications of Performance Engineering","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Singapore Room","contributors":[{"type":"Session Chair","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Solid Earth Dynamics","Physics","Life Sciences","Engineering","Emerging Application Domains","Computer Science and Applied Mathematics","Climate and Weather","Chemistry and Materials"],"slots":[{"id":"symp156","type":"minisymposia","title":"MS05 - Foundations and Applications of Performance Engineering","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Achieving hardware and energy efficiency is important for current large-scale numerical simulations and will be a key component in the exascale era. In a world of heterogeneous, highly parallel computer architectures with deep memory hierarchies, complex application scenarios, and a broad spectrum of algorithms, a thorough analysis and understanding of the complex interaction of software, data structures, algorithms, and hardware features, a.k.a. performance engineering, is required for implementing codes that allow for portable performance on the computer generations to come. The minisymposium addresses a broad range of topics in performance engineering for modern HPC architectures, ranging from recent advances in performance models and tools supporting a \u0022white-box\u0022 performance engineering approach to application performance tuning cases studies and \u0022black-box\u0022 solutions. The presentations will point out the potentials and limitations of performance engineering activities and demonstrate the wide spectrum of performance models used in the performance engineering, including simple performance expectations, automatic model parameter selections, and analytic models.","bio":"","contributors":[{"type":"Organizer","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Helmar","last_name":"Burkhart","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa145","type":"child","title":"Performance Engineering - Why and How?","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We give an overview of Performance Engineering (PE) techniques used in scientific computing. Starting from a motivation based on resource efficiency, we demonstrate how PE can support computational science along several directions of thrust: Classification, insight, prediction, and optimization. There is a wide range of PE techniques, all of which have a modeling component of some kind. Such models come in all shapes and sizes, but we classify them on a scale from black to gray to white: Black-box models ignore all or most of the actual \u0022inner workings\u0022 of hardware-software interactions and try to classify or predict interesting metrics automatically, based mostly on measurements. White-box models try to derive useful predictions from first principles, i.e., known properties of the hardware and the software. The wide range of \u0022gray-box\u0022 models in between bridge the gap and use the best of both worlds. Examples from physics and high performance computing are given.","filename":"msa145s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa224","type":"child","title":"Towards a Discipline of Performance Engineering: Lessons Learned from Stencil Kernel Benchmarks","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"An accurate measure of performance is often challenging, and the measurement process is seldom well documented and accurate, raising a credibility problem concerning the collected data. In this talk, we show how performance models and tools can work together. The selected test cases are kernels belonging to the stencil pattern, that is present in several scientific applications, ranging from geophysics to astronomy, fluid dynamics, image processing, and weather forecasts. First, we comment on how to pass from a description of a stencil to pseudo code and move to a modeling phase based on the \u0022Kerncraft\u0022 tool for automatic Roofline and Execution-Cache-Memory performance modeling. After the automatic generation of compilable source code, we will focus on how to ensure the reproducibility of the performance results of its execution, using \u0022PROVA!\u0022 a distributed workflow and system management tool for reproducible research. Knowing that a specific code performs accordingly with the model(s) can drive to the identification of relevant bottlenecks and therefore to potential optimizations. The ultimate goal is to generalize our approach to modeling, predicting and benchmarking, to a general application context.","filename":"msa224s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa158","type":"child","title":"Holistic Performance Engineering for Sparse Iterative Solvers","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In many applications, sparse (linear and\/or eigenvalue) solvers take up a large fraction of the overall runtime. We believe that the increasingly complex hardware of today\u0027s and future HPC systems has led to a gap in the understanding of the performance achieved by actual applications, many of which are still using a monolithic \u0027MPI only\u0027 approach despite the heterogeneous nature of the hardware. We have developed a new sparse solver library PHIST (https:\/\/bitbucket.org\/essex\/phist\/) that defines a simple \u0022kernel interface\u0022 layer inspired by MPI. Algorithms implemented in PHIST are portable in terms of software and performance as they only call building blocks of linear algebra via this interface. We have introduced simple performance models for these basic building blocks at the interface level, so that regardless of the backend providing the implementation, an overview of the optimization potential on the kernel level can be obtained, and performance pitfalls in the application (e.g. strided memory accesses) may be revealed. Available backends for PHIST include established libraries such as Trilinos\/Epetra or PETSc, as well as more recent \u0022MPI+X\u0022 approaches as implemented in Trilinos\/Tpetra or our own kernel library GHOST (https:\/\/bitbucket.org\/essex\/ghost).","filename":"msa158s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa206","type":"child","title":"Machine Learning Framework for Performance Coverage Analysis","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Proxy applications are written to represent subsets of performance behaviors of larger, and more complex applications that often have distribution restrictions. They enable easy evaluation of these behaviors across systems, e.g., for procurement or co-design purposes. However, the intended correlation between the performance behaviors of proxy applications and their parent codes is often based solely on the developer\u0027s intuition. In this paper, we present novel machine learning techniques to methodically quantify the coverage of performance behaviors of parent codes by their proxy applications. We have developed a framework, VERITAS, to answer these questions in the context of on-node performance: (a) which hardware resources are covered by a proxy application and how well, and (b) which resources are important, but not covered. Since 2016, a more general machine learning framework has been developed around VERITAS which leverages deep learning techniques to automatically learn feature space and present information in a more intuitive fashion.","bio":"","contributors":[{"type":"Author","first_name":"Tanzima Z.","last_name":"Islam","affiliation":"Western Washington University","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Jayaraman J.","last_name":"Thiagarajan","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Abhinav","last_name":"Bhatele","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Martin","last_name":"Schulz","affiliation":"TU Munich","country":"Germany","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Todd","last_name":"Gamblin","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tanzima Z.","last_name":"Islam","affiliation":"Western Washington University","country":"United States of America","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa145","type":"child","title":"Performance Engineering - Why and How?","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We give an overview of Performance Engineering (PE) techniques used in scientific computing. Starting from a motivation based on resource efficiency, we demonstrate how PE can support computational science along several directions of thrust: Classification, insight, prediction, and optimization. There is a wide range of PE techniques, all of which have a modeling component of some kind. Such models come in all shapes and sizes, but we classify them on a scale from black to gray to white: Black-box models ignore all or most of the actual \u0022inner workings\u0022 of hardware-software interactions and try to classify or predict interesting metrics automatically, based mostly on measurements. White-box models try to derive useful predictions from first principles, i.e., known properties of the hardware and the software. The wide range of \u0022gray-box\u0022 models in between bridge the gap and use the best of both worlds. Examples from physics and high performance computing are given.","filename":"msa145s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}] } Presentation
13:30 - 14:00
Towards a Discipline of Performance Engineering: Lessons Learned from Stencil Kernel Benchmarks
, Danilo Guerrera (University of Basel, Switzerland)
+ Abstract { "session": {"id":"sess174","title":"MS05 - Foundations and Applications of Performance Engineering","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Singapore Room","contributors":[{"type":"Session Chair","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Solid Earth Dynamics","Physics","Life Sciences","Engineering","Emerging Application Domains","Computer Science and Applied Mathematics","Climate and Weather","Chemistry and Materials"],"slots":[{"id":"symp156","type":"minisymposia","title":"MS05 - Foundations and Applications of Performance Engineering","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Achieving hardware and energy efficiency is important for current large-scale numerical simulations and will be a key component in the exascale era. In a world of heterogeneous, highly parallel computer architectures with deep memory hierarchies, complex application scenarios, and a broad spectrum of algorithms, a thorough analysis and understanding of the complex interaction of software, data structures, algorithms, and hardware features, a.k.a. performance engineering, is required for implementing codes that allow for portable performance on the computer generations to come. The minisymposium addresses a broad range of topics in performance engineering for modern HPC architectures, ranging from recent advances in performance models and tools supporting a \u0022white-box\u0022 performance engineering approach to application performance tuning cases studies and \u0022black-box\u0022 solutions. The presentations will point out the potentials and limitations of performance engineering activities and demonstrate the wide spectrum of performance models used in the performance engineering, including simple performance expectations, automatic model parameter selections, and analytic models.","bio":"","contributors":[{"type":"Organizer","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Helmar","last_name":"Burkhart","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa145","type":"child","title":"Performance Engineering - Why and How?","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We give an overview of Performance Engineering (PE) techniques used in scientific computing. Starting from a motivation based on resource efficiency, we demonstrate how PE can support computational science along several directions of thrust: Classification, insight, prediction, and optimization. There is a wide range of PE techniques, all of which have a modeling component of some kind. Such models come in all shapes and sizes, but we classify them on a scale from black to gray to white: Black-box models ignore all or most of the actual \u0022inner workings\u0022 of hardware-software interactions and try to classify or predict interesting metrics automatically, based mostly on measurements. White-box models try to derive useful predictions from first principles, i.e., known properties of the hardware and the software. The wide range of \u0022gray-box\u0022 models in between bridge the gap and use the best of both worlds. Examples from physics and high performance computing are given.","filename":"msa145s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa224","type":"child","title":"Towards a Discipline of Performance Engineering: Lessons Learned from Stencil Kernel Benchmarks","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"An accurate measure of performance is often challenging, and the measurement process is seldom well documented and accurate, raising a credibility problem concerning the collected data. In this talk, we show how performance models and tools can work together. The selected test cases are kernels belonging to the stencil pattern, that is present in several scientific applications, ranging from geophysics to astronomy, fluid dynamics, image processing, and weather forecasts. First, we comment on how to pass from a description of a stencil to pseudo code and move to a modeling phase based on the \u0022Kerncraft\u0022 tool for automatic Roofline and Execution-Cache-Memory performance modeling. After the automatic generation of compilable source code, we will focus on how to ensure the reproducibility of the performance results of its execution, using \u0022PROVA!\u0022 a distributed workflow and system management tool for reproducible research. Knowing that a specific code performs accordingly with the model(s) can drive to the identification of relevant bottlenecks and therefore to potential optimizations. The ultimate goal is to generalize our approach to modeling, predicting and benchmarking, to a general application context.","filename":"msa224s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa158","type":"child","title":"Holistic Performance Engineering for Sparse Iterative Solvers","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In many applications, sparse (linear and\/or eigenvalue) solvers take up a large fraction of the overall runtime. We believe that the increasingly complex hardware of today\u0027s and future HPC systems has led to a gap in the understanding of the performance achieved by actual applications, many of which are still using a monolithic \u0027MPI only\u0027 approach despite the heterogeneous nature of the hardware. We have developed a new sparse solver library PHIST (https:\/\/bitbucket.org\/essex\/phist\/) that defines a simple \u0022kernel interface\u0022 layer inspired by MPI. Algorithms implemented in PHIST are portable in terms of software and performance as they only call building blocks of linear algebra via this interface. We have introduced simple performance models for these basic building blocks at the interface level, so that regardless of the backend providing the implementation, an overview of the optimization potential on the kernel level can be obtained, and performance pitfalls in the application (e.g. strided memory accesses) may be revealed. Available backends for PHIST include established libraries such as Trilinos\/Epetra or PETSc, as well as more recent \u0022MPI+X\u0022 approaches as implemented in Trilinos\/Tpetra or our own kernel library GHOST (https:\/\/bitbucket.org\/essex\/ghost).","filename":"msa158s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa206","type":"child","title":"Machine Learning Framework for Performance Coverage Analysis","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Proxy applications are written to represent subsets of performance behaviors of larger, and more complex applications that often have distribution restrictions. They enable easy evaluation of these behaviors across systems, e.g., for procurement or co-design purposes. However, the intended correlation between the performance behaviors of proxy applications and their parent codes is often based solely on the developer\u0027s intuition. In this paper, we present novel machine learning techniques to methodically quantify the coverage of performance behaviors of parent codes by their proxy applications. We have developed a framework, VERITAS, to answer these questions in the context of on-node performance: (a) which hardware resources are covered by a proxy application and how well, and (b) which resources are important, but not covered. Since 2016, a more general machine learning framework has been developed around VERITAS which leverages deep learning techniques to automatically learn feature space and present information in a more intuitive fashion.","bio":"","contributors":[{"type":"Author","first_name":"Tanzima Z.","last_name":"Islam","affiliation":"Western Washington University","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Jayaraman J.","last_name":"Thiagarajan","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Abhinav","last_name":"Bhatele","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Martin","last_name":"Schulz","affiliation":"TU Munich","country":"Germany","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Todd","last_name":"Gamblin","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tanzima Z.","last_name":"Islam","affiliation":"Western Washington University","country":"United States of America","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa224","type":"child","title":"Towards a Discipline of Performance Engineering: Lessons Learned from Stencil Kernel Benchmarks","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"An accurate measure of performance is often challenging, and the measurement process is seldom well documented and accurate, raising a credibility problem concerning the collected data. In this talk, we show how performance models and tools can work together. The selected test cases are kernels belonging to the stencil pattern, that is present in several scientific applications, ranging from geophysics to astronomy, fluid dynamics, image processing, and weather forecasts. First, we comment on how to pass from a description of a stencil to pseudo code and move to a modeling phase based on the \u0022Kerncraft\u0022 tool for automatic Roofline and Execution-Cache-Memory performance modeling. After the automatic generation of compilable source code, we will focus on how to ensure the reproducibility of the performance results of its execution, using \u0022PROVA!\u0022 a distributed workflow and system management tool for reproducible research. Knowing that a specific code performs accordingly with the model(s) can drive to the identification of relevant bottlenecks and therefore to potential optimizations. The ultimate goal is to generalize our approach to modeling, predicting and benchmarking, to a general application context.","filename":"msa224s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}] } Presentation
14:00 - 14:30
Holistic Performance Engineering for Sparse Iterative Solvers
, Jonas Thies (German Aerospace Center, Germany)
+ Abstract { "session": {"id":"sess174","title":"MS05 - Foundations and Applications of Performance Engineering","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Singapore Room","contributors":[{"type":"Session Chair","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Solid Earth Dynamics","Physics","Life Sciences","Engineering","Emerging Application Domains","Computer Science and Applied Mathematics","Climate and Weather","Chemistry and Materials"],"slots":[{"id":"symp156","type":"minisymposia","title":"MS05 - Foundations and Applications of Performance Engineering","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Achieving hardware and energy efficiency is important for current large-scale numerical simulations and will be a key component in the exascale era. In a world of heterogeneous, highly parallel computer architectures with deep memory hierarchies, complex application scenarios, and a broad spectrum of algorithms, a thorough analysis and understanding of the complex interaction of software, data structures, algorithms, and hardware features, a.k.a. performance engineering, is required for implementing codes that allow for portable performance on the computer generations to come. The minisymposium addresses a broad range of topics in performance engineering for modern HPC architectures, ranging from recent advances in performance models and tools supporting a \u0022white-box\u0022 performance engineering approach to application performance tuning cases studies and \u0022black-box\u0022 solutions. The presentations will point out the potentials and limitations of performance engineering activities and demonstrate the wide spectrum of performance models used in the performance engineering, including simple performance expectations, automatic model parameter selections, and analytic models.","bio":"","contributors":[{"type":"Organizer","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Helmar","last_name":"Burkhart","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Gerhard","last_name":"Wellein","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa145","type":"child","title":"Performance Engineering - Why and How?","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We give an overview of Performance Engineering (PE) techniques used in scientific computing. Starting from a motivation based on resource efficiency, we demonstrate how PE can support computational science along several directions of thrust: Classification, insight, prediction, and optimization. There is a wide range of PE techniques, all of which have a modeling component of some kind. Such models come in all shapes and sizes, but we classify them on a scale from black to gray to white: Black-box models ignore all or most of the actual \u0022inner workings\u0022 of hardware-software interactions and try to classify or predict interesting metrics automatically, based mostly on measurements. White-box models try to derive useful predictions from first principles, i.e., known properties of the hardware and the software. The wide range of \u0022gray-box\u0022 models in between bridge the gap and use the best of both worlds. Examples from physics and high performance computing are given.","filename":"msa145s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Georg","last_name":"Hager","affiliation":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa224","type":"child","title":"Towards a Discipline of Performance Engineering: Lessons Learned from Stencil Kernel Benchmarks","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"An accurate measure of performance is often challenging, and the measurement process is seldom well documented and accurate, raising a credibility problem concerning the collected data. In this talk, we show how performance models and tools can work together. The selected test cases are kernels belonging to the stencil pattern, that is present in several scientific applications, ranging from geophysics to astronomy, fluid dynamics, image processing, and weather forecasts. First, we comment on how to pass from a description of a stencil to pseudo code and move to a modeling phase based on the \u0022Kerncraft\u0022 tool for automatic Roofline and Execution-Cache-Memory performance modeling. After the automatic generation of compilable source code, we will focus on how to ensure the reproducibility of the performance results of its execution, using \u0022PROVA!\u0022 a distributed workflow and system management tool for reproducible research. Knowing that a specific code performs accordingly with the model(s) can drive to the identification of relevant bottlenecks and therefore to potential optimizations. The ultimate goal is to generalize our approach to modeling, predicting and benchmarking, to a general application context.","filename":"msa224s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Danilo","last_name":"Guerrera","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa158","type":"child","title":"Holistic Performance Engineering for Sparse Iterative Solvers","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In many applications, sparse (linear and\/or eigenvalue) solvers take up a large fraction of the overall runtime. We believe that the increasingly complex hardware of today\u0027s and future HPC systems has led to a gap in the understanding of the performance achieved by actual applications, many of which are still using a monolithic \u0027MPI only\u0027 approach despite the heterogeneous nature of the hardware. We have developed a new sparse solver library PHIST (https:\/\/bitbucket.org\/essex\/phist\/) that defines a simple \u0022kernel interface\u0022 layer inspired by MPI. Algorithms implemented in PHIST are portable in terms of software and performance as they only call building blocks of linear algebra via this interface. We have introduced simple performance models for these basic building blocks at the interface level, so that regardless of the backend providing the implementation, an overview of the optimization potential on the kernel level can be obtained, and performance pitfalls in the application (e.g. strided memory accesses) may be revealed. Available backends for PHIST include established libraries such as Trilinos\/Epetra or PETSc, as well as more recent \u0022MPI+X\u0022 approaches as implemented in Trilinos\/Tpetra or our own kernel library GHOST (https:\/\/bitbucket.org\/essex\/ghost).","filename":"msa158s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa206","type":"child","title":"Machine Learning Framework for Performance Coverage Analysis","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Proxy applications are written to represent subsets of performance behaviors of larger, and more complex applications that often have distribution restrictions. They enable easy evaluation of these behaviors across systems, e.g., for procurement or co-design purposes. However, the intended correlation between the performance behaviors of proxy applications and their parent codes is often based solely on the developer\u0027s intuition. In this paper, we present novel machine learning techniques to methodically quantify the coverage of performance behaviors of parent codes by their proxy applications. We have developed a framework, VERITAS, to answer these questions in the context of on-node performance: (a) which hardware resources are covered by a proxy application and how well, and (b) which resources are important, but not covered. Since 2016, a more general machine learning framework has been developed around VERITAS which leverages deep learning techniques to automatically learn feature space and present information in a more intuitive fashion.","bio":"","contributors":[{"type":"Author","first_name":"Tanzima Z.","last_name":"Islam","affiliation":"Western Washington University","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Jayaraman J.","last_name":"Thiagarajan","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Abhinav","last_name":"Bhatele","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Martin","last_name":"Schulz","affiliation":"TU Munich","country":"Germany","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Todd","last_name":"Gamblin","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tanzima Z.","last_name":"Islam","affiliation":"Western Washington University","country":"United States of America","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa158","type":"child","title":"Holistic Performance Engineering for Sparse Iterative Solvers","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In many applications, sparse (linear and\/or eigenvalue) solvers take up a large fraction of the overall runtime. We believe that the increasingly complex hardware of today\u0027s and future HPC systems has led to a gap in the understanding of the performance achieved by actual applications, many of which are still using a monolithic \u0027MPI only\u0027 approach despite the heterogeneous nature of the hardware. We have developed a new sparse solver library PHIST (https:\/\/bitbucket.org\/essex\/phist\/) that defines a simple \u0022kernel interface\u0022 layer inspired by MPI. Algorithms implemented in PHIST are portable in terms of software and performance as they only call building blocks of linear algebra via this interface. We have introduced simple performance models for these basic building blocks at the interface level, so that regardless of the backend providing the implementation, an overview of the optimization potential on the kernel level can be obtained, and performance pitfalls in the application (e.g. strided memory accesses) may be revealed. Available backends for PHIST include established libraries such as Trilinos\/Epetra or PETSc, as well as more recent \u0022MPI+X\u0022 approaches as implemented in Trilinos\/Tpetra or our own kernel library GHOST (https:\/\/bitbucket.org\/essex\/ghost).","filename":"msa158s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Jonas","last_name":"Thies","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"1","is_presenter":true}] } Presentation
Organizer(s):
Stefan Goedecker (University of Basel, Switzerland)
, Andre Schleife (University of Illinois at Urbana-Champaign, United States of America)
, Matthieu Verstraete (Universite de Liege, Belgium)
Track(s):
Engineering, Chemistry and Materials, Physics
Modern electronic-structure methods provide parameter-free simulations of properties across the whole spectrum of Physics, Chemistry, and Materials Science. They have become a bedrock of advanced analytical methods and are used systematically to interpret advanced experiments and complex interactions, with ever growing perspectives for more "realistic" systems, including defects, thermal and external fields, and transient phenomena. This minisymposium explores the next generation of electronic-structure software, which will lead users to exascale supercomputers, through highly efficient and highly parallel algorithms. We will showcase recent advances in ground- and excited-state calculations, spectroscopic quantities and transport, and adaptive methods which exploit different algorithms for different systems such as periodic/localized or many/few electrons per atom.
13:00 - 13:30
First-Principles Electron Transport with Phonon Coupling: Large Scale at Low Cost
, Tue Gunst (Technical University of Denmark, Denmark)
+ Abstract { "session": {"id":"sess184","title":"MS06 - Large Scale Electronic-Structure Calculations on Modern and Future High-Performance Supercomputers","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Boston 3 Room","contributors":[{"type":"Session Chair","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Engineering","Chemistry and Materials","Physics"],"slots":[{"id":"symp137","type":"minisymposia","title":"MS06 - Large Scale Electronic-Structure Calculations on Modern and Future High-Performance Supercomputers","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Modern electronic-structure methods provide parameter-free simulations of properties across the whole spectrum of Physics, Chemistry, and Materials Science. They have become a bedrock of advanced analytical methods and are used systematically to interpret advanced experiments and complex interactions, with ever growing perspectives for more \u0022realistic\u0022 systems, including defects, thermal and external fields, and transient phenomena. This minisymposium explores the next generation of electronic-structure software, which will lead users to exascale supercomputers, through highly efficient and highly parallel algorithms. We will showcase recent advances in ground- and excited-state calculations, spectroscopic quantities and transport, and adaptive methods which exploit different algorithms for different systems such as periodic\/localized or many\/few electrons per atom.","bio":"","contributors":[{"type":"Organizer","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Andre","last_name":"Schleife","affiliation":"University of Illinois at Urbana-Champaign","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Matthieu","last_name":"Verstraete","affiliation":"Universite de Liege","country":"Belgium","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa121","type":"child","title":"First-Principles Electron Transport with Phonon Coupling: Large Scale at Low Cost","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In the race towards high-performance nanometer-scaled devices the electronics industry now faces a major challenge from phonon-assisted tunneling. Despite the rapid size-reduction in experiments, system sizes still fall outside what is feasible for existing device models including electron-phonon coupling from first-principles. Therefore, the role of phonon-assisted tunneling in sub-10-nanometer gate-length devices has not been accurately quantified so far. We present a method that include phonon-assisted tunneling in large-scale first-principles calculations using a single \u0022special thermal displacement\u0022 of the atomic coordinates at almost the same cost as elastic transport calculations [1]. We apply the method to ultrascaled silicon devices and demonstrate the importance of phonon-assisted band-to-band and source-to-drain tunneling. In a diode the phonons lead to a rectification ratio suppression in good agreement with experiments, while in an ultrathin body transistor the phonons increase off currents by four orders of magnitude, in agreement with our state-of-the-art perturbation theory calculations. In addition, electron-phonon coupling of nanostructured devices in operation conditions can change significantly from its bulk value [2]. This makes the method an appealing design tool for next-generation devices and nanomaterials. [1]T. Gunst \u003Cem\u003Eet al\u003C\/em\u003E.,\u00a0Phys. Rev. B 96, 161404(R) (2017). [2]T. Gunst \u003Cem\u003Eet al\u003C\/em\u003E., Phys. Rev. Lett. 118, 046601 (2017).","filename":"msa121s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}]},{"id":"msa140","type":"child","title":"Large-Scale First-Principles Electronic Structure Calculations in Petascale and Exascale Supercomputers: A Real-Space Density Functional Theory Code","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"First-principles electronic structure calculation based on the Density Functional Theory (DFT) has been an indispensable tool for many fields of material science and engineering. With the development of supercomputers, the size of the targets of the first-principles DFT calculations becomes larger and larger, and nowadays, the target systems with a few hundreds to a thousand of atoms have been computable with standard plane-wave based DFT program codes. However, the computable size is not yet satisfactory to clarify the properties of materials in the situations close to realistic applications. We\u0027d like to introduce our program code RSDFT, which has been developed to perform large-scale first-principles calculations on massively-parallel supercomputers including the Japanese flagship machine K computer. RSDFT is based on the real-space finite-difference pseudopotential method. Contrary to the standard plane-wave methods, the real-space method does not need to use Fast Fourier Transformations, which requires heavy communication burden in parallel computations, and therefore RSDFT shows rather good scalability even in the computations with tens of thousands of compute nodes. It has also been started to develop RSDFT for the next Japanese flagship computer called post-K computer. We aim to make the first-principles calculations of tens-of-thousand-atom systems easy as a daily work.","filename":"msa140s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Atsushi","last_name":"Oshiyama","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true}]},{"id":"msa270","type":"child","title":"Potentialities of Wavelet Formalism towards a Reduction of the Complexity of Large Scale Electronic Structure Calculations","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"For the last few years, the BigDFT software package has implemented a linear scaling Kohn-Sham density functional theory optimization algorithm based on Daubechies wavelets, where a minimal set of localized support functions are optimized in situ and therefore adapted to the physico-chemical properties of the system under investigation. We illustrate, from a general perspective, a quantitative method to identify and assess the partitioning of a large quantum-mechanical system into fragments. Our approach reduces arbitrariness in the fragmentation procedure and enables the possibility of assessing quantitatively whether the corresponding fragment multipoles can be interpreted as observable quantities associated with a system moiety. Such an approach is based on general grounds and its implementation is unrelated to the wavelet formalism. However, we show that the use of a minimal set of in situ-optimized basis functions allows at the same time a proper fragment definition and an accurate description of the electronic structure.","bio":"","contributors":[{"type":"Author","first_name":"Luigi","last_name":"Genovese","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Stephan","last_name":"Mohr","affiliation":"Barcelona Supercomputing Center","country":"Spain","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Laura","last_name":"Ratcliff","affiliation":"Imperial College London","country":"United Kingdom","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Luigi","last_name":"Genovese","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true}]},{"id":"msa134","type":"child","title":"ABINIT on Pre-Exascale Supercomputers: Hybrid Parallelism and Numerical Stability","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"ABINIT is one of the most widely used electronic structure codes, implementing plane-wave based Density-Functional Theory. With multiple levels of parallelism, changing the code with every hardware evolution is a tedious task. To avoid obsolescence and allow adaptivity, an abstract layer for intensive low-level computing tasks has been introduced. Low-level sections have been rewritten specifically for a few hardware types. A global change of the hybrid parallelism is necessary to adapt the code to the new and future many-core architectures, as well as to memory bandwidth. A positive side effect of memory sharing is a better convergence of the diagonalization algorithm. Performances will be shown on Intel Xeon Skylake and Intel Xeon Phi KNL. Vectorization with large vectors and multithreading with more and more tasks induce a non-predictability of the floating-point operations that increase numerical noise and instabilities. We tackle this issue with the use of stochastic arithmetic to estimate the number of significant digits of each code section. Doing this, we can identify numerically sensitive code sections.","filename":"msa134s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Jordan","last_name":"Bieder","affiliation":"CEA","country":"France","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Yohan","last_name":"Chatelain","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Pablo","last_name":"Oliveira","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa121","type":"child","title":"First-Principles Electron Transport with Phonon Coupling: Large Scale at Low Cost","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In the race towards high-performance nanometer-scaled devices the electronics industry now faces a major challenge from phonon-assisted tunneling. Despite the rapid size-reduction in experiments, system sizes still fall outside what is feasible for existing device models including electron-phonon coupling from first-principles. Therefore, the role of phonon-assisted tunneling in sub-10-nanometer gate-length devices has not been accurately quantified so far. We present a method that include phonon-assisted tunneling in large-scale first-principles calculations using a single \u0022special thermal displacement\u0022 of the atomic coordinates at almost the same cost as elastic transport calculations [1]. We apply the method to ultrascaled silicon devices and demonstrate the importance of phonon-assisted band-to-band and source-to-drain tunneling. In a diode the phonons lead to a rectification ratio suppression in good agreement with experiments, while in an ultrathin body transistor the phonons increase off currents by four orders of magnitude, in agreement with our state-of-the-art perturbation theory calculations. In addition, electron-phonon coupling of nanostructured devices in operation conditions can change significantly from its bulk value [2]. This makes the method an appealing design tool for next-generation devices and nanomaterials. [1]T. Gunst \u003Cem\u003Eet al\u003C\/em\u003E.,\u00a0Phys. Rev. B 96, 161404(R) (2017). [2]T. Gunst \u003Cem\u003Eet al\u003C\/em\u003E., Phys. Rev. Lett. 118, 046601 (2017).","filename":"msa121s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}] } Presentation
13:30 - 14:00
Large-Scale First-Principles Electronic Structure Calculations in Petascale and Exascale Supercomputers: A Real-Space Density Functional Theory Code
, Jun-Ichi Iwata (The University of Tokyo, Japan)
+ Abstract { "session": {"id":"sess184","title":"MS06 - Large Scale Electronic-Structure Calculations on Modern and Future High-Performance Supercomputers","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Boston 3 Room","contributors":[{"type":"Session Chair","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Engineering","Chemistry and Materials","Physics"],"slots":[{"id":"symp137","type":"minisymposia","title":"MS06 - Large Scale Electronic-Structure Calculations on Modern and Future High-Performance Supercomputers","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Modern electronic-structure methods provide parameter-free simulations of properties across the whole spectrum of Physics, Chemistry, and Materials Science. They have become a bedrock of advanced analytical methods and are used systematically to interpret advanced experiments and complex interactions, with ever growing perspectives for more \u0022realistic\u0022 systems, including defects, thermal and external fields, and transient phenomena. This minisymposium explores the next generation of electronic-structure software, which will lead users to exascale supercomputers, through highly efficient and highly parallel algorithms. We will showcase recent advances in ground- and excited-state calculations, spectroscopic quantities and transport, and adaptive methods which exploit different algorithms for different systems such as periodic\/localized or many\/few electrons per atom.","bio":"","contributors":[{"type":"Organizer","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Andre","last_name":"Schleife","affiliation":"University of Illinois at Urbana-Champaign","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Matthieu","last_name":"Verstraete","affiliation":"Universite de Liege","country":"Belgium","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa121","type":"child","title":"First-Principles Electron Transport with Phonon Coupling: Large Scale at Low Cost","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In the race towards high-performance nanometer-scaled devices the electronics industry now faces a major challenge from phonon-assisted tunneling. Despite the rapid size-reduction in experiments, system sizes still fall outside what is feasible for existing device models including electron-phonon coupling from first-principles. Therefore, the role of phonon-assisted tunneling in sub-10-nanometer gate-length devices has not been accurately quantified so far. We present a method that include phonon-assisted tunneling in large-scale first-principles calculations using a single \u0022special thermal displacement\u0022 of the atomic coordinates at almost the same cost as elastic transport calculations [1]. We apply the method to ultrascaled silicon devices and demonstrate the importance of phonon-assisted band-to-band and source-to-drain tunneling. In a diode the phonons lead to a rectification ratio suppression in good agreement with experiments, while in an ultrathin body transistor the phonons increase off currents by four orders of magnitude, in agreement with our state-of-the-art perturbation theory calculations. In addition, electron-phonon coupling of nanostructured devices in operation conditions can change significantly from its bulk value [2]. This makes the method an appealing design tool for next-generation devices and nanomaterials. [1]T. Gunst \u003Cem\u003Eet al\u003C\/em\u003E.,\u00a0Phys. Rev. B 96, 161404(R) (2017). [2]T. Gunst \u003Cem\u003Eet al\u003C\/em\u003E., Phys. Rev. Lett. 118, 046601 (2017).","filename":"msa121s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}]},{"id":"msa140","type":"child","title":"Large-Scale First-Principles Electronic Structure Calculations in Petascale and Exascale Supercomputers: A Real-Space Density Functional Theory Code","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"First-principles electronic structure calculation based on the Density Functional Theory (DFT) has been an indispensable tool for many fields of material science and engineering. With the development of supercomputers, the size of the targets of the first-principles DFT calculations becomes larger and larger, and nowadays, the target systems with a few hundreds to a thousand of atoms have been computable with standard plane-wave based DFT program codes. However, the computable size is not yet satisfactory to clarify the properties of materials in the situations close to realistic applications. We\u0027d like to introduce our program code RSDFT, which has been developed to perform large-scale first-principles calculations on massively-parallel supercomputers including the Japanese flagship machine K computer. RSDFT is based on the real-space finite-difference pseudopotential method. Contrary to the standard plane-wave methods, the real-space method does not need to use Fast Fourier Transformations, which requires heavy communication burden in parallel computations, and therefore RSDFT shows rather good scalability even in the computations with tens of thousands of compute nodes. It has also been started to develop RSDFT for the next Japanese flagship computer called post-K computer. We aim to make the first-principles calculations of tens-of-thousand-atom systems easy as a daily work.","filename":"msa140s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Atsushi","last_name":"Oshiyama","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true}]},{"id":"msa270","type":"child","title":"Potentialities of Wavelet Formalism towards a Reduction of the Complexity of Large Scale Electronic Structure Calculations","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"For the last few years, the BigDFT software package has implemented a linear scaling Kohn-Sham density functional theory optimization algorithm based on Daubechies wavelets, where a minimal set of localized support functions are optimized in situ and therefore adapted to the physico-chemical properties of the system under investigation. We illustrate, from a general perspective, a quantitative method to identify and assess the partitioning of a large quantum-mechanical system into fragments. Our approach reduces arbitrariness in the fragmentation procedure and enables the possibility of assessing quantitatively whether the corresponding fragment multipoles can be interpreted as observable quantities associated with a system moiety. Such an approach is based on general grounds and its implementation is unrelated to the wavelet formalism. However, we show that the use of a minimal set of in situ-optimized basis functions allows at the same time a proper fragment definition and an accurate description of the electronic structure.","bio":"","contributors":[{"type":"Author","first_name":"Luigi","last_name":"Genovese","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Stephan","last_name":"Mohr","affiliation":"Barcelona Supercomputing Center","country":"Spain","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Laura","last_name":"Ratcliff","affiliation":"Imperial College London","country":"United Kingdom","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Luigi","last_name":"Genovese","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true}]},{"id":"msa134","type":"child","title":"ABINIT on Pre-Exascale Supercomputers: Hybrid Parallelism and Numerical Stability","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"ABINIT is one of the most widely used electronic structure codes, implementing plane-wave based Density-Functional Theory. With multiple levels of parallelism, changing the code with every hardware evolution is a tedious task. To avoid obsolescence and allow adaptivity, an abstract layer for intensive low-level computing tasks has been introduced. Low-level sections have been rewritten specifically for a few hardware types. A global change of the hybrid parallelism is necessary to adapt the code to the new and future many-core architectures, as well as to memory bandwidth. A positive side effect of memory sharing is a better convergence of the diagonalization algorithm. Performances will be shown on Intel Xeon Skylake and Intel Xeon Phi KNL. Vectorization with large vectors and multithreading with more and more tasks induce a non-predictability of the floating-point operations that increase numerical noise and instabilities. We tackle this issue with the use of stochastic arithmetic to estimate the number of significant digits of each code section. Doing this, we can identify numerically sensitive code sections.","filename":"msa134s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Jordan","last_name":"Bieder","affiliation":"CEA","country":"France","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Yohan","last_name":"Chatelain","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Pablo","last_name":"Oliveira","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa140","type":"child","title":"Large-Scale First-Principles Electronic Structure Calculations in Petascale and Exascale Supercomputers: A Real-Space Density Functional Theory Code","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"First-principles electronic structure calculation based on the Density Functional Theory (DFT) has been an indispensable tool for many fields of material science and engineering. With the development of supercomputers, the size of the targets of the first-principles DFT calculations becomes larger and larger, and nowadays, the target systems with a few hundreds to a thousand of atoms have been computable with standard plane-wave based DFT program codes. However, the computable size is not yet satisfactory to clarify the properties of materials in the situations close to realistic applications. We\u0027d like to introduce our program code RSDFT, which has been developed to perform large-scale first-principles calculations on massively-parallel supercomputers including the Japanese flagship machine K computer. RSDFT is based on the real-space finite-difference pseudopotential method. Contrary to the standard plane-wave methods, the real-space method does not need to use Fast Fourier Transformations, which requires heavy communication burden in parallel computations, and therefore RSDFT shows rather good scalability even in the computations with tens of thousands of compute nodes. It has also been started to develop RSDFT for the next Japanese flagship computer called post-K computer. We aim to make the first-principles calculations of tens-of-thousand-atom systems easy as a daily work.","filename":"msa140s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Atsushi","last_name":"Oshiyama","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Atsushi","last_name":"Oshiyama","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"2","is_presenter":false}] } Presentation
14:30 - 15:00
ABINIT on Pre-Exascale Supercomputers: Hybrid Parallelism and Numerical Stability
, Marc Torrent (CEA, France)
+ Abstract { "session": {"id":"sess184","title":"MS06 - Large Scale Electronic-Structure Calculations on Modern and Future High-Performance Supercomputers","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Boston 3 Room","contributors":[{"type":"Session Chair","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Engineering","Chemistry and Materials","Physics"],"slots":[{"id":"symp137","type":"minisymposia","title":"MS06 - Large Scale Electronic-Structure Calculations on Modern and Future High-Performance Supercomputers","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Modern electronic-structure methods provide parameter-free simulations of properties across the whole spectrum of Physics, Chemistry, and Materials Science. They have become a bedrock of advanced analytical methods and are used systematically to interpret advanced experiments and complex interactions, with ever growing perspectives for more \u0022realistic\u0022 systems, including defects, thermal and external fields, and transient phenomena. This minisymposium explores the next generation of electronic-structure software, which will lead users to exascale supercomputers, through highly efficient and highly parallel algorithms. We will showcase recent advances in ground- and excited-state calculations, spectroscopic quantities and transport, and adaptive methods which exploit different algorithms for different systems such as periodic\/localized or many\/few electrons per atom.","bio":"","contributors":[{"type":"Organizer","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Andre","last_name":"Schleife","affiliation":"University of Illinois at Urbana-Champaign","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Matthieu","last_name":"Verstraete","affiliation":"Universite de Liege","country":"Belgium","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Stefan","last_name":"Goedecker","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa121","type":"child","title":"First-Principles Electron Transport with Phonon Coupling: Large Scale at Low Cost","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In the race towards high-performance nanometer-scaled devices the electronics industry now faces a major challenge from phonon-assisted tunneling. Despite the rapid size-reduction in experiments, system sizes still fall outside what is feasible for existing device models including electron-phonon coupling from first-principles. Therefore, the role of phonon-assisted tunneling in sub-10-nanometer gate-length devices has not been accurately quantified so far. We present a method that include phonon-assisted tunneling in large-scale first-principles calculations using a single \u0022special thermal displacement\u0022 of the atomic coordinates at almost the same cost as elastic transport calculations [1]. We apply the method to ultrascaled silicon devices and demonstrate the importance of phonon-assisted band-to-band and source-to-drain tunneling. In a diode the phonons lead to a rectification ratio suppression in good agreement with experiments, while in an ultrathin body transistor the phonons increase off currents by four orders of magnitude, in agreement with our state-of-the-art perturbation theory calculations. In addition, electron-phonon coupling of nanostructured devices in operation conditions can change significantly from its bulk value [2]. This makes the method an appealing design tool for next-generation devices and nanomaterials. [1]T. Gunst \u003Cem\u003Eet al\u003C\/em\u003E.,\u00a0Phys. Rev. B 96, 161404(R) (2017). [2]T. Gunst \u003Cem\u003Eet al\u003C\/em\u003E., Phys. Rev. Lett. 118, 046601 (2017).","filename":"msa121s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tue","last_name":"Gunst","affiliation":"Technical University of Denmark","country":"Denmark","bio":"","order":"1","is_presenter":true}]},{"id":"msa140","type":"child","title":"Large-Scale First-Principles Electronic Structure Calculations in Petascale and Exascale Supercomputers: A Real-Space Density Functional Theory Code","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"First-principles electronic structure calculation based on the Density Functional Theory (DFT) has been an indispensable tool for many fields of material science and engineering. With the development of supercomputers, the size of the targets of the first-principles DFT calculations becomes larger and larger, and nowadays, the target systems with a few hundreds to a thousand of atoms have been computable with standard plane-wave based DFT program codes. However, the computable size is not yet satisfactory to clarify the properties of materials in the situations close to realistic applications. We\u0027d like to introduce our program code RSDFT, which has been developed to perform large-scale first-principles calculations on massively-parallel supercomputers including the Japanese flagship machine K computer. RSDFT is based on the real-space finite-difference pseudopotential method. Contrary to the standard plane-wave methods, the real-space method does not need to use Fast Fourier Transformations, which requires heavy communication burden in parallel computations, and therefore RSDFT shows rather good scalability even in the computations with tens of thousands of compute nodes. It has also been started to develop RSDFT for the next Japanese flagship computer called post-K computer. We aim to make the first-principles calculations of tens-of-thousand-atom systems easy as a daily work.","filename":"msa140s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Atsushi","last_name":"Oshiyama","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jun-Ichi","last_name":"Iwata","affiliation":"The University of Tokyo","country":"Japan","bio":"","order":"1","is_presenter":true}]},{"id":"msa270","type":"child","title":"Potentialities of Wavelet Formalism towards a Reduction of the Complexity of Large Scale Electronic Structure Calculations","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"For the last few years, the BigDFT software package has implemented a linear scaling Kohn-Sham density functional theory optimization algorithm based on Daubechies wavelets, where a minimal set of localized support functions are optimized in situ and therefore adapted to the physico-chemical properties of the system under investigation. We illustrate, from a general perspective, a quantitative method to identify and assess the partitioning of a large quantum-mechanical system into fragments. Our approach reduces arbitrariness in the fragmentation procedure and enables the possibility of assessing quantitatively whether the corresponding fragment multipoles can be interpreted as observable quantities associated with a system moiety. Such an approach is based on general grounds and its implementation is unrelated to the wavelet formalism. However, we show that the use of a minimal set of in situ-optimized basis functions allows at the same time a proper fragment definition and an accurate description of the electronic structure.","bio":"","contributors":[{"type":"Author","first_name":"Luigi","last_name":"Genovese","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Stephan","last_name":"Mohr","affiliation":"Barcelona Supercomputing Center","country":"Spain","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Laura","last_name":"Ratcliff","affiliation":"Imperial College London","country":"United Kingdom","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Luigi","last_name":"Genovese","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true}]},{"id":"msa134","type":"child","title":"ABINIT on Pre-Exascale Supercomputers: Hybrid Parallelism and Numerical Stability","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"ABINIT is one of the most widely used electronic structure codes, implementing plane-wave based Density-Functional Theory. With multiple levels of parallelism, changing the code with every hardware evolution is a tedious task. To avoid obsolescence and allow adaptivity, an abstract layer for intensive low-level computing tasks has been introduced. Low-level sections have been rewritten specifically for a few hardware types. A global change of the hybrid parallelism is necessary to adapt the code to the new and future many-core architectures, as well as to memory bandwidth. A positive side effect of memory sharing is a better convergence of the diagonalization algorithm. Performances will be shown on Intel Xeon Skylake and Intel Xeon Phi KNL. Vectorization with large vectors and multithreading with more and more tasks induce a non-predictability of the floating-point operations that increase numerical noise and instabilities. We tackle this issue with the use of stochastic arithmetic to estimate the number of significant digits of each code section. Doing this, we can identify numerically sensitive code sections.","filename":"msa134s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Jordan","last_name":"Bieder","affiliation":"CEA","country":"France","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Yohan","last_name":"Chatelain","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Pablo","last_name":"Oliveira","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa134","type":"child","title":"ABINIT on Pre-Exascale Supercomputers: Hybrid Parallelism and Numerical Stability","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"ABINIT is one of the most widely used electronic structure codes, implementing plane-wave based Density-Functional Theory. With multiple levels of parallelism, changing the code with every hardware evolution is a tedious task. To avoid obsolescence and allow adaptivity, an abstract layer for intensive low-level computing tasks has been introduced. Low-level sections have been rewritten specifically for a few hardware types. A global change of the hybrid parallelism is necessary to adapt the code to the new and future many-core architectures, as well as to memory bandwidth. A positive side effect of memory sharing is a better convergence of the diagonalization algorithm. Performances will be shown on Intel Xeon Skylake and Intel Xeon Phi KNL. Vectorization with large vectors and multithreading with more and more tasks induce a non-predictability of the floating-point operations that increase numerical noise and instabilities. We tackle this issue with the use of stochastic arithmetic to estimate the number of significant digits of each code section. Doing this, we can identify numerically sensitive code sections.","filename":"msa134s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Jordan","last_name":"Bieder","affiliation":"CEA","country":"France","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Yohan","last_name":"Chatelain","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Pablo","last_name":"Oliveira","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Marc","last_name":"Torrent","affiliation":"CEA","country":"France","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Jordan","last_name":"Bieder","affiliation":"CEA","country":"France","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Yohan","last_name":"Chatelain","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Pablo","last_name":"Oliveira","affiliation":"Universit\u00e9 de Versailles Saint-Quentin-en-Yvelines","country":"France","bio":"","order":"4","is_presenter":false}] } Presentation
Organizer(s):
Peter Dominik Dueben (ECMWF, United Kingdom)
, Rupert Ford (Science and Technology Facilities Council, United Kingdom)
, Willem Deconinck (ECMWF, United Kingdom)
Track(s):
Climate and Weather
The increasingly large amounts of data being produced by weather and climate simulations and earth system observations is sometimes characterised as a deluge. This deluge of data is both a challenge and an opportunity. The main opportunities are to make use of this wealth of data to 1) improve knowledge by extracting additional knowledge from the data and 2) to improve the quality of the models themselves by analysing the accuracy, or lack thereof, of the resultant simulation data. An example of the former case is improved prediction of large scale phenomena such as El Nino. An example of the latter is the improvement of a Physics parameterisation scheme through detailed analysis of the errors in a large number of datasets.
One way to realise these opportunities is to use machine learning approaches. As machine learning in weather and climate is a relatively new topic this minisymposium introduces the audience to how machine learning could be used in weather and climate and outlines its implications in terms of computing costs. To ground the ideas in concrete examples it also illustrates the use of machine learning in the weather and climate domain with practical examples.
One way to realise these opportunities is to use machine learning approaches. As machine learning in weather and climate is a relatively new topic this minisymposium introduces the audience to how machine learning could be used in weather and climate and outlines its implications in terms of computing costs. To ground the ideas in concrete examples it also illustrates the use of machine learning in the weather and climate domain with practical examples.
13:30 - 14:00
Deep Learning in Weather and Climate, Part 2: The Computing Perspective
, Jakob Progsch (NVIDIA Inc., Germany)
+ Abstract { "session": {"id":"sess185","title":"MS07 - Machine Learning in Weather and Climate","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Rio Room","contributors":[{"type":"Session Chair","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"University of Oxford","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Climate and Weather"],"slots":[{"id":"symp155","type":"minisymposia","title":"MS07 - Machine Learning in Weather and Climate","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"The increasingly large amounts of data being produced by weather and climate simulations and earth system observations is sometimes characterised as a deluge. This deluge of data is both a challenge and an opportunity. The main opportunities are to make use of this wealth of data to 1) improve knowledge by extracting additional knowledge from the data and 2) to improve the quality of the models themselves by analysing the accuracy, or lack thereof, of the resultant simulation data. An example of the former case is improved prediction of large scale phenomena such as El Nino. An example of the latter is the improvement of a Physics parameterisation scheme through detailed analysis of the errors in a large number of datasets.\u003Cbr \/\u003E\u003Cbr \/\u003EOne way to realise these opportunities is to use machine learning approaches. As machine learning in weather and climate is a relatively new topic this minisymposium introduces the audience to how machine learning could be used in weather and climate and outlines its implications in terms of computing costs. To ground the ideas in concrete examples it also illustrates the use of machine learning in the weather and climate domain with practical examples.","bio":"","contributors":[{"type":"Organizer","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Rupert","last_name":"Ford","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Willem","last_name":"Deconinck","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa111","type":"child","title":"Deep Learning in Weather and Climate, Part 1: The Domain Perspective","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"From the perspective of Earth System modelling, the use of machine learning, an in particular deep learning, is still in its infancy. There are many possible ways how deep learning could improve model quality or generate significant speed-ups for simulations. However, it has yet to be shown that deep learning can hold what it is promising for this application and its specific needs. This talk will provide an overview how deep learning may impact Earth System modelling in the future. We will provide examples how these methods have been used until today and discuss both limitations and prospects for their application. We will present results when using deep learning to improve model simulations for a toy model of atmospheric dynamics (the Lorenz\u002795 model). We will also show preliminary results that use deep neural networks that are trained from global atmospheric data to represent atmospheric dynamics and networks that are designed to speed-up parts of a weather forecast model at full complexity.","bio":"","contributors":[{"type":"Author","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa149","type":"child","title":"Deep Learning in Weather and Climate, Part 2: The Computing Perspective","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this presentation we will discuss ways in which deep neural networks can be integrated with traditional climate and weather\u00a0simulations. In particular, we will be focusing on\u00a0the design, implementation, and training of a deep convolutional neural network and its integration with the IFS Forecast Model inside RAPS as a new stand-alone\u00a0radiation scheme. This work is a case study for how\u00a0AI and large scale simulation may be applied on a cooperative basis and let the strengths of each converge to form a new tool for science.","filename":"msa149s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Christoph","last_name":"Angerer","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}]},{"id":"msa253","type":"child","title":"Integrating Machine Learning Algorithms and HPDA Frameworks to Run Predictive Analytics on Large-Scale Climate and Weather Datasets","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This work relates to the integration of a recurrent neural network algorithm (Long Short-Term Memory - LSTM) into Ophidia, a datacube-oriented High Performance Data Analytics framework. More specifically, Ophidia provides a (big) datacube abstraction to the end users, while it physically builds its core set of functionalities (namely \u0027\u003Cem\u003Eoperators\u003C\/em\u003E\u0027) on top of an array-database system. Operators in Ophidia run in parallel over a cluster to tackle big data challenges on massive scientific datasets. At the array-database level, Ophidia allows end-user developing her own analytics functions (namely \u0027\u003Cem\u003Eprimitives\u003C\/em\u003E\u0027), which by definition represent a sequential array-based data transformation. By implementing the LSTM algorithm as a \u003Cem\u003Eprimitive\u003C\/em\u003E running over a long time series, machine learning capabilities can be integrated into Ophidia taking advantage of a HPDA approach applied over large-scale datasets. A couple of case studies have been considered: the former relates to the output of a WRF model running over the Brazilian region of Curitiba, whereas the latter includes both simulated data, through an unstructured grid forecasting model running at CMCC by the Ocean Predictions and Applications Division, and observations over the Apulia region in the South-East of Italy. Preliminary insights about the proposed approach seems promising and will be presented.","filename":"msa253s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Giovanni","last_name":"Aloisio","affiliation":"University of Salento","country":"Italy","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Sandro","last_name":"Fiore","affiliation":"CMCC Foundation","country":"Italy","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true}]},{"id":"msa112","type":"child","title":"Using Self-Organising Maps to Understand Relationships between Clouds and Cloud Controlling Factors","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The long term global warming predicted for a doubling of carbon dioxide is known as the \u0027climate sensitivity\u0027. Many future regional impacts of climate change become more serious for larger climate sensitivities. Current estimates of climate sensitivity from different climate models around the world vary by more than a factor of two, from approximately 2 to 5K. The reason for such a large range of estimates is due to the uncertainty around how clouds will change as the climate warms. Changes in clouds are hard to predict because they depend non-linearly on many interacting environmental factors. In this work we are interested in establishing whether machine learning can provide new insights into a) the factors controlling cloud changes and b) how various climate models represent these relationships. We use the Self-Organising Map (SOM), an unsupervised learning technique well suited to analysing high-dimensional data, to explore relationships between cloud controlling factors and compare the results to standard linear correlation. We find that potentially interesting new relationships not shown by linear correlation are revealed by the SOM technique.","filename":"msa112s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Mark","last_name":"Webb","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa149","type":"child","title":"Deep Learning in Weather and Climate, Part 2: The Computing Perspective","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this presentation we will discuss ways in which deep neural networks can be integrated with traditional climate and weather\u00a0simulations. In particular, we will be focusing on\u00a0the design, implementation, and training of a deep convolutional neural network and its integration with the IFS Forecast Model inside RAPS as a new stand-alone\u00a0radiation scheme. This work is a case study for how\u00a0AI and large scale simulation may be applied on a cooperative basis and let the strengths of each converge to form a new tool for science.","filename":"msa149s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Christoph","last_name":"Angerer","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Christoph","last_name":"Angerer","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}] } Presentation
14:00 - 14:30
Integrating Machine Learning Algorithms and HPDA Frameworks to Run Predictive Analytics on Large-Scale Climate and Weather Datasets
, Alessandro D'Anca (CMCC, Italy)
+ Abstract { "session": {"id":"sess185","title":"MS07 - Machine Learning in Weather and Climate","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Rio Room","contributors":[{"type":"Session Chair","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"University of Oxford","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Climate and Weather"],"slots":[{"id":"symp155","type":"minisymposia","title":"MS07 - Machine Learning in Weather and Climate","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"The increasingly large amounts of data being produced by weather and climate simulations and earth system observations is sometimes characterised as a deluge. This deluge of data is both a challenge and an opportunity. The main opportunities are to make use of this wealth of data to 1) improve knowledge by extracting additional knowledge from the data and 2) to improve the quality of the models themselves by analysing the accuracy, or lack thereof, of the resultant simulation data. An example of the former case is improved prediction of large scale phenomena such as El Nino. An example of the latter is the improvement of a Physics parameterisation scheme through detailed analysis of the errors in a large number of datasets.\u003Cbr \/\u003E\u003Cbr \/\u003EOne way to realise these opportunities is to use machine learning approaches. As machine learning in weather and climate is a relatively new topic this minisymposium introduces the audience to how machine learning could be used in weather and climate and outlines its implications in terms of computing costs. To ground the ideas in concrete examples it also illustrates the use of machine learning in the weather and climate domain with practical examples.","bio":"","contributors":[{"type":"Organizer","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Rupert","last_name":"Ford","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Willem","last_name":"Deconinck","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa111","type":"child","title":"Deep Learning in Weather and Climate, Part 1: The Domain Perspective","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"From the perspective of Earth System modelling, the use of machine learning, an in particular deep learning, is still in its infancy. There are many possible ways how deep learning could improve model quality or generate significant speed-ups for simulations. However, it has yet to be shown that deep learning can hold what it is promising for this application and its specific needs. This talk will provide an overview how deep learning may impact Earth System modelling in the future. We will provide examples how these methods have been used until today and discuss both limitations and prospects for their application. We will present results when using deep learning to improve model simulations for a toy model of atmospheric dynamics (the Lorenz\u002795 model). We will also show preliminary results that use deep neural networks that are trained from global atmospheric data to represent atmospheric dynamics and networks that are designed to speed-up parts of a weather forecast model at full complexity.","bio":"","contributors":[{"type":"Author","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa149","type":"child","title":"Deep Learning in Weather and Climate, Part 2: The Computing Perspective","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this presentation we will discuss ways in which deep neural networks can be integrated with traditional climate and weather\u00a0simulations. In particular, we will be focusing on\u00a0the design, implementation, and training of a deep convolutional neural network and its integration with the IFS Forecast Model inside RAPS as a new stand-alone\u00a0radiation scheme. This work is a case study for how\u00a0AI and large scale simulation may be applied on a cooperative basis and let the strengths of each converge to form a new tool for science.","filename":"msa149s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Christoph","last_name":"Angerer","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}]},{"id":"msa253","type":"child","title":"Integrating Machine Learning Algorithms and HPDA Frameworks to Run Predictive Analytics on Large-Scale Climate and Weather Datasets","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This work relates to the integration of a recurrent neural network algorithm (Long Short-Term Memory - LSTM) into Ophidia, a datacube-oriented High Performance Data Analytics framework. More specifically, Ophidia provides a (big) datacube abstraction to the end users, while it physically builds its core set of functionalities (namely \u0027\u003Cem\u003Eoperators\u003C\/em\u003E\u0027) on top of an array-database system. Operators in Ophidia run in parallel over a cluster to tackle big data challenges on massive scientific datasets. At the array-database level, Ophidia allows end-user developing her own analytics functions (namely \u0027\u003Cem\u003Eprimitives\u003C\/em\u003E\u0027), which by definition represent a sequential array-based data transformation. By implementing the LSTM algorithm as a \u003Cem\u003Eprimitive\u003C\/em\u003E running over a long time series, machine learning capabilities can be integrated into Ophidia taking advantage of a HPDA approach applied over large-scale datasets. A couple of case studies have been considered: the former relates to the output of a WRF model running over the Brazilian region of Curitiba, whereas the latter includes both simulated data, through an unstructured grid forecasting model running at CMCC by the Ocean Predictions and Applications Division, and observations over the Apulia region in the South-East of Italy. Preliminary insights about the proposed approach seems promising and will be presented.","filename":"msa253s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Giovanni","last_name":"Aloisio","affiliation":"University of Salento","country":"Italy","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Sandro","last_name":"Fiore","affiliation":"CMCC Foundation","country":"Italy","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true}]},{"id":"msa112","type":"child","title":"Using Self-Organising Maps to Understand Relationships between Clouds and Cloud Controlling Factors","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The long term global warming predicted for a doubling of carbon dioxide is known as the \u0027climate sensitivity\u0027. Many future regional impacts of climate change become more serious for larger climate sensitivities. Current estimates of climate sensitivity from different climate models around the world vary by more than a factor of two, from approximately 2 to 5K. The reason for such a large range of estimates is due to the uncertainty around how clouds will change as the climate warms. Changes in clouds are hard to predict because they depend non-linearly on many interacting environmental factors. In this work we are interested in establishing whether machine learning can provide new insights into a) the factors controlling cloud changes and b) how various climate models represent these relationships. We use the Self-Organising Map (SOM), an unsupervised learning technique well suited to analysing high-dimensional data, to explore relationships between cloud controlling factors and compare the results to standard linear correlation. We find that potentially interesting new relationships not shown by linear correlation are revealed by the SOM technique.","filename":"msa112s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Mark","last_name":"Webb","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa253","type":"child","title":"Integrating Machine Learning Algorithms and HPDA Frameworks to Run Predictive Analytics on Large-Scale Climate and Weather Datasets","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This work relates to the integration of a recurrent neural network algorithm (Long Short-Term Memory - LSTM) into Ophidia, a datacube-oriented High Performance Data Analytics framework. More specifically, Ophidia provides a (big) datacube abstraction to the end users, while it physically builds its core set of functionalities (namely \u0027\u003Cem\u003Eoperators\u003C\/em\u003E\u0027) on top of an array-database system. Operators in Ophidia run in parallel over a cluster to tackle big data challenges on massive scientific datasets. At the array-database level, Ophidia allows end-user developing her own analytics functions (namely \u0027\u003Cem\u003Eprimitives\u003C\/em\u003E\u0027), which by definition represent a sequential array-based data transformation. By implementing the LSTM algorithm as a \u003Cem\u003Eprimitive\u003C\/em\u003E running over a long time series, machine learning capabilities can be integrated into Ophidia taking advantage of a HPDA approach applied over large-scale datasets. A couple of case studies have been considered: the former relates to the output of a WRF model running over the Brazilian region of Curitiba, whereas the latter includes both simulated data, through an unstructured grid forecasting model running at CMCC by the Ocean Predictions and Applications Division, and observations over the Apulia region in the South-East of Italy. Preliminary insights about the proposed approach seems promising and will be presented.","filename":"msa253s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Giovanni","last_name":"Aloisio","affiliation":"University of Salento","country":"Italy","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Sandro","last_name":"Fiore","affiliation":"CMCC Foundation","country":"Italy","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Giovanni","last_name":"Aloisio","affiliation":"University of Salento","country":"Italy","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Sandro","last_name":"Fiore","affiliation":"CMCC Foundation","country":"Italy","bio":"","order":"3","is_presenter":false}] } Presentation
14:30 - 15:00
Using Self-Organising Maps to Understand Relationships between Clouds and Cloud Controlling Factors
, Samantha V. Adams (Met Office, United Kingdom)
+ Abstract { "session": {"id":"sess185","title":"MS07 - Machine Learning in Weather and Climate","date":"Monday, July 2nd 2018","begin_time":"13:00","end_time":"15:00","room":"Rio Room","contributors":[{"type":"Session Chair","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"University of Oxford","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Climate and Weather"],"slots":[{"id":"symp155","type":"minisymposia","title":"MS07 - Machine Learning in Weather and Climate","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"The increasingly large amounts of data being produced by weather and climate simulations and earth system observations is sometimes characterised as a deluge. This deluge of data is both a challenge and an opportunity. The main opportunities are to make use of this wealth of data to 1) improve knowledge by extracting additional knowledge from the data and 2) to improve the quality of the models themselves by analysing the accuracy, or lack thereof, of the resultant simulation data. An example of the former case is improved prediction of large scale phenomena such as El Nino. An example of the latter is the improvement of a Physics parameterisation scheme through detailed analysis of the errors in a large number of datasets.\u003Cbr \/\u003E\u003Cbr \/\u003EOne way to realise these opportunities is to use machine learning approaches. As machine learning in weather and climate is a relatively new topic this minisymposium introduces the audience to how machine learning could be used in weather and climate and outlines its implications in terms of computing costs. To ground the ideas in concrete examples it also illustrates the use of machine learning in the weather and climate domain with practical examples.","bio":"","contributors":[{"type":"Organizer","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Rupert","last_name":"Ford","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Willem","last_name":"Deconinck","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa111","type":"child","title":"Deep Learning in Weather and Climate, Part 1: The Domain Perspective","begin_time":"13:00","end_time":"13:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"From the perspective of Earth System modelling, the use of machine learning, an in particular deep learning, is still in its infancy. There are many possible ways how deep learning could improve model quality or generate significant speed-ups for simulations. However, it has yet to be shown that deep learning can hold what it is promising for this application and its specific needs. This talk will provide an overview how deep learning may impact Earth System modelling in the future. We will provide examples how these methods have been used until today and discuss both limitations and prospects for their application. We will present results when using deep learning to improve model simulations for a toy model of atmospheric dynamics (the Lorenz\u002795 model). We will also show preliminary results that use deep neural networks that are trained from global atmospheric data to represent atmospheric dynamics and networks that are designed to speed-up parts of a weather forecast model at full complexity.","bio":"","contributors":[{"type":"Author","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Peter Dominik","last_name":"Dueben","affiliation":"ECMWF","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa149","type":"child","title":"Deep Learning in Weather and Climate, Part 2: The Computing Perspective","begin_time":"13:30","end_time":"14:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In this presentation we will discuss ways in which deep neural networks can be integrated with traditional climate and weather\u00a0simulations. In particular, we will be focusing on\u00a0the design, implementation, and training of a deep convolutional neural network and its integration with the IFS Forecast Model inside RAPS as a new stand-alone\u00a0radiation scheme. This work is a case study for how\u00a0AI and large scale simulation may be applied on a cooperative basis and let the strengths of each converge to form a new tool for science.","filename":"msa149s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Christoph","last_name":"Angerer","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jakob","last_name":"Progsch","affiliation":"NVIDIA Inc.","country":"Germany","bio":"","order":"2","is_presenter":true}]},{"id":"msa253","type":"child","title":"Integrating Machine Learning Algorithms and HPDA Frameworks to Run Predictive Analytics on Large-Scale Climate and Weather Datasets","begin_time":"14:00","end_time":"14:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This work relates to the integration of a recurrent neural network algorithm (Long Short-Term Memory - LSTM) into Ophidia, a datacube-oriented High Performance Data Analytics framework. More specifically, Ophidia provides a (big) datacube abstraction to the end users, while it physically builds its core set of functionalities (namely \u0027\u003Cem\u003Eoperators\u003C\/em\u003E\u0027) on top of an array-database system. Operators in Ophidia run in parallel over a cluster to tackle big data challenges on massive scientific datasets. At the array-database level, Ophidia allows end-user developing her own analytics functions (namely \u0027\u003Cem\u003Eprimitives\u003C\/em\u003E\u0027), which by definition represent a sequential array-based data transformation. By implementing the LSTM algorithm as a \u003Cem\u003Eprimitive\u003C\/em\u003E running over a long time series, machine learning capabilities can be integrated into Ophidia taking advantage of a HPDA approach applied over large-scale datasets. A couple of case studies have been considered: the former relates to the output of a WRF model running over the Brazilian region of Curitiba, whereas the latter includes both simulated data, through an unstructured grid forecasting model running at CMCC by the Ocean Predictions and Applications Division, and observations over the Apulia region in the South-East of Italy. Preliminary insights about the proposed approach seems promising and will be presented.","filename":"msa253s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Giovanni","last_name":"Aloisio","affiliation":"University of Salento","country":"Italy","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Sandro","last_name":"Fiore","affiliation":"CMCC Foundation","country":"Italy","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Alessandro","last_name":"D\u0027Anca","affiliation":"CMCC","country":"Italy","bio":"","order":"1","is_presenter":true}]},{"id":"msa112","type":"child","title":"Using Self-Organising Maps to Understand Relationships between Clouds and Cloud Controlling Factors","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The long term global warming predicted for a doubling of carbon dioxide is known as the \u0027climate sensitivity\u0027. Many future regional impacts of climate change become more serious for larger climate sensitivities. Current estimates of climate sensitivity from different climate models around the world vary by more than a factor of two, from approximately 2 to 5K. The reason for such a large range of estimates is due to the uncertainty around how clouds will change as the climate warms. Changes in clouds are hard to predict because they depend non-linearly on many interacting environmental factors. In this work we are interested in establishing whether machine learning can provide new insights into a) the factors controlling cloud changes and b) how various climate models represent these relationships. We use the Self-Organising Map (SOM), an unsupervised learning technique well suited to analysing high-dimensional data, to explore relationships between cloud controlling factors and compare the results to standard linear correlation. We find that potentially interesting new relationships not shown by linear correlation are revealed by the SOM technique.","filename":"msa112s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Mark","last_name":"Webb","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa112","type":"child","title":"Using Self-Organising Maps to Understand Relationships between Clouds and Cloud Controlling Factors","begin_time":"14:30","end_time":"15:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The long term global warming predicted for a doubling of carbon dioxide is known as the \u0027climate sensitivity\u0027. Many future regional impacts of climate change become more serious for larger climate sensitivities. Current estimates of climate sensitivity from different climate models around the world vary by more than a factor of two, from approximately 2 to 5K. The reason for such a large range of estimates is due to the uncertainty around how clouds will change as the climate warms. Changes in clouds are hard to predict because they depend non-linearly on many interacting environmental factors. In this work we are interested in establishing whether machine learning can provide new insights into a) the factors controlling cloud changes and b) how various climate models represent these relationships. We use the Self-Organising Map (SOM), an unsupervised learning technique well suited to analysing high-dimensional data, to explore relationships between cloud controlling factors and compare the results to standard linear correlation. We find that potentially interesting new relationships not shown by linear correlation are revealed by the SOM technique.","filename":"msa112s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Mark","last_name":"Webb","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Samantha V.","last_name":"Adams","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Mark","last_name":"Webb","affiliation":"Met Office","country":"United Kingdom","bio":"","order":"2","is_presenter":false}] } Presentation
Organizer(s):
Ramesh Balakrishnan (Argonne National Laboratory, United States of America)
, Philipp Schlatter (KTH Royal Institute of Technology, Sweden)
Track(s):
Engineering, Computer Science and Applied Mathematics
Wall resolved LES is prohibitively expensive, and a first principles based wall modeled LES, that is free of tunable parameters, is far from becoming an alternative to RANS as a predictive tool for design. On the spatial discretization front, most higher-order and spectral methods have had many of their successes in the realm of low to moderate Reynolds number flows, and on relatively less complex geometries. The bulk of the complex flow high Re simulations still employ nominally second-order accurate schemes in unstructured mesh finite volume flow solvers, and finite-element based solvers with modest polynomial order. Even in these fairly mature solvers, evidence suggests that merely running larger cases with an increased number of grid points and on larger computational domains does not always guarantee a better solution (in the LES sense). There is, therefore, a need for a two-level effort whereby benchmark higher-order simulations can serve to inform sub-grid models to improve the predictive capability of existing mature flow solvers. We hope to motivate a discussion along these lines with examples of results of higher-order simulations, as well as their role in assessing and improving the predictive capabilities of sub-grid models with more conventional flow solvers.
Organizer(s):
Andreas Vitalis (University of Zurich, Switzerland)
, Marco Bacci (University of Zurich, Switzerland)
, Amedeo Caflisch (University of Zurich, Switzerland)
Track(s):
Life Sciences, Emerging Application Domains, Computer Science and Applied Mathematics
A common problem in numerical optimization and sampling is the detection of relevant states. These could be, for instance, the local minima on a rugged parameter surface or the transition state of a chemical reaction. For most cases, an exhaustive search for the optimal solution is intractable. Here, we focus on parallel sampling and optimization strategies relying on multiple replicas, most prominently, adaptive methods where all simulated replicas use the same propagator and sample the same underlying surface. In these methods, replica intercommunication is used to provide a global assessment as to which replicas are most interesting. This implies, in general, periodic data mining steps across replicas. Furthermore, in order to extract and utilize the gained information in post-processing, data must often be stored, which poses stringent data management and analysis challenges in particular for high-dimensional cases. The minisymposium wishes to discuss the following questions: What are meaningful and easily generalizable tools, strategies, and algorithms to guide the sampling/exploration? How can we maintain scalability and load balance? What types of post-processing algorithms can be applied to the generated data, and are those scalable to provide on-the-fly solutions to direct the exploration?
16:30 - 17:00
On the Interpretation of Non-Equilibrium MD Trajectories
, Tanja Schilling (University of Freiburg, Germany)
+ Abstract { "session": {"id":"sess205","title":"MS09 - Adaptive Parallel Strategies for the Exploration of Challenging Search Spaces with Applications in Particle Simulations and Optimization, Part II","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Samarkand Room","contributors":[{"type":"Session Chair","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Life Sciences","Emerging Application Domains","Computer Science and Applied Mathematics"],"slots":[{"id":"symp160","type":"minisymposia","title":"MS09 - Adaptive Parallel Strategies for the Exploration of Challenging Search Spaces with Applications in Particle Simulations and Optimization, Part II","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"A common problem in numerical optimization and sampling is the detection of relevant states. These could be, for instance, the local minima on a rugged parameter surface or the transition state of a chemical reaction. For most cases, an exhaustive search for the optimal solution is intractable. Here, we focus on parallel sampling and optimization strategies relying on multiple replicas, most prominently, adaptive methods where all simulated replicas use the same propagator and sample the same underlying surface. In these methods, replica intercommunication is used to provide a global assessment as to which replicas are most interesting. This implies, in general, periodic data mining steps across replicas. Furthermore, in order to extract and utilize the gained information in post-processing, data must often be stored, which poses stringent data management and analysis challenges in particular for high-dimensional cases. The minisymposium wishes to discuss the following questions: What are meaningful and easily generalizable tools, strategies, and algorithms to guide the sampling\/exploration? How can we maintain scalability and load balance? What types of post-processing algorithms can be applied to the generated data, and are those scalable to provide on-the-fly solutions to direct the exploration?","bio":"","contributors":[{"type":"Organizer","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Marco","last_name":"Bacci","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Amedeo","last_name":"Caflisch","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Andreas","last_name":"Vitalis","affiliation":"University of Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa201","type":"child","title":"Task-Based Parallelization of Replica Exchange Transition Interface Sampling in OpenPathSampling","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Path sampling methods, such as transition path sampling and transition interface sampling, are powerful tools for studying rare events. They perform Monte Carlo simulations in the space of trajectories, focusing the simulation effort on the transition itself to avoid spending long waiting times in the stable states. Since they are Monte Carlo approaches, they can use multiple walkers, but some approaches also use replicas from different path ensembles. In particular, replica exchange transition interface sampling (RETIS) involves simultaneously sampling trajectories from several path ensembles. However, even within a single ensemble, the lengths of the sampled trajectories can vary and are unpredictable. This makes load balancing an extremely challenging problem. This presentation describes the parallelization of RETIS in the software package OpenPathSampling using dask.distributed, a Python package for task-based programming. The task-based approach enables parallelization that provides optimal use of computational resources, not only by load balancing, but also by allowing the allocated resources to be scaled up or down according to the needs of the simulation. While the approach is described here in the context of path sampling, the same technique could be applied to many trajectory-based simulation methods.","bio":"","contributors":[{"type":"Author","first_name":"David W. H.","last_name":"Swenson","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"David W. H.","last_name":"Swenson","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}]},{"id":"msa126","type":"child","title":"Replica-Exchange Enveloping Distribution Sampling (RE-EDS) to Calculate Multiple Free-Energy Differences in a Single Simulation","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Enveloping distribution sampling (EDS) allows the calculation of free-energy differences between multiple end states from a single simulation. A reference-state Hamiltonian is simulated which envelopes the Hamiltonians of the end states. The challenge when using EDS is the determination of optimal parameters for the reference-state Hamiltonian. Previously, the choice of parameters for an EDS simulation with multiple end states was a non-trivial problem that limited the application of the methodology. To overcome these limitations, we have generalized the replica-exchange EDS (RE-EDS) methodology to arbitrary systems. By exchanging configurations between replicas with different parameters for the reference-state Hamiltonian, major parts of the problem to choose optimal parameters are circumvented. Algorithms to estimate the energy offsets and optimize the replica distribution have been developed. Our approach was tested successfully using a system consisting of nine inhibitors of phenylethanolamine N-methyltransferase (PNMT), which were studied previously with thermodynamic integration and EDS.","bio":"","contributors":[{"type":"Author","first_name":"Sereina Z.","last_name":"Riniker","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Sereina Z.","last_name":"Riniker","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa120","type":"child","title":"On the Interpretation of Non-Equilibrium MD Trajectories","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"As a researcher in statistical physics, one may often be interested in reducing the complexity of a many-particle system to the study of a set of relevant observables. If the system is in equilibrium, a systematic way to derive an equation of motion for the \u0022relevant\u0022 observables from the microscopic dynamics has been known for some time as the \u0022Mori-Zwanzig\u0022 formalism, which leads to the Langevin equation. In contrast, if the dynamics is not stationary, it is not a priori clear which form the equation of motion for an averaged observable will have. We adapt Mori-Zwanzig formalism to derive the equation of motion for a non-equilibrium trajectory-averaged observable as well as for its non-stationary auto-correlation function. We also derive a fluctuation-dissipation-like relation which relates the memory kernel and the autocorrelation function of the fluctuating force. In addition, we show how to relate the Taylor expansion of the memory kernel to experimental data, thus allowing to construct the equation of motion from direct measurements.","filename":"msa120s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tanja","last_name":"Schilling","affiliation":"University of Freiburg","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Thomas","last_name":"Voigtmann","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Hugues","last_name":"Meyer","affiliation":"University of Luxembourg","country":"Luxembourg","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tanja","last_name":"Schilling","affiliation":"University of Freiburg","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa252","type":"child","title":"Dynamic Histogram Analysis to Determine Free Energies and Rates from Biased Simulations","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Transitions between metastable states govern many fundamental processes in biology, such as biomolecular folding. The underlying free energy surfaces can be obtained from simulations using enhanced sampling methods. We present an algorithm to calculate free energies and rates from enhanced sampling simulations on biased potential energy surfaces. Inputs are the accumulated times spent in each state or bin of a histogram, and the transition counts between them. For each of the states\/bins optimal unbiased free energies are obtained by maximizing the likelihood of a master equation (i.e., first-order kinetic rate model). Unbiased rate coefficients for transitions between states can then be estimated. The resulting \u0022dynamic histogram analysis method extended to detailed balance\u0022 (DHAMed) improves on the DHAM method. DHAMed yields accurate free energies in cases where the common weighted-histogram analysis method (WHAM) for umbrella sampling fails because dynamics within the windows is slow. We illustrate DHAMed with applications to proteins and RNAs and accurately estimate free energies from sets of short trajectories, providing a way forward for computational drug design. Our rate formalism can be used to construct Markov state models from biased simulations and we demonstrate its practical applicability by determining RNA folding kinetics from replica exchange molecular dynamics.","bio":"","contributors":[{"type":"Author","first_name":"Lukas S.","last_name":"Stelzl","affiliation":"Max Planck Institute of Biophysics","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Adam","last_name":"Kells","affiliation":"King\u0027s College London","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Edina","last_name":"Rosta","affiliation":"King\u0027s College London","country":"United Kingdom","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Gerhard","last_name":"Hummer","affiliation":"Max Planck Institute of Biophysics","country":"Germany","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Lukas S.","last_name":"Stelzl","affiliation":"Max Planck Institute of Biophysics","country":"Germany","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa120","type":"child","title":"On the Interpretation of Non-Equilibrium MD Trajectories","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"As a researcher in statistical physics, one may often be interested in reducing the complexity of a many-particle system to the study of a set of relevant observables. If the system is in equilibrium, a systematic way to derive an equation of motion for the \u0022relevant\u0022 observables from the microscopic dynamics has been known for some time as the \u0022Mori-Zwanzig\u0022 formalism, which leads to the Langevin equation. In contrast, if the dynamics is not stationary, it is not a priori clear which form the equation of motion for an averaged observable will have. We adapt Mori-Zwanzig formalism to derive the equation of motion for a non-equilibrium trajectory-averaged observable as well as for its non-stationary auto-correlation function. We also derive a fluctuation-dissipation-like relation which relates the memory kernel and the autocorrelation function of the fluctuating force. In addition, we show how to relate the Taylor expansion of the memory kernel to experimental data, thus allowing to construct the equation of motion from direct measurements.","filename":"msa120s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tanja","last_name":"Schilling","affiliation":"University of Freiburg","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Thomas","last_name":"Voigtmann","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Hugues","last_name":"Meyer","affiliation":"University of Luxembourg","country":"Luxembourg","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tanja","last_name":"Schilling","affiliation":"University of Freiburg","country":"Germany","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Tanja","last_name":"Schilling","affiliation":"University of Freiburg","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Thomas","last_name":"Voigtmann","affiliation":"German Aerospace Center","country":"Germany","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Hugues","last_name":"Meyer","affiliation":"University of Luxembourg","country":"Luxembourg","bio":"","order":"3","is_presenter":false}] } Presentation
Organizer(s):
Xavier Lapillonne (MeteoSwiss, Switzerland)
, Valentin Clement (Center for Climate System Modeling, Switzerland)
Track(s):
Climate and Weather
Numerical weather prediction and climate models are large and complex software applications that need to run efficiently on today's and future massively parallel computer systems. The rapid change in these computing architectures and the increase in diversity are seriously affecting the ability to retain a single source code that runs efficiently in different architectures. Several weather models have successfully adapted their codebases to many-core and heterogeneous architectures like GPUs and Xeon Phi using a combination of multiple traditional programming models for parallel architectures like OpenMP, OpenACC and MPI. However porting existing large community codes to multiple architectures is a daunting task and leads to codes that are more complex and difficult to maintain. As a result in the past years numerous new technologies and approaches are emerging in order to provide new programming models, like domain-specific languages (DSLs) or source-to-source translation tools that can increase the productivity of development in weather codes while providing a high degree of performance portability. In this minisymposium we propose a discussion with some of the most prominent novel approaches where the new advances in programming models used for heterogeneous architectures in weather and climate models will be presented.
16:00 - 16:30
Performance Portability for Next Generation HPC Architectures in E3SM via the Kokkos Programming Model
, Luca Bertagna (Sandia National Laboratories, United States of America)
+ Abstract { "session": {"id":"sess158","title":"MS10 - Bridging the Software Productivity Gap for Climate and Weather Models","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Rio Room","contributors":[{"type":"Session Chair","first_name":"Xavier","last_name":"Lapillonne","affiliation":"MeteoSwiss","country":"Switzerland"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Climate and Weather"],"slots":[{"id":"symp119","type":"minisymposia","title":"MS10 - Bridging the Software Productivity Gap for Climate and Weather Models","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Numerical weather prediction and climate models are large and complex software applications that need to run efficiently on today\u0027s and future massively parallel computer systems. The rapid change in these computing architectures and the increase in diversity are seriously affecting the ability to retain a single source code that runs efficiently in different architectures. Several weather models have successfully adapted their codebases to many-core and heterogeneous architectures like GPUs and Xeon Phi using a combination of multiple traditional programming models for parallel architectures like OpenMP, OpenACC and MPI. However porting existing large community codes to multiple architectures is a daunting task and leads to codes that are more complex and difficult to maintain. As a result in the past years numerous new technologies and approaches are emerging in order to provide new programming models, like domain-specific languages (DSLs) or source-to-source translation tools that can increase the productivity of development in weather codes while providing a high degree of performance portability. In this minisymposium we propose a discussion with some of the most prominent novel approaches where the new advances in programming models used for heterogeneous architectures in weather and climate models will be presented.","bio":"","contributors":[{"type":"Organizer","first_name":"Xavier","last_name":"Lapillonne","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Valentin","last_name":"Clement","affiliation":"Center for Climate System Modeling","country":"Switzerland","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Xavier","last_name":"Lapillonne","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa230","type":"child","title":"Experience on Porting Atmosphere Kernels on Many-Core Processors and Accelerators","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This talk includes a summary of our previous work that ports different atmosphere kernels onto various state-of-the-art platforms, including the Sunway TaihuLight system. Performance portability for atmosphere codes is no doubt a big challenge, so great efforts have to be made and patience is required as well. In addition to some experiences and lessons, we also take this opportunity to discuss on the novel Sunway processors. For Sunway system, different software is being developed to make it easy for applications to be ported.","bio":"","contributors":[{"type":"Author","first_name":"Lin","last_name":"Gan","affiliation":"Tsinghua University","country":"China","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Lin","last_name":"Gan","affiliation":"Tsinghua University","country":"China","bio":"","order":"1","is_presenter":true}]},{"id":"msa181","type":"child","title":"Performance Portability for Next Generation HPC Architectures in E3SM via the Kokkos Programming Model","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This work converts the atmospheric dynamical core (HOMME) of the Energy Exascale Earth System Model (E3SM) from the current CPU-centric implementation, in Fortran 90, to a new performance-portable implementation, in C++ with the Kokkos performance-portability framework. HOMME simulates the dynamics and physical processes of the atmosphere. It is the most computationally demanding part of E3SM. Kokkos provides performance-portable multidimensional arrays and intraprocess parallel execution constructs. These form an abstraction layer over the hardware architecture of a compute node within a supercomputer. We will present results for the performance of our implementation on conventional CPU, Intel Xeon Phi, and Nvidia GPU; compare performance with the original Fortran on CPU and Xeon Phi; and discuss details of the implementation.","filename":"msa181s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Luca","last_name":"Bertagna","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Andrew","last_name":"Salinger","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Irina","last_name":"Tezaur","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Andrew","last_name":"Bradley","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Michael","last_name":"Deakin","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"5","is_presenter":false},{"type":"Author","first_name":"Daniel","last_name":"Sunderland","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"6","is_presenter":false},{"type":"Author","first_name":"Oksana","last_name":"Guba","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"7","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Luca","last_name":"Bertagna","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa138","type":"child","title":"Experience Applying the PSyclone Configurable Domain Specific Compiler to the Met Office LFRic Model","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Earth-system models tend to be large, complex codes developed by large teams of scientists over periods of years. However, the scale of the problems to be simulated calls for the highest levels of computational performance. Achieving good performance when both computer architectures and the underlying code base are constantly evolving is a complex challenge. In recent years, the use of Domain-Specific Languages (DSLs) as a potential solution to this problem has begun to be investigated. The UK Met Office\u0027s LFRic project is developing a new, Finite Element dynamical core and has adopted a DSL approach. In this talk we will describe this work and the functionality of the domain-specific compiler, PSyclone, which has been developed to process the (serial) code written by the natural scientists and generate the code required to run on massively parallel machines.","bio":"","contributors":[{"type":"Author","first_name":"Rupert","last_name":"Ford","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Andrew R.","last_name":"Porter","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Sergi","last_name":"Siso","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Rupert","last_name":"Ford","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa254","type":"child","title":"Novel Programming Models for Large Geophysical Fluid Dynamics Models","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Running operationally high resolution (~1km) global weather and climate models will be a milestone for the scientific community since there is clear evidence of the importance of high horizontal resolutions in the quality and accuracy of the simulations. Yet achieving this will pose serious computational challenges for large scientific codes that are developed using traditional programming models such as OpenMP and MPI. In order to adapt models to run efficiently on modern computing architectures and accelerators, numerous domain specific languages (DSL) and libraries that abstract architecture dependent optimizations have been proposed, like the GridTools libraries used operationally for running COSMO on GPUs. Yet these tools are specific to a domain or model, and have little reuse among them of architecture specific optimizers which leads to high maintenance costs. We present a novel programming model based on the GridTools ecosystem of libraries, a toolchain that allows to develop and interoperate various DSL frontends by providing domain and architecture specific optimizers. It aims at standardizing tools for performance portability by proposing a standard intermediate representation for weather and climate codes. We demonstrate the toolchain for the COSMO regional model and evaluate performance results compared to the operational model running on NVIDIA GPUs.","filename":"msa254s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Tobias","last_name":"Wicky","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Stefan","last_name":"Moosbrugger","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Oliver","last_name":"Fuhrer","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Torsten","last_name":"Hoefler","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa181","type":"child","title":"Performance Portability for Next Generation HPC Architectures in E3SM via the Kokkos Programming Model","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This work converts the atmospheric dynamical core (HOMME) of the Energy Exascale Earth System Model (E3SM) from the current CPU-centric implementation, in Fortran 90, to a new performance-portable implementation, in C++ with the Kokkos performance-portability framework. HOMME simulates the dynamics and physical processes of the atmosphere. It is the most computationally demanding part of E3SM. Kokkos provides performance-portable multidimensional arrays and intraprocess parallel execution constructs. These form an abstraction layer over the hardware architecture of a compute node within a supercomputer. We will present results for the performance of our implementation on conventional CPU, Intel Xeon Phi, and Nvidia GPU; compare performance with the original Fortran on CPU and Xeon Phi; and discuss details of the implementation.","filename":"msa181s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Luca","last_name":"Bertagna","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Andrew","last_name":"Salinger","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Irina","last_name":"Tezaur","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Andrew","last_name":"Bradley","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Michael","last_name":"Deakin","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"5","is_presenter":false},{"type":"Author","first_name":"Daniel","last_name":"Sunderland","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"6","is_presenter":false},{"type":"Author","first_name":"Oksana","last_name":"Guba","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"7","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Luca","last_name":"Bertagna","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Luca","last_name":"Bertagna","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Andrew","last_name":"Salinger","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Irina","last_name":"Tezaur","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Andrew","last_name":"Bradley","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Michael","last_name":"Deakin","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"5","is_presenter":false},{"type":"Author","first_name":"Daniel","last_name":"Sunderland","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"6","is_presenter":false},{"type":"Author","first_name":"Oksana","last_name":"Guba","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"7","is_presenter":false}] } Presentation
17:00 - 17:30
Novel Programming Models for Large Geophysical Fluid Dynamics Models
, Carlos E. Osuna (MeteoSwiss, Switzerland)
+ Abstract { "session": {"id":"sess158","title":"MS10 - Bridging the Software Productivity Gap for Climate and Weather Models","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Rio Room","contributors":[{"type":"Session Chair","first_name":"Xavier","last_name":"Lapillonne","affiliation":"MeteoSwiss","country":"Switzerland"}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Climate and Weather"],"slots":[{"id":"symp119","type":"minisymposia","title":"MS10 - Bridging the Software Productivity Gap for Climate and Weather Models","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Numerical weather prediction and climate models are large and complex software applications that need to run efficiently on today\u0027s and future massively parallel computer systems. The rapid change in these computing architectures and the increase in diversity are seriously affecting the ability to retain a single source code that runs efficiently in different architectures. Several weather models have successfully adapted their codebases to many-core and heterogeneous architectures like GPUs and Xeon Phi using a combination of multiple traditional programming models for parallel architectures like OpenMP, OpenACC and MPI. However porting existing large community codes to multiple architectures is a daunting task and leads to codes that are more complex and difficult to maintain. As a result in the past years numerous new technologies and approaches are emerging in order to provide new programming models, like domain-specific languages (DSLs) or source-to-source translation tools that can increase the productivity of development in weather codes while providing a high degree of performance portability. In this minisymposium we propose a discussion with some of the most prominent novel approaches where the new advances in programming models used for heterogeneous architectures in weather and climate models will be presented.","bio":"","contributors":[{"type":"Organizer","first_name":"Xavier","last_name":"Lapillonne","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Valentin","last_name":"Clement","affiliation":"Center for Climate System Modeling","country":"Switzerland","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Xavier","last_name":"Lapillonne","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa230","type":"child","title":"Experience on Porting Atmosphere Kernels on Many-Core Processors and Accelerators","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This talk includes a summary of our previous work that ports different atmosphere kernels onto various state-of-the-art platforms, including the Sunway TaihuLight system. Performance portability for atmosphere codes is no doubt a big challenge, so great efforts have to be made and patience is required as well. In addition to some experiences and lessons, we also take this opportunity to discuss on the novel Sunway processors. For Sunway system, different software is being developed to make it easy for applications to be ported.","bio":"","contributors":[{"type":"Author","first_name":"Lin","last_name":"Gan","affiliation":"Tsinghua University","country":"China","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Lin","last_name":"Gan","affiliation":"Tsinghua University","country":"China","bio":"","order":"1","is_presenter":true}]},{"id":"msa181","type":"child","title":"Performance Portability for Next Generation HPC Architectures in E3SM via the Kokkos Programming Model","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This work converts the atmospheric dynamical core (HOMME) of the Energy Exascale Earth System Model (E3SM) from the current CPU-centric implementation, in Fortran 90, to a new performance-portable implementation, in C++ with the Kokkos performance-portability framework. HOMME simulates the dynamics and physical processes of the atmosphere. It is the most computationally demanding part of E3SM. Kokkos provides performance-portable multidimensional arrays and intraprocess parallel execution constructs. These form an abstraction layer over the hardware architecture of a compute node within a supercomputer. We will present results for the performance of our implementation on conventional CPU, Intel Xeon Phi, and Nvidia GPU; compare performance with the original Fortran on CPU and Xeon Phi; and discuss details of the implementation.","filename":"msa181s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Luca","last_name":"Bertagna","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Andrew","last_name":"Salinger","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Irina","last_name":"Tezaur","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Andrew","last_name":"Bradley","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Michael","last_name":"Deakin","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"5","is_presenter":false},{"type":"Author","first_name":"Daniel","last_name":"Sunderland","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"6","is_presenter":false},{"type":"Author","first_name":"Oksana","last_name":"Guba","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"7","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Luca","last_name":"Bertagna","affiliation":"Sandia National Laboratories","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa138","type":"child","title":"Experience Applying the PSyclone Configurable Domain Specific Compiler to the Met Office LFRic Model","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Earth-system models tend to be large, complex codes developed by large teams of scientists over periods of years. However, the scale of the problems to be simulated calls for the highest levels of computational performance. Achieving good performance when both computer architectures and the underlying code base are constantly evolving is a complex challenge. In recent years, the use of Domain-Specific Languages (DSLs) as a potential solution to this problem has begun to be investigated. The UK Met Office\u0027s LFRic project is developing a new, Finite Element dynamical core and has adopted a DSL approach. In this talk we will describe this work and the functionality of the domain-specific compiler, PSyclone, which has been developed to process the (serial) code written by the natural scientists and generate the code required to run on massively parallel machines.","bio":"","contributors":[{"type":"Author","first_name":"Rupert","last_name":"Ford","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Andrew R.","last_name":"Porter","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Sergi","last_name":"Siso","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Rupert","last_name":"Ford","affiliation":"Science and Technology Facilities Council","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa254","type":"child","title":"Novel Programming Models for Large Geophysical Fluid Dynamics Models","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Running operationally high resolution (~1km) global weather and climate models will be a milestone for the scientific community since there is clear evidence of the importance of high horizontal resolutions in the quality and accuracy of the simulations. Yet achieving this will pose serious computational challenges for large scientific codes that are developed using traditional programming models such as OpenMP and MPI. In order to adapt models to run efficiently on modern computing architectures and accelerators, numerous domain specific languages (DSL) and libraries that abstract architecture dependent optimizations have been proposed, like the GridTools libraries used operationally for running COSMO on GPUs. Yet these tools are specific to a domain or model, and have little reuse among them of architecture specific optimizers which leads to high maintenance costs. We present a novel programming model based on the GridTools ecosystem of libraries, a toolchain that allows to develop and interoperate various DSL frontends by providing domain and architecture specific optimizers. It aims at standardizing tools for performance portability by proposing a standard intermediate representation for weather and climate codes. We demonstrate the toolchain for the COSMO regional model and evaluate performance results compared to the operational model running on NVIDIA GPUs.","filename":"msa254s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Tobias","last_name":"Wicky","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Stefan","last_name":"Moosbrugger","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Oliver","last_name":"Fuhrer","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Torsten","last_name":"Hoefler","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa254","type":"child","title":"Novel Programming Models for Large Geophysical Fluid Dynamics Models","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Running operationally high resolution (~1km) global weather and climate models will be a milestone for the scientific community since there is clear evidence of the importance of high horizontal resolutions in the quality and accuracy of the simulations. Yet achieving this will pose serious computational challenges for large scientific codes that are developed using traditional programming models such as OpenMP and MPI. In order to adapt models to run efficiently on modern computing architectures and accelerators, numerous domain specific languages (DSL) and libraries that abstract architecture dependent optimizations have been proposed, like the GridTools libraries used operationally for running COSMO on GPUs. Yet these tools are specific to a domain or model, and have little reuse among them of architecture specific optimizers which leads to high maintenance costs. We present a novel programming model based on the GridTools ecosystem of libraries, a toolchain that allows to develop and interoperate various DSL frontends by providing domain and architecture specific optimizers. It aims at standardizing tools for performance portability by proposing a standard intermediate representation for weather and climate codes. We demonstrate the toolchain for the COSMO regional model and evaluate performance results compared to the operational model running on NVIDIA GPUs.","filename":"msa254s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Tobias","last_name":"Wicky","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Stefan","last_name":"Moosbrugger","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Oliver","last_name":"Fuhrer","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Torsten","last_name":"Hoefler","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Carlos E.","last_name":"Osuna","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Tobias","last_name":"Wicky","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Stefan","last_name":"Moosbrugger","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Oliver","last_name":"Fuhrer","affiliation":"MeteoSwiss","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Torsten","last_name":"Hoefler","affiliation":"ETH Zurich","country":"Switzerland","bio":"","order":"5","is_presenter":false}] } Presentation
Organizer(s):
Michel Juillard (Banque de France, France)
Track(s):
Emerging Application Domains
Many important economic phenomena relate to the notion of risk. Economic actors not only make decisions as a function of their current situation but also depending on their expectation of future developments. Because economic systems are not deterministic, future economic events are usually treated as stochastic phenomena. The general form of the problem at hand is to determine how the probabilistic distribution of future economic events influences current decisions. The wider the distribution, the more risk in today's decisions. The papers in this session present different computation challenges involved in attempting to describe the effect of risk on economic decisions.
15:30 - 16:00
Approximating Equilibria with Ex-Post Heterogeneity and Aggregate Risk
, Elisabeth Proehl (University of Geneva, Switzerland)
+ Abstract { "session": {"id":"sess163","title":"MS11 - Computing the Effect of Risk","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Montreal Room","contributors":[{"type":"Session Chair","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains"],"slots":[{"id":"symp134","type":"minisymposia","title":"MS11 - Computing the Effect of Risk","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Many important economic phenomena relate to the notion of risk. Economic actors not only make decisions as a function of their current situation but also depending on their expectation of future developments. Because economic systems are not deterministic, future economic events are usually treated as stochastic phenomena. The general form of the problem at hand is to determine how the probabilistic distribution of future economic events influences current decisions. The wider the distribution, the more risk in today\u0027s decisions. The papers in this session present different computation challenges involved in attempting to describe the effect of risk on economic decisions.","bio":"","contributors":[{"type":"Organizer","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]},{"id":"msa189","type":"child","title":"Approximating Equilibria with Ex-Post Heterogeneity and Aggregate Risk","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Dynamic stochastic general equilibrium models with ex-post heterogeneity due to idiosyncratic risk have to be solved numerically. This is a nontrivial task as the cross-sectional distribution of endogenous variables becomes an element of the state space due to aggregate risk. Existing global solution methods have assumed bounded rationality in terms of a parametric law of motion of aggregate variables in order to reduce dimensionality. In this paper,\u00a0I remove that assumption and compute a fully rational equilibrium dependent on the whole cross-sectional distribution. Dimensionality is tackled by polynomial chaos expansions, a projection technique for square-integrable random variables, resulting in a nonparametric law of motion.\u00a0I establish conditions under which the method converges and approximation error bounds. To illustrate the method, I compute the Aiyagari-Bewley growth model and the Huggett model with aggregate risk. In\u00a0the former,\u00a0I find that the bounded rationality assumption leads to significantly more inequality than in a fully rational equilibrium. Furthermore, more risk sharing in form of redistribution can lead to higher systemic risk. In the latter model, I find that prices increase with more stringent selling constraints, but are also more negatively skewed.","filename":"msa189s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa122","type":"child","title":"The Extended Perturbation Method","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This presentation introduces the extended perturbation method, which improves upon standard perturbation by removing approximation errors under certainty equivalence. Using the neoclassical growth model and a New Keynesian model, we show that extended perturbation achieves higher accuracy than standard perturbation when using third order approximations. We also show that extended perturbation generates stable approximations even when standard perturbation explodes. This paper also adds to the literature on downward nominal wage rigidities in the New Keynesian model, by showing that this friction only plays a significant role when using standard perturbation but not when using the more accurate extended perturbation approximation.","filename":"msa122s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Anders","last_name":"Kronborg","affiliation":"Danish National Bank","country":"Denmark","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true}]},{"id":"msa218","type":"child","title":"Back in Time. Fast. Improved Time Iterations.","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We consider a new solution algorithm to solve nonlinear economic models using projections. For Bellman problems, our method is a variant of Howard\u0027s improvement steps. Contrary to the original improvements, it generalizes to models specified by equilibrium conditions in which case it is equivalent to the Newton-Raphson algorithm applied to one big nonlinear system of equations, without requiring the explicit inversion of the (memory-hungry) Jacobian. In particular, convergence is quadratic, i.e. much faster than regular time-iterations. Convergence of each gradient improvement step requires the (local) contractivity of the time-iterations operator. We show how this property relates to eigenvalues coming from local perturbation analysis, and how to estimate the local spectral radius of this operator close to a solution candidate. Gradient improvements can be implemented easily, essentially by composing the same elements as time-iterations. Our timing comparisons still suggest it performs much faster, especially when the number of dimensions or the number of grid points increase.","filename":"msa218s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa175","type":"child","title":"Taking Risk into Account with Higher-Order Approximations","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In a nonlinear model, expectation of future shocks entails expected benefits or expected losses. Rational agents can make decisions today so as to maximize expected benefits or minimize expected loss. These behaviors are related to economic concepts such as precautionary saving, asset prices, risk premium, term premium. By contrast, linear models are characterized by certainty equivalence, and in such environments, agents are indifferent to future uncertainty. One of the major benefits of using higher order approximation of a certain class of economic models, is the ability to analyse attitude towards risk. Computing higher-order approximation of DSGE models involves several computational challenges. Derivatives of the original model must be evaluated. These high dimensional objects must be stored in a convenient manner. Above second order, computations involve tensor algebra. A key component is a fast implementation of the Faa Di Bruno formula for the derivatives of the composition of two functions. Until now, all these steps have only been programmed in C++ in dynare++. They represent challenging tasks for a rather new programming language such as Julia and an interesting test case.","filename":"msa175s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa189","type":"child","title":"Approximating Equilibria with Ex-Post Heterogeneity and Aggregate Risk","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Dynamic stochastic general equilibrium models with ex-post heterogeneity due to idiosyncratic risk have to be solved numerically. This is a nontrivial task as the cross-sectional distribution of endogenous variables becomes an element of the state space due to aggregate risk. Existing global solution methods have assumed bounded rationality in terms of a parametric law of motion of aggregate variables in order to reduce dimensionality. In this paper,\u00a0I remove that assumption and compute a fully rational equilibrium dependent on the whole cross-sectional distribution. Dimensionality is tackled by polynomial chaos expansions, a projection technique for square-integrable random variables, resulting in a nonparametric law of motion.\u00a0I establish conditions under which the method converges and approximation error bounds. To illustrate the method, I compute the Aiyagari-Bewley growth model and the Huggett model with aggregate risk. In\u00a0the former,\u00a0I find that the bounded rationality assumption leads to significantly more inequality than in a fully rational equilibrium. Furthermore, more risk sharing in form of redistribution can lead to higher systemic risk. In the latter model, I find that prices increase with more stringent selling constraints, but are also more negatively skewed.","filename":"msa189s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}] } Presentation
16:00 - 16:30
The Extended Perturbation Method
, Martin M. Andreasen (Aarhus University, Denmark)
+ Abstract { "session": {"id":"sess163","title":"MS11 - Computing the Effect of Risk","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Montreal Room","contributors":[{"type":"Session Chair","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains"],"slots":[{"id":"symp134","type":"minisymposia","title":"MS11 - Computing the Effect of Risk","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Many important economic phenomena relate to the notion of risk. Economic actors not only make decisions as a function of their current situation but also depending on their expectation of future developments. Because economic systems are not deterministic, future economic events are usually treated as stochastic phenomena. The general form of the problem at hand is to determine how the probabilistic distribution of future economic events influences current decisions. The wider the distribution, the more risk in today\u0027s decisions. The papers in this session present different computation challenges involved in attempting to describe the effect of risk on economic decisions.","bio":"","contributors":[{"type":"Organizer","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]},{"id":"msa189","type":"child","title":"Approximating Equilibria with Ex-Post Heterogeneity and Aggregate Risk","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Dynamic stochastic general equilibrium models with ex-post heterogeneity due to idiosyncratic risk have to be solved numerically. This is a nontrivial task as the cross-sectional distribution of endogenous variables becomes an element of the state space due to aggregate risk. Existing global solution methods have assumed bounded rationality in terms of a parametric law of motion of aggregate variables in order to reduce dimensionality. In this paper,\u00a0I remove that assumption and compute a fully rational equilibrium dependent on the whole cross-sectional distribution. Dimensionality is tackled by polynomial chaos expansions, a projection technique for square-integrable random variables, resulting in a nonparametric law of motion.\u00a0I establish conditions under which the method converges and approximation error bounds. To illustrate the method, I compute the Aiyagari-Bewley growth model and the Huggett model with aggregate risk. In\u00a0the former,\u00a0I find that the bounded rationality assumption leads to significantly more inequality than in a fully rational equilibrium. Furthermore, more risk sharing in form of redistribution can lead to higher systemic risk. In the latter model, I find that prices increase with more stringent selling constraints, but are also more negatively skewed.","filename":"msa189s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa122","type":"child","title":"The Extended Perturbation Method","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This presentation introduces the extended perturbation method, which improves upon standard perturbation by removing approximation errors under certainty equivalence. Using the neoclassical growth model and a New Keynesian model, we show that extended perturbation achieves higher accuracy than standard perturbation when using third order approximations. We also show that extended perturbation generates stable approximations even when standard perturbation explodes. This paper also adds to the literature on downward nominal wage rigidities in the New Keynesian model, by showing that this friction only plays a significant role when using standard perturbation but not when using the more accurate extended perturbation approximation.","filename":"msa122s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Anders","last_name":"Kronborg","affiliation":"Danish National Bank","country":"Denmark","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true}]},{"id":"msa218","type":"child","title":"Back in Time. Fast. Improved Time Iterations.","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We consider a new solution algorithm to solve nonlinear economic models using projections. For Bellman problems, our method is a variant of Howard\u0027s improvement steps. Contrary to the original improvements, it generalizes to models specified by equilibrium conditions in which case it is equivalent to the Newton-Raphson algorithm applied to one big nonlinear system of equations, without requiring the explicit inversion of the (memory-hungry) Jacobian. In particular, convergence is quadratic, i.e. much faster than regular time-iterations. Convergence of each gradient improvement step requires the (local) contractivity of the time-iterations operator. We show how this property relates to eigenvalues coming from local perturbation analysis, and how to estimate the local spectral radius of this operator close to a solution candidate. Gradient improvements can be implemented easily, essentially by composing the same elements as time-iterations. Our timing comparisons still suggest it performs much faster, especially when the number of dimensions or the number of grid points increase.","filename":"msa218s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa175","type":"child","title":"Taking Risk into Account with Higher-Order Approximations","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In a nonlinear model, expectation of future shocks entails expected benefits or expected losses. Rational agents can make decisions today so as to maximize expected benefits or minimize expected loss. These behaviors are related to economic concepts such as precautionary saving, asset prices, risk premium, term premium. By contrast, linear models are characterized by certainty equivalence, and in such environments, agents are indifferent to future uncertainty. One of the major benefits of using higher order approximation of a certain class of economic models, is the ability to analyse attitude towards risk. Computing higher-order approximation of DSGE models involves several computational challenges. Derivatives of the original model must be evaluated. These high dimensional objects must be stored in a convenient manner. Above second order, computations involve tensor algebra. A key component is a fast implementation of the Faa Di Bruno formula for the derivatives of the composition of two functions. Until now, all these steps have only been programmed in C++ in dynare++. They represent challenging tasks for a rather new programming language such as Julia and an interesting test case.","filename":"msa175s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa122","type":"child","title":"The Extended Perturbation Method","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This presentation introduces the extended perturbation method, which improves upon standard perturbation by removing approximation errors under certainty equivalence. Using the neoclassical growth model and a New Keynesian model, we show that extended perturbation achieves higher accuracy than standard perturbation when using third order approximations. We also show that extended perturbation generates stable approximations even when standard perturbation explodes. This paper also adds to the literature on downward nominal wage rigidities in the New Keynesian model, by showing that this friction only plays a significant role when using standard perturbation but not when using the more accurate extended perturbation approximation.","filename":"msa122s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Anders","last_name":"Kronborg","affiliation":"Danish National Bank","country":"Denmark","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Anders","last_name":"Kronborg","affiliation":"Danish National Bank","country":"Denmark","bio":"","order":"2","is_presenter":false}] } Presentation
16:30 - 17:00
Back in Time. Fast. Improved Time Iterations.
, Pablo Winant (Bank of England, United Kingdom)
+ Abstract { "session": {"id":"sess163","title":"MS11 - Computing the Effect of Risk","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Montreal Room","contributors":[{"type":"Session Chair","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains"],"slots":[{"id":"symp134","type":"minisymposia","title":"MS11 - Computing the Effect of Risk","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Many important economic phenomena relate to the notion of risk. Economic actors not only make decisions as a function of their current situation but also depending on their expectation of future developments. Because economic systems are not deterministic, future economic events are usually treated as stochastic phenomena. The general form of the problem at hand is to determine how the probabilistic distribution of future economic events influences current decisions. The wider the distribution, the more risk in today\u0027s decisions. The papers in this session present different computation challenges involved in attempting to describe the effect of risk on economic decisions.","bio":"","contributors":[{"type":"Organizer","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]},{"id":"msa189","type":"child","title":"Approximating Equilibria with Ex-Post Heterogeneity and Aggregate Risk","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Dynamic stochastic general equilibrium models with ex-post heterogeneity due to idiosyncratic risk have to be solved numerically. This is a nontrivial task as the cross-sectional distribution of endogenous variables becomes an element of the state space due to aggregate risk. Existing global solution methods have assumed bounded rationality in terms of a parametric law of motion of aggregate variables in order to reduce dimensionality. In this paper,\u00a0I remove that assumption and compute a fully rational equilibrium dependent on the whole cross-sectional distribution. Dimensionality is tackled by polynomial chaos expansions, a projection technique for square-integrable random variables, resulting in a nonparametric law of motion.\u00a0I establish conditions under which the method converges and approximation error bounds. To illustrate the method, I compute the Aiyagari-Bewley growth model and the Huggett model with aggregate risk. In\u00a0the former,\u00a0I find that the bounded rationality assumption leads to significantly more inequality than in a fully rational equilibrium. Furthermore, more risk sharing in form of redistribution can lead to higher systemic risk. In the latter model, I find that prices increase with more stringent selling constraints, but are also more negatively skewed.","filename":"msa189s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa122","type":"child","title":"The Extended Perturbation Method","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This presentation introduces the extended perturbation method, which improves upon standard perturbation by removing approximation errors under certainty equivalence. Using the neoclassical growth model and a New Keynesian model, we show that extended perturbation achieves higher accuracy than standard perturbation when using third order approximations. We also show that extended perturbation generates stable approximations even when standard perturbation explodes. This paper also adds to the literature on downward nominal wage rigidities in the New Keynesian model, by showing that this friction only plays a significant role when using standard perturbation but not when using the more accurate extended perturbation approximation.","filename":"msa122s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Anders","last_name":"Kronborg","affiliation":"Danish National Bank","country":"Denmark","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true}]},{"id":"msa218","type":"child","title":"Back in Time. Fast. Improved Time Iterations.","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We consider a new solution algorithm to solve nonlinear economic models using projections. For Bellman problems, our method is a variant of Howard\u0027s improvement steps. Contrary to the original improvements, it generalizes to models specified by equilibrium conditions in which case it is equivalent to the Newton-Raphson algorithm applied to one big nonlinear system of equations, without requiring the explicit inversion of the (memory-hungry) Jacobian. In particular, convergence is quadratic, i.e. much faster than regular time-iterations. Convergence of each gradient improvement step requires the (local) contractivity of the time-iterations operator. We show how this property relates to eigenvalues coming from local perturbation analysis, and how to estimate the local spectral radius of this operator close to a solution candidate. Gradient improvements can be implemented easily, essentially by composing the same elements as time-iterations. Our timing comparisons still suggest it performs much faster, especially when the number of dimensions or the number of grid points increase.","filename":"msa218s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa175","type":"child","title":"Taking Risk into Account with Higher-Order Approximations","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In a nonlinear model, expectation of future shocks entails expected benefits or expected losses. Rational agents can make decisions today so as to maximize expected benefits or minimize expected loss. These behaviors are related to economic concepts such as precautionary saving, asset prices, risk premium, term premium. By contrast, linear models are characterized by certainty equivalence, and in such environments, agents are indifferent to future uncertainty. One of the major benefits of using higher order approximation of a certain class of economic models, is the ability to analyse attitude towards risk. Computing higher-order approximation of DSGE models involves several computational challenges. Derivatives of the original model must be evaluated. These high dimensional objects must be stored in a convenient manner. Above second order, computations involve tensor algebra. A key component is a fast implementation of the Faa Di Bruno formula for the derivatives of the composition of two functions. Until now, all these steps have only been programmed in C++ in dynare++. They represent challenging tasks for a rather new programming language such as Julia and an interesting test case.","filename":"msa175s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa218","type":"child","title":"Back in Time. Fast. Improved Time Iterations.","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We consider a new solution algorithm to solve nonlinear economic models using projections. For Bellman problems, our method is a variant of Howard\u0027s improvement steps. Contrary to the original improvements, it generalizes to models specified by equilibrium conditions in which case it is equivalent to the Newton-Raphson algorithm applied to one big nonlinear system of equations, without requiring the explicit inversion of the (memory-hungry) Jacobian. In particular, convergence is quadratic, i.e. much faster than regular time-iterations. Convergence of each gradient improvement step requires the (local) contractivity of the time-iterations operator. We show how this property relates to eigenvalues coming from local perturbation analysis, and how to estimate the local spectral radius of this operator close to a solution candidate. Gradient improvements can be implemented easily, essentially by composing the same elements as time-iterations. Our timing comparisons still suggest it performs much faster, especially when the number of dimensions or the number of grid points increase.","filename":"msa218s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}] } Presentation
17:00 - 17:30
Taking Risk into Account with Higher-Order Approximations
, Michel Juillard (Banque de France, France)
+ Abstract { "session": {"id":"sess163","title":"MS11 - Computing the Effect of Risk","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Montreal Room","contributors":[{"type":"Session Chair","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains"],"slots":[{"id":"symp134","type":"minisymposia","title":"MS11 - Computing the Effect of Risk","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Many important economic phenomena relate to the notion of risk. Economic actors not only make decisions as a function of their current situation but also depending on their expectation of future developments. Because economic systems are not deterministic, future economic events are usually treated as stochastic phenomena. The general form of the problem at hand is to determine how the probabilistic distribution of future economic events influences current decisions. The wider the distribution, the more risk in today\u0027s decisions. The papers in this session present different computation challenges involved in attempting to describe the effect of risk on economic decisions.","bio":"","contributors":[{"type":"Organizer","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]},{"id":"msa189","type":"child","title":"Approximating Equilibria with Ex-Post Heterogeneity and Aggregate Risk","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Dynamic stochastic general equilibrium models with ex-post heterogeneity due to idiosyncratic risk have to be solved numerically. This is a nontrivial task as the cross-sectional distribution of endogenous variables becomes an element of the state space due to aggregate risk. Existing global solution methods have assumed bounded rationality in terms of a parametric law of motion of aggregate variables in order to reduce dimensionality. In this paper,\u00a0I remove that assumption and compute a fully rational equilibrium dependent on the whole cross-sectional distribution. Dimensionality is tackled by polynomial chaos expansions, a projection technique for square-integrable random variables, resulting in a nonparametric law of motion.\u00a0I establish conditions under which the method converges and approximation error bounds. To illustrate the method, I compute the Aiyagari-Bewley growth model and the Huggett model with aggregate risk. In\u00a0the former,\u00a0I find that the bounded rationality assumption leads to significantly more inequality than in a fully rational equilibrium. Furthermore, more risk sharing in form of redistribution can lead to higher systemic risk. In the latter model, I find that prices increase with more stringent selling constraints, but are also more negatively skewed.","filename":"msa189s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Elisabeth","last_name":"Proehl","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa122","type":"child","title":"The Extended Perturbation Method","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"This presentation introduces the extended perturbation method, which improves upon standard perturbation by removing approximation errors under certainty equivalence. Using the neoclassical growth model and a New Keynesian model, we show that extended perturbation achieves higher accuracy than standard perturbation when using third order approximations. We also show that extended perturbation generates stable approximations even when standard perturbation explodes. This paper also adds to the literature on downward nominal wage rigidities in the New Keynesian model, by showing that this friction only plays a significant role when using standard perturbation but not when using the more accurate extended perturbation approximation.","filename":"msa122s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Anders","last_name":"Kronborg","affiliation":"Danish National Bank","country":"Denmark","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Martin M.","last_name":"Andreasen","affiliation":"Aarhus University","country":"Denmark","bio":"","order":"1","is_presenter":true}]},{"id":"msa218","type":"child","title":"Back in Time. Fast. Improved Time Iterations.","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We consider a new solution algorithm to solve nonlinear economic models using projections. For Bellman problems, our method is a variant of Howard\u0027s improvement steps. Contrary to the original improvements, it generalizes to models specified by equilibrium conditions in which case it is equivalent to the Newton-Raphson algorithm applied to one big nonlinear system of equations, without requiring the explicit inversion of the (memory-hungry) Jacobian. In particular, convergence is quadratic, i.e. much faster than regular time-iterations. Convergence of each gradient improvement step requires the (local) contractivity of the time-iterations operator. We show how this property relates to eigenvalues coming from local perturbation analysis, and how to estimate the local spectral radius of this operator close to a solution candidate. Gradient improvements can be implemented easily, essentially by composing the same elements as time-iterations. Our timing comparisons still suggest it performs much faster, especially when the number of dimensions or the number of grid points increase.","filename":"msa218s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Pablo","last_name":"Winant","affiliation":"Bank of England","country":"United Kingdom","bio":"","order":"1","is_presenter":true}]},{"id":"msa175","type":"child","title":"Taking Risk into Account with Higher-Order Approximations","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In a nonlinear model, expectation of future shocks entails expected benefits or expected losses. Rational agents can make decisions today so as to maximize expected benefits or minimize expected loss. These behaviors are related to economic concepts such as precautionary saving, asset prices, risk premium, term premium. By contrast, linear models are characterized by certainty equivalence, and in such environments, agents are indifferent to future uncertainty. One of the major benefits of using higher order approximation of a certain class of economic models, is the ability to analyse attitude towards risk. Computing higher-order approximation of DSGE models involves several computational challenges. Derivatives of the original model must be evaluated. These high dimensional objects must be stored in a convenient manner. Above second order, computations involve tensor algebra. A key component is a fast implementation of the Faa Di Bruno formula for the derivatives of the composition of two functions. Until now, all these steps have only been programmed in C++ in dynare++. They represent challenging tasks for a rather new programming language such as Julia and an interesting test case.","filename":"msa175s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa175","type":"child","title":"Taking Risk into Account with Higher-Order Approximations","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In a nonlinear model, expectation of future shocks entails expected benefits or expected losses. Rational agents can make decisions today so as to maximize expected benefits or minimize expected loss. These behaviors are related to economic concepts such as precautionary saving, asset prices, risk premium, term premium. By contrast, linear models are characterized by certainty equivalence, and in such environments, agents are indifferent to future uncertainty. One of the major benefits of using higher order approximation of a certain class of economic models, is the ability to analyse attitude towards risk. Computing higher-order approximation of DSGE models involves several computational challenges. Derivatives of the original model must be evaluated. These high dimensional objects must be stored in a convenient manner. Above second order, computations involve tensor algebra. A key component is a fast implementation of the Faa Di Bruno formula for the derivatives of the composition of two functions. Until now, all these steps have only been programmed in C++ in dynare++. They represent challenging tasks for a rather new programming language such as Julia and an interesting test case.","filename":"msa175s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Michel","last_name":"Juillard","affiliation":"Banque de France","country":"France","bio":"","order":"1","is_presenter":true}] } Presentation
Organizer(s):
Guido Juckeland (Helmholtz-Zentrum Dresden-Rossendorf, Germany)
Track(s):
Emerging Application Domains, Computer Science and Applied Mathematics
The dawn of Web 2.0 applications, smartphones/tablets and the omnipresent yet invisible cloud computing have dramatically changed our perception of the IT landscape in the last decade. At the same time these technologies delivered an abundance of new tools that are more suited to the needs of domain scientists: GitHub/GitLab with their inherent support for agile programming, user space package managers for easy software installations even when facing complex dependencies, and web portals to HPC systems or private clouds so that no prior knowledge is needed to use state-of-the-art compute resources. This minisymposium will showcase all these tools and how they are used in real scientific workflows. The shown best practices are easy to reproduce since they are all based on freely available software packages, so that the audience can use the presentations both as an inspiration but also as a kickstart for their own better science.
15:30 - 16:00
HPC-as-a-Service to Domain Scientists
, Sunita Chandrasekaran (University of Delaware, United States of America)
+ Abstract { "session": {"id":"sess169","title":"MS12 - Engineering Scientific Software in times of Agile Development, Continuous Integration and Cloud Computing","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Sydney Room","contributors":[{"type":"Session Chair","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics"],"slots":[{"id":"symp128","type":"minisymposia","title":"MS12 - Engineering Scientific Software in times of Agile Development, Continuous Integration and Cloud Computing","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"The dawn of Web 2.0 applications, smartphones\/tablets and the omnipresent yet invisible cloud computing have dramatically changed our perception of the IT landscape in the last decade. At the same time these technologies delivered an abundance of new tools that are more suited to the needs of domain scientists: GitHub\/GitLab with their inherent support for agile programming, user space package managers for easy software installations even when facing complex dependencies, and web portals to HPC systems or private clouds so that no prior knowledge is needed to use state-of-the-art compute resources. This minisymposium will showcase all these tools and how they are used in real scientific workflows. The shown best practices are easy to reproduce since they are all based on freely available software packages, so that the audience can use the presentations both as an inspiration but also as a kickstart for their own better science.","bio":"","contributors":[{"type":"Organizer","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa137","type":"child","title":"HPC-as-a-Service to Domain Scientists","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Applications-as-a-service operating modes have changed the computing landscape in a multi-disciplinary research laboratory both from a user\u0027s and an HPC operator\u0027s perspective. Lately, applications are being offered as web-based user interfaces regardless of the actual location of the computation. To this end, the cloud computing revolution has had a wonderful side effect that everybody can now easily accept that certain tasks are transparently performed elsewhere - this talk will give an example from the bioinformatics application domain showing how cloud resources can be used for DNA sequencing. As such more and more HPC centers offer web-portals to access their systems along with applications developers also offering a web-based front-end so that the \u0022obscure green font on black screen magic\u0022 of a typical SSH session is hidden from the end user. This enables both new groups to use HPC systems but also provides users a more error-proof and efficient way of using installed applications. This talk will highlight the criticality of an application-as-a service mode and will also discuss how Docker containers and Jupyter notebooks can be used for this easy-to-use application-as-a-service notion. The parallel benchmark suite from SPEC HPG organization will be used for demo purposes.","filename":"msa137s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Sunita","last_name":"Chandrasekaran","affiliation":"University of Delaware","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Sunita","last_name":"Chandrasekaran","affiliation":"University of Delaware","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa226","type":"child","title":"The Reality of Scientific Software Development is Agile - Best Practices and Lessons Learned","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The reality of the development of scientific software is often far from the clearly structured, cascading work packages that grant applications require, but rather in the spirit of agile programming: Start from a working minimal prototype, always have running code, work in sprints (typically towards the end of reporting periods). Those characteristics actually match rather well onto the concept of agile programming. This talk will explain the principles of agile software development, the existing software tool support using the free-of-charge GitLab\u00a0Libre Edition as an example, and the implementation of the processes, including team communication, continuous integration and publication of documentation.","bio":"","contributors":[{"type":"Author","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Tobias","last_name":"Frust","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa212","type":"child","title":"Using Jetstream and High Performance Remote Research Desktops to Lower the Barrier of Entry for HPC Resources","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Indiana University operates two environments designed to lower the barrier of entry for HPC resources. One is the NSF Jetstream project, the first NSF funded cloud designed for those who have not previously used high performance computing resources. Jetstream provides users with long running virtual machines with a customizable software stack to meet the needs of non-traditional HPC applications. The other environment is a research desktop solution that is making high performance Linux desktops available remotely. The desktops contain all the normal HPC command line tools and allow for direct job submission to the HPC machines, but also provide access to interactive applications like Matlab, Comsol Multiphysics, R-Studio and Jupyter. The goal of both projects is to lower the barrier of entry and broaden adoption of traditional HPC and high-throughput computing environments. The talk will provide an architectural overview, use cases and experiences for operating such environments.","filename":"msa212s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Robert","last_name":"Henschel","affiliation":"Indiana University","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Dave","last_name":"Hancock","affiliation":"Indiana University","country":"United States of America","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Robert","last_name":"Henschel","affiliation":"Indiana University","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa213","type":"child","title":"Spack: A Package Manager for Scientific Software","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"On mainstream Linux distributions, package managers simplify the software installation process by providing pre-built, generic binaries. Users can leverage a wide variety of libraries and applications without knowing how to build from source. On HPC systems users typically build from source, and software, is notoriously complex. Building even a moderately sized parallel simulation code can be a major effort. Scientists who use applications codes must typically also know how to build them from scratch, along with tens or hundreds of dependency libraries. Spack is an open source package manager that handles the complexity of HPC environments and allows scientists to automatically and reproducibly install complex software stacks. It allows users to experiment with different compilers, optimizations, build options, and dependency versions, without in-depth build knowledge. Spack is built to handle the complexities of the HPC environment that seldom arise on commodity systems, such as swapping compilers and ABI-incompatible dependencies, cross-compilation, compiler runtime libraries, and optimized binaries. Spack has a rapidly growing community, with over 240 contributors at organizations worldwide. In this talk, we will introduce Spack, show how it can make scientists more productive, and give an overview of ongoing Spack projects and its development road map.","bio":"","contributors":[{"type":"Author","first_name":"Todd","last_name":"Gamblin","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Massimiliano","last_name":"Culpo","affiliation":"EPFL","country":"Switzerland","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Todd","last_name":"Gamblin","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Massimiliano","last_name":"Culpo","affiliation":"EPFL","country":"Switzerland","bio":"","order":"2","is_presenter":true}]}]}, "slot": {"id":"msa137","type":"child","title":"HPC-as-a-Service to Domain Scientists","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Applications-as-a-service operating modes have changed the computing landscape in a multi-disciplinary research laboratory both from a user\u0027s and an HPC operator\u0027s perspective. Lately, applications are being offered as web-based user interfaces regardless of the actual location of the computation. To this end, the cloud computing revolution has had a wonderful side effect that everybody can now easily accept that certain tasks are transparently performed elsewhere - this talk will give an example from the bioinformatics application domain showing how cloud resources can be used for DNA sequencing. As such more and more HPC centers offer web-portals to access their systems along with applications developers also offering a web-based front-end so that the \u0022obscure green font on black screen magic\u0022 of a typical SSH session is hidden from the end user. This enables both new groups to use HPC systems but also provides users a more error-proof and efficient way of using installed applications. This talk will highlight the criticality of an application-as-a service mode and will also discuss how Docker containers and Jupyter notebooks can be used for this easy-to-use application-as-a-service notion. The parallel benchmark suite from SPEC HPG organization will be used for demo purposes.","filename":"msa137s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Sunita","last_name":"Chandrasekaran","affiliation":"University of Delaware","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Sunita","last_name":"Chandrasekaran","affiliation":"University of Delaware","country":"United States of America","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Sunita","last_name":"Chandrasekaran","affiliation":"University of Delaware","country":"United States of America","bio":"","order":"1","is_presenter":true}] } Presentation
16:30 - 17:00
Using Jetstream and High Performance Remote Research Desktops to Lower the Barrier of Entry for HPC Resources
, Robert Henschel (Indiana University, United States of America)
+ Abstract { "session": {"id":"sess169","title":"MS12 - Engineering Scientific Software in times of Agile Development, Continuous Integration and Cloud Computing","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Sydney Room","contributors":[{"type":"Session Chair","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics"],"slots":[{"id":"symp128","type":"minisymposia","title":"MS12 - Engineering Scientific Software in times of Agile Development, Continuous Integration and Cloud Computing","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"The dawn of Web 2.0 applications, smartphones\/tablets and the omnipresent yet invisible cloud computing have dramatically changed our perception of the IT landscape in the last decade. At the same time these technologies delivered an abundance of new tools that are more suited to the needs of domain scientists: GitHub\/GitLab with their inherent support for agile programming, user space package managers for easy software installations even when facing complex dependencies, and web portals to HPC systems or private clouds so that no prior knowledge is needed to use state-of-the-art compute resources. This minisymposium will showcase all these tools and how they are used in real scientific workflows. The shown best practices are easy to reproduce since they are all based on freely available software packages, so that the audience can use the presentations both as an inspiration but also as a kickstart for their own better science.","bio":"","contributors":[{"type":"Organizer","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa137","type":"child","title":"HPC-as-a-Service to Domain Scientists","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Applications-as-a-service operating modes have changed the computing landscape in a multi-disciplinary research laboratory both from a user\u0027s and an HPC operator\u0027s perspective. Lately, applications are being offered as web-based user interfaces regardless of the actual location of the computation. To this end, the cloud computing revolution has had a wonderful side effect that everybody can now easily accept that certain tasks are transparently performed elsewhere - this talk will give an example from the bioinformatics application domain showing how cloud resources can be used for DNA sequencing. As such more and more HPC centers offer web-portals to access their systems along with applications developers also offering a web-based front-end so that the \u0022obscure green font on black screen magic\u0022 of a typical SSH session is hidden from the end user. This enables both new groups to use HPC systems but also provides users a more error-proof and efficient way of using installed applications. This talk will highlight the criticality of an application-as-a service mode and will also discuss how Docker containers and Jupyter notebooks can be used for this easy-to-use application-as-a-service notion. The parallel benchmark suite from SPEC HPG organization will be used for demo purposes.","filename":"msa137s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Sunita","last_name":"Chandrasekaran","affiliation":"University of Delaware","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Sunita","last_name":"Chandrasekaran","affiliation":"University of Delaware","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa226","type":"child","title":"The Reality of Scientific Software Development is Agile - Best Practices and Lessons Learned","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The reality of the development of scientific software is often far from the clearly structured, cascading work packages that grant applications require, but rather in the spirit of agile programming: Start from a working minimal prototype, always have running code, work in sprints (typically towards the end of reporting periods). Those characteristics actually match rather well onto the concept of agile programming. This talk will explain the principles of agile software development, the existing software tool support using the free-of-charge GitLab\u00a0Libre Edition as an example, and the implementation of the processes, including team communication, continuous integration and publication of documentation.","bio":"","contributors":[{"type":"Author","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Tobias","last_name":"Frust","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Guido","last_name":"Juckeland","affiliation":"Helmholtz-Zentrum Dresden-Rossendorf","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa212","type":"child","title":"Using Jetstream and High Performance Remote Research Desktops to Lower the Barrier of Entry for HPC Resources","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Indiana University operates two environments designed to lower the barrier of entry for HPC resources. One is the NSF Jetstream project, the first NSF funded cloud designed for those who have not previously used high performance computing resources. Jetstream provides users with long running virtual machines with a customizable software stack to meet the needs of non-traditional HPC applications. The other environment is a research desktop solution that is making high performance Linux desktops available remotely. The desktops contain all the normal HPC command line tools and allow for direct job submission to the HPC machines, but also provide access to interactive applications like Matlab, Comsol Multiphysics, R-Studio and Jupyter. The goal of both projects is to lower the barrier of entry and broaden adoption of traditional HPC and high-throughput computing environments. The talk will provide an architectural overview, use cases and experiences for operating such environments.","filename":"msa212s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Robert","last_name":"Henschel","affiliation":"Indiana University","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Dave","last_name":"Hancock","affiliation":"Indiana University","country":"United States of America","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Robert","last_name":"Henschel","affiliation":"Indiana University","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa213","type":"child","title":"Spack: A Package Manager for Scientific Software","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"On mainstream Linux distributions, package managers simplify the software installation process by providing pre-built, generic binaries. Users can leverage a wide variety of libraries and applications without knowing how to build from source. On HPC systems users typically build from source, and software, is notoriously complex. Building even a moderately sized parallel simulation code can be a major effort. Scientists who use applications codes must typically also know how to build them from scratch, along with tens or hundreds of dependency libraries. Spack is an open source package manager that handles the complexity of HPC environments and allows scientists to automatically and reproducibly install complex software stacks. It allows users to experiment with different compilers, optimizations, build options, and dependency versions, without in-depth build knowledge. Spack is built to handle the complexities of the HPC environment that seldom arise on commodity systems, such as swapping compilers and ABI-incompatible dependencies, cross-compilation, compiler runtime libraries, and optimized binaries. Spack has a rapidly growing community, with over 240 contributors at organizations worldwide. In this talk, we will introduce Spack, show how it can make scientists more productive, and give an overview of ongoing Spack projects and its development road map.","bio":"","contributors":[{"type":"Author","first_name":"Todd","last_name":"Gamblin","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Massimiliano","last_name":"Culpo","affiliation":"EPFL","country":"Switzerland","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Todd","last_name":"Gamblin","affiliation":"Lawrence Livermore National Laboratory","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Massimiliano","last_name":"Culpo","affiliation":"EPFL","country":"Switzerland","bio":"","order":"2","is_presenter":true}]}]}, "slot": {"id":"msa212","type":"child","title":"Using Jetstream and High Performance Remote Research Desktops to Lower the Barrier of Entry for HPC Resources","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Indiana University operates two environments designed to lower the barrier of entry for HPC resources. One is the NSF Jetstream project, the first NSF funded cloud designed for those who have not previously used high performance computing resources. Jetstream provides users with long running virtual machines with a customizable software stack to meet the needs of non-traditional HPC applications. The other environment is a research desktop solution that is making high performance Linux desktops available remotely. The desktops contain all the normal HPC command line tools and allow for direct job submission to the HPC machines, but also provide access to interactive applications like Matlab, Comsol Multiphysics, R-Studio and Jupyter. The goal of both projects is to lower the barrier of entry and broaden adoption of traditional HPC and high-throughput computing environments. The talk will provide an architectural overview, use cases and experiences for operating such environments.","filename":"msa212s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Robert","last_name":"Henschel","affiliation":"Indiana University","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Dave","last_name":"Hancock","affiliation":"Indiana University","country":"United States of America","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Robert","last_name":"Henschel","affiliation":"Indiana University","country":"United States of America","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Robert","last_name":"Henschel","affiliation":"Indiana University","country":"United States of America","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Dave","last_name":"Hancock","affiliation":"Indiana University","country":"United States of America","bio":"","order":"2","is_presenter":false}] } Presentation
Organizer(s):
Sofia Vallecorsa (CERN, Switzerland)
, Jean-Roch Vlimant (California Institute of Technology, United States of America)
, Michela Paganini (Yale University, United States of America)
Track(s):
Computer Science and Applied Mathematics, Physics
The Large Hadron Collider at CERN is smashing high density bunches of protons near the speed of light at a frequency of 40 MHz. Most of the thousands of particles emitted at each bunch crossing are measured and collected with building-sized detectors consisting of multiple sub-detectors each serving its own purpose. The simulation of the signal created by a particle interacting with such a detector is typically done with very detailed simulations and needs to be stepped infinitesimally over meters of material. This simulation is as much computing intensive as the geometry is complex. In current and future detector design, the fine-grained simulation of such a detector is taking a great part of the full computing budget of experiments and poses a computing challenge. While a great deal of effort is being made to parallelise such software, one possible avenue to reduce the computational requirements is with generative models from the field of deep learning. Such generative models have seen success in conditionally generating images and video of various types. We present how such models are built and trained, and how well they can capture the physics of particle interaction and help generate realistic samples for high energy physics analysis.
15:30 - 16:00
The Success of Deep Generative Models
, Jakub Tomczak (University of Amsterdam, Netherlands)
+ Abstract { "session": {"id":"sess176","title":"MS13 - Generative Models and Density Estimator for High Energy Physics","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Osaka Room","contributors":[{"type":"Session Chair","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Computer Science and Applied Mathematics","Physics"],"slots":[{"id":"symp144","type":"minisymposia","title":"MS13 - Generative Models and Density Estimator for High Energy Physics","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"The Large Hadron Collider at CERN is smashing high density bunches of protons near the speed of light at a frequency of 40 MHz. Most of the thousands of particles emitted at each bunch crossing are measured and collected with building-sized detectors consisting of multiple sub-detectors each serving its own purpose. The simulation of the signal created by a particle interacting with such a detector is typically done with very detailed simulations and needs to be stepped infinitesimally over meters of material. This simulation is as much computing intensive as the geometry is complex. In current and future detector design, the fine-grained simulation of such a detector is taking a great part of the full computing budget of experiments and poses a computing challenge. While a great deal of effort is being made to parallelise such software, one possible avenue to reduce the computational requirements is with generative models from the field of deep learning. Such generative models have seen success in conditionally generating images and video of various types. We present how such models are built and trained, and how well they can capture the physics of particle interaction and help generate realistic samples for high energy physics analysis.","bio":"","contributors":[{"type":"Organizer","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Jean-Roch","last_name":"Vlimant","affiliation":"California Institute of Technology","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Michela","last_name":"Paganini","affiliation":"Yale University","country":"United States of America","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa291","type":"child","title":"The Success of Deep Generative Models","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Deep generative models allow us to learn hidden representations of data and generate new examples. There are two major families of models that are exploited in current applications: Generative Adversarial Networks (GANs), and Variational Auto-Encoders (VAE). The principle of GANs is to train a generator that can generate examples from random noise, in adversary of a discriminative model that is forced to confuse true samples from generated ones. Generated images by GANs are very sharp and detailed. The biggest disadvantage of GANs is that they are trained through solving a minimax optimization problem that causes significant learning instability issues. VAEs are based on a fully probabilistic perspective of the variational inference. The learning problem aims at maximizing the variational lower bound for a given family of variational posteriors. The model can be trained by backpropagation but it was noticed that the resulting generated images are rather blurry. However, VAEs are probabilistic models, thus, they could be incorporated in almost any probabilistic framework. We will discuss basics of both approaches and present recent extensions. We will point out advantages and disadvantages of GANs and VAE. Some of most promising applications of deep generative models will be shown.","filename":"msa291s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jakub","last_name":"Tomczak","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jakub","last_name":"Tomczak","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}]},{"id":"msa256","type":"child","title":"Generative Models for Application-Specific Fast Simulation of LHC Collision Events","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We investigate the possibility of using generative models (e.g., GANs and variational autoencoders) as analysis-specific data augmentation tools to increase the size of the simulation data used by the LHC experiments. With the LHC entering its high-luminosity phase in 2025, the projected computing resources will not be able to sustain the demand for simulated events. Generative models are already investigated as the mean to speed up the centralized simulation process. Here we propose to investigate a different strategy: training deep networks to generate small-dimension ntuples of numbers (physics quantities such as reconstructed particle energy and direction), learning the distribution of these quantities from a sample of simulated data. In one step, one would then be able to generate the outcome of the full processing workflow (generation + simulation + reconstruction + selection).","bio":"","contributors":[{"type":"Author","first_name":"Maurizio","last_name":"Pierini","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Dominick","last_name":"Olivito","affiliation":"UC San Diego","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Bobak","last_name":"Hashemi","affiliation":"UC San Diego","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Nick","last_name":"Amin","affiliation":"UC San Diego","country":"United States of America","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Maurizio","last_name":"Pierini","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa249","type":"child","title":"Using Generative Models for Fast Clusters Simulations in the TPC Detector for the ALICE Experiment","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Simulation of the events happening in the particle detector is a key component of many High Energy Physics experiments. Currently used Monte Carlo techniques allow to do it accurately, but their precision often comes at the expense of relatively high computational cost. In this work, we present a proof-of-concept solution for simulating clusters that occur after particle collision in the TPC detector in the ALICE Experiment at CERN. The new method we propose, dubbed ParticleGAN for simplicity, leverages recently developed Generative Adversarial Networks to learn the trajectories of particle tracks after collision. Although the quality of generated events is not even with the currently used solutions yet, ParticleGAN offer up to 10^3 speedups over the existing approaches. This applies also to other evaluated generative models namely Variational Autoencoders and variants of GANs. In this work we outline current bottlenecks of the proposed approach and discuss further steps that can allow to deploy the proposed generative models for simulation in production.","bio":"","contributors":[{"type":"Author","first_name":"Kamil","last_name":"Deja","affiliation":"Warsaw University of Technology","country":"Poland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Tomasz","last_name":"Trzci\u0144ski","affiliation":"Warsaw University of Technology","country":"Poland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"\u0141ukasz","last_name":"Graczykowski","affiliation":"Warsaw University of Technology","country":"Poland","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Kamil","last_name":"Deja","affiliation":"Warsaw University of Technology","country":"Poland","bio":"","order":"1","is_presenter":true}]},{"id":"msa251","type":"child","title":"Generative Models for Simulating Highly Granular Calorimeters","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Machine Learning techniques have been used in different applications by the HEP community: in this talk, we discuss the case of detector simulation. The need for simulated events, expected in future High Luminosity LHC experiments is increasing dramatically and requires new fast simulation solutions. We will describe R\u0026amp;D activities, aimed at reproducing the detector response and replace standard Monte Carlo simulation with generative models, typically used in computer vision applications. Two common aspects characterize many of these applications: the representation of input data as regular arrays of numerical values and the use of raw data as the input information to feed the network. Next generation HEP experiments are expected to be more and more characterized by detector components that could comply to this paradigm. Calorimeters of the ILC and CLIC detector concepts are effectively 3D arrays of sensors. We will introduce the first application of three-dimensional convolutional Generative Adversarial Networks and of Variational Auto Econders to the simulation of highly granular calorimeters.\u00a0Finally we will present detailed validation studies comparing results to Monte Carlo simulation, showing the very good agreement we obtain for high level physics quantities and calorimeter response.","filename":"msa251s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tobias","last_name":"Golling","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Dalila","last_name":"Salamani","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Federico","last_name":"Carminati","affiliation":"CERN","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Gul Rukh","last_name":"Khattak","affiliation":"CERN","country":"Switzerland","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tobias","last_name":"Golling","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa291","type":"child","title":"The Success of Deep Generative Models","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Deep generative models allow us to learn hidden representations of data and generate new examples. There are two major families of models that are exploited in current applications: Generative Adversarial Networks (GANs), and Variational Auto-Encoders (VAE). The principle of GANs is to train a generator that can generate examples from random noise, in adversary of a discriminative model that is forced to confuse true samples from generated ones. Generated images by GANs are very sharp and detailed. The biggest disadvantage of GANs is that they are trained through solving a minimax optimization problem that causes significant learning instability issues. VAEs are based on a fully probabilistic perspective of the variational inference. The learning problem aims at maximizing the variational lower bound for a given family of variational posteriors. The model can be trained by backpropagation but it was noticed that the resulting generated images are rather blurry. However, VAEs are probabilistic models, thus, they could be incorporated in almost any probabilistic framework. We will discuss basics of both approaches and present recent extensions. We will point out advantages and disadvantages of GANs and VAE. Some of most promising applications of deep generative models will be shown.","filename":"msa291s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jakub","last_name":"Tomczak","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jakub","last_name":"Tomczak","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Jakub","last_name":"Tomczak","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}] } Presentation
17:00 - 17:30
Generative Models for Simulating Highly Granular Calorimeters
, Tobias Golling (University of Geneva, Switzerland)
+ Abstract { "session": {"id":"sess176","title":"MS13 - Generative Models and Density Estimator for High Energy Physics","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Osaka Room","contributors":[{"type":"Session Chair","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Computer Science and Applied Mathematics","Physics"],"slots":[{"id":"symp144","type":"minisymposia","title":"MS13 - Generative Models and Density Estimator for High Energy Physics","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"The Large Hadron Collider at CERN is smashing high density bunches of protons near the speed of light at a frequency of 40 MHz. Most of the thousands of particles emitted at each bunch crossing are measured and collected with building-sized detectors consisting of multiple sub-detectors each serving its own purpose. The simulation of the signal created by a particle interacting with such a detector is typically done with very detailed simulations and needs to be stepped infinitesimally over meters of material. This simulation is as much computing intensive as the geometry is complex. In current and future detector design, the fine-grained simulation of such a detector is taking a great part of the full computing budget of experiments and poses a computing challenge. While a great deal of effort is being made to parallelise such software, one possible avenue to reduce the computational requirements is with generative models from the field of deep learning. Such generative models have seen success in conditionally generating images and video of various types. We present how such models are built and trained, and how well they can capture the physics of particle interaction and help generate realistic samples for high energy physics analysis.","bio":"","contributors":[{"type":"Organizer","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Jean-Roch","last_name":"Vlimant","affiliation":"California Institute of Technology","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Organizer","first_name":"Michela","last_name":"Paganini","affiliation":"Yale University","country":"United States of America","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa291","type":"child","title":"The Success of Deep Generative Models","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Deep generative models allow us to learn hidden representations of data and generate new examples. There are two major families of models that are exploited in current applications: Generative Adversarial Networks (GANs), and Variational Auto-Encoders (VAE). The principle of GANs is to train a generator that can generate examples from random noise, in adversary of a discriminative model that is forced to confuse true samples from generated ones. Generated images by GANs are very sharp and detailed. The biggest disadvantage of GANs is that they are trained through solving a minimax optimization problem that causes significant learning instability issues. VAEs are based on a fully probabilistic perspective of the variational inference. The learning problem aims at maximizing the variational lower bound for a given family of variational posteriors. The model can be trained by backpropagation but it was noticed that the resulting generated images are rather blurry. However, VAEs are probabilistic models, thus, they could be incorporated in almost any probabilistic framework. We will discuss basics of both approaches and present recent extensions. We will point out advantages and disadvantages of GANs and VAE. Some of most promising applications of deep generative models will be shown.","filename":"msa291s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jakub","last_name":"Tomczak","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jakub","last_name":"Tomczak","affiliation":"University of Amsterdam","country":"Netherlands","bio":"","order":"1","is_presenter":true}]},{"id":"msa256","type":"child","title":"Generative Models for Application-Specific Fast Simulation of LHC Collision Events","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We investigate the possibility of using generative models (e.g., GANs and variational autoencoders) as analysis-specific data augmentation tools to increase the size of the simulation data used by the LHC experiments. With the LHC entering its high-luminosity phase in 2025, the projected computing resources will not be able to sustain the demand for simulated events. Generative models are already investigated as the mean to speed up the centralized simulation process. Here we propose to investigate a different strategy: training deep networks to generate small-dimension ntuples of numbers (physics quantities such as reconstructed particle energy and direction), learning the distribution of these quantities from a sample of simulated data. In one step, one would then be able to generate the outcome of the full processing workflow (generation + simulation + reconstruction + selection).","bio":"","contributors":[{"type":"Author","first_name":"Maurizio","last_name":"Pierini","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Dominick","last_name":"Olivito","affiliation":"UC San Diego","country":"United States of America","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Bobak","last_name":"Hashemi","affiliation":"UC San Diego","country":"United States of America","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Nick","last_name":"Amin","affiliation":"UC San Diego","country":"United States of America","bio":"","order":"4","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Maurizio","last_name":"Pierini","affiliation":"CERN","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa249","type":"child","title":"Using Generative Models for Fast Clusters Simulations in the TPC Detector for the ALICE Experiment","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Simulation of the events happening in the particle detector is a key component of many High Energy Physics experiments. Currently used Monte Carlo techniques allow to do it accurately, but their precision often comes at the expense of relatively high computational cost. In this work, we present a proof-of-concept solution for simulating clusters that occur after particle collision in the TPC detector in the ALICE Experiment at CERN. The new method we propose, dubbed ParticleGAN for simplicity, leverages recently developed Generative Adversarial Networks to learn the trajectories of particle tracks after collision. Although the quality of generated events is not even with the currently used solutions yet, ParticleGAN offer up to 10^3 speedups over the existing approaches. This applies also to other evaluated generative models namely Variational Autoencoders and variants of GANs. In this work we outline current bottlenecks of the proposed approach and discuss further steps that can allow to deploy the proposed generative models for simulation in production.","bio":"","contributors":[{"type":"Author","first_name":"Kamil","last_name":"Deja","affiliation":"Warsaw University of Technology","country":"Poland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Tomasz","last_name":"Trzci\u0144ski","affiliation":"Warsaw University of Technology","country":"Poland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"\u0141ukasz","last_name":"Graczykowski","affiliation":"Warsaw University of Technology","country":"Poland","bio":"","order":"3","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Kamil","last_name":"Deja","affiliation":"Warsaw University of Technology","country":"Poland","bio":"","order":"1","is_presenter":true}]},{"id":"msa251","type":"child","title":"Generative Models for Simulating Highly Granular Calorimeters","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Machine Learning techniques have been used in different applications by the HEP community: in this talk, we discuss the case of detector simulation. The need for simulated events, expected in future High Luminosity LHC experiments is increasing dramatically and requires new fast simulation solutions. We will describe R\u0026amp;D activities, aimed at reproducing the detector response and replace standard Monte Carlo simulation with generative models, typically used in computer vision applications. Two common aspects characterize many of these applications: the representation of input data as regular arrays of numerical values and the use of raw data as the input information to feed the network. Next generation HEP experiments are expected to be more and more characterized by detector components that could comply to this paradigm. Calorimeters of the ILC and CLIC detector concepts are effectively 3D arrays of sensors. We will introduce the first application of three-dimensional convolutional Generative Adversarial Networks and of Variational Auto Econders to the simulation of highly granular calorimeters.\u00a0Finally we will present detailed validation studies comparing results to Monte Carlo simulation, showing the very good agreement we obtain for high level physics quantities and calorimeter response.","filename":"msa251s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tobias","last_name":"Golling","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Dalila","last_name":"Salamani","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Federico","last_name":"Carminati","affiliation":"CERN","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Gul Rukh","last_name":"Khattak","affiliation":"CERN","country":"Switzerland","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tobias","last_name":"Golling","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa251","type":"child","title":"Generative Models for Simulating Highly Granular Calorimeters","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Machine Learning techniques have been used in different applications by the HEP community: in this talk, we discuss the case of detector simulation. The need for simulated events, expected in future High Luminosity LHC experiments is increasing dramatically and requires new fast simulation solutions. We will describe R\u0026amp;D activities, aimed at reproducing the detector response and replace standard Monte Carlo simulation with generative models, typically used in computer vision applications. Two common aspects characterize many of these applications: the representation of input data as regular arrays of numerical values and the use of raw data as the input information to feed the network. Next generation HEP experiments are expected to be more and more characterized by detector components that could comply to this paradigm. Calorimeters of the ILC and CLIC detector concepts are effectively 3D arrays of sensors. We will introduce the first application of three-dimensional convolutional Generative Adversarial Networks and of Variational Auto Econders to the simulation of highly granular calorimeters.\u00a0Finally we will present detailed validation studies comparing results to Monte Carlo simulation, showing the very good agreement we obtain for high level physics quantities and calorimeter response.","filename":"msa251s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Tobias","last_name":"Golling","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Dalila","last_name":"Salamani","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Federico","last_name":"Carminati","affiliation":"CERN","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Gul Rukh","last_name":"Khattak","affiliation":"CERN","country":"Switzerland","bio":"","order":"5","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Tobias","last_name":"Golling","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Tobias","last_name":"Golling","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Author","first_name":"Sofia","last_name":"Vallecorsa","affiliation":"CERN","country":"Switzerland","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Dalila","last_name":"Salamani","affiliation":"University of Geneva","country":"Switzerland","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"Federico","last_name":"Carminati","affiliation":"CERN","country":"Switzerland","bio":"","order":"4","is_presenter":false},{"type":"Author","first_name":"Gul Rukh","last_name":"Khattak","affiliation":"CERN","country":"Switzerland","bio":"","order":"5","is_presenter":false}] } Presentation
Organizer(s):
Jean-Michel Benkert (Baloise Group, Switzerland)
, Michelle Allgöwer (Baloise Group, Switzerland)
Track(s):
Emerging Application Domains, Computer Science and Applied Mathematics
In the past couple years a large number of fintech and more recently insurtech startups have been founded and are challenging established players in the financial services sector. At Baloise – a Swiss company providing insurance services in Switzerland, Belgium, Germany and Luxembourg as well as banking services in Switzerland – we view startups as potential partners on our digital transformation journey rather than competition. This minisymposium aims to demonstrate what problems companies such as Baloise face in terms of digitizing their business and making use of their large amounts of data. To do so, the four sessions will cover the innovation framework Baloise employs in order to rapidly test prototypes, a presentation by Brainalyzed, a startup which aims to optimize and automatize investment decisions at Baloise using AI, a presentation about the challenges arising in the context of data warehouses and legacy systems, and a panel discussion with all the speakers.
15:30 - 16:00
Open Innovation at Baloise
, Jean-Michel Benkert (Baloise Group, Switzerland)
+ Abstract { "session": {"id":"sess179","title":"MS14 - How Fintech and Big Data Change and Challenge the Insurance Sector","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Nairobi Room","contributors":[{"type":"Session Chair","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics"],"slots":[{"id":"symp120","type":"minisymposia","title":"MS14 - How Fintech and Big Data Change and Challenge the Insurance Sector","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"In the past couple years a large number of fintech and more recently insurtech startups have been founded and are challenging established players in the financial services sector. At Baloise \u2013 a Swiss company providing insurance services in Switzerland, Belgium, Germany and Luxembourg as well as banking services in Switzerland \u2013 we view startups as potential partners on our digital transformation journey rather than competition. This minisymposium aims to demonstrate what problems companies such as Baloise face in terms of digitizing their business and making use of their large amounts of data. To do so, the four sessions will cover the innovation framework Baloise employs in order to rapidly test prototypes, a presentation by Brainalyzed, a startup which aims to optimize and automatize investment decisions at Baloise using AI, a presentation about the challenges arising in the context of data warehouses and legacy systems, and a panel discussion with all the speakers.","bio":"","contributors":[{"type":"Organizer","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Michelle","last_name":"Allg\u00f6wer","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa194","type":"child","title":"Open Innovation at Baloise","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In recent years a large number of fintech and more recently insurtech startups have been founded and are challenging established players in the financial services sector.\u00a0 At Baloise \u2013 a Swiss company providing insurance services in Switzerland, Belgium, Germany and Luxembourg as well as banking services in Switzerland \u2013 we view startups as potential partners on our digital transformation journey rather than competition. Baloise has developed an open innovation framework with the goal of enabling easy and fast cooperation with startups and other external partners as well as intrapreneurs. In this session we will present this open innovation framework and its evolution over time as we have tailored it to the requirements of startups.","filename":"msa194s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa116","type":"child","title":"Artificial Intelligence for Automated Investment Management","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Due to increasing digitization in all sectors, the amount of available data is almost unlimited. The challenge is not only to manage this data, but to make it usable. Therefore, data analysis becomes a key success factor for organizations. Especially in the financial sector, data-driven applications are necessary to keep up with the fast-moving financial market and growing competition. The answer to low interest rates and high volatility in the market are automated data-driven investment processes. Data analysis using artificial intelligence (AI) is therefore becoming increasingly important. In this session we will give insights and some practical examples how we worked together with Baloise Asset Management to use some of their data to enhance the investment management process. We will show how the scalability of the learning solution helps to analyze even very complex problems in a short time and what our vision of AI in the financial world looks like.","filename":"msa116s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa169","type":"child","title":"The Challenges of Big Data for a Traditional Insurance Company","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"With a company history of over 150 years, our IT landscape has grown in a highly fragmented way and consists of numerous legacy systems which have evolved over the last couple of decades covering a wide range of computer languages. Therefore a greenfield approach in terms of big data is out of question and the integration of data originating from these systems represents a costly and time-consuming challenge for Baloise. Securing the availability of internal data on one side and meeting the fast growing business requirements in connection with external (big) data integration on the other side is the balancing act of our digital transformation in the domain of business intelligence. How Baloise tackles these challenges and how the company benefits from cooperation with startups using artificial intelligence to boost this transformation will be explained in this session of the minisymposium. In the second part of this session an insight into projected use cases will be delivered to illuminate Baloise\u0027s strategic approaches related to machine learning and big data.","filename":"msa169s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Christoph","last_name":"Geering","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}]},{"id":"msa195","type":"child","title":"Panel Discussion on How Fintech and Big Data Change and Challenge the Insurance Sector","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Join us for a panel discussion on how fintech and big data change and challenge the insurance sector. The panelists are Dr. Gunter Fischer from Brainalyzed, an AI startup in the fintech space, Christoph Geering, responsible for business intelligence at Baloise Switzerland, and Dr. Jean-Michel Benkert, Innovation Manager at Baloise Group.","bio":"","contributors":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa194","type":"child","title":"Open Innovation at Baloise","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In recent years a large number of fintech and more recently insurtech startups have been founded and are challenging established players in the financial services sector.\u00a0 At Baloise \u2013 a Swiss company providing insurance services in Switzerland, Belgium, Germany and Luxembourg as well as banking services in Switzerland \u2013 we view startups as potential partners on our digital transformation journey rather than competition. Baloise has developed an open innovation framework with the goal of enabling easy and fast cooperation with startups and other external partners as well as intrapreneurs. In this session we will present this open innovation framework and its evolution over time as we have tailored it to the requirements of startups.","filename":"msa194s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}] } Presentation
16:00 - 16:30
Artificial Intelligence for Automated Investment Management
, Gunter Fischer (Brainalyzed, Germany)
+ Abstract { "session": {"id":"sess179","title":"MS14 - How Fintech and Big Data Change and Challenge the Insurance Sector","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Nairobi Room","contributors":[{"type":"Session Chair","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics"],"slots":[{"id":"symp120","type":"minisymposia","title":"MS14 - How Fintech and Big Data Change and Challenge the Insurance Sector","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"In the past couple years a large number of fintech and more recently insurtech startups have been founded and are challenging established players in the financial services sector. At Baloise \u2013 a Swiss company providing insurance services in Switzerland, Belgium, Germany and Luxembourg as well as banking services in Switzerland \u2013 we view startups as potential partners on our digital transformation journey rather than competition. This minisymposium aims to demonstrate what problems companies such as Baloise face in terms of digitizing their business and making use of their large amounts of data. To do so, the four sessions will cover the innovation framework Baloise employs in order to rapidly test prototypes, a presentation by Brainalyzed, a startup which aims to optimize and automatize investment decisions at Baloise using AI, a presentation about the challenges arising in the context of data warehouses and legacy systems, and a panel discussion with all the speakers.","bio":"","contributors":[{"type":"Organizer","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Michelle","last_name":"Allg\u00f6wer","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa194","type":"child","title":"Open Innovation at Baloise","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In recent years a large number of fintech and more recently insurtech startups have been founded and are challenging established players in the financial services sector.\u00a0 At Baloise \u2013 a Swiss company providing insurance services in Switzerland, Belgium, Germany and Luxembourg as well as banking services in Switzerland \u2013 we view startups as potential partners on our digital transformation journey rather than competition. Baloise has developed an open innovation framework with the goal of enabling easy and fast cooperation with startups and other external partners as well as intrapreneurs. In this session we will present this open innovation framework and its evolution over time as we have tailored it to the requirements of startups.","filename":"msa194s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa116","type":"child","title":"Artificial Intelligence for Automated Investment Management","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Due to increasing digitization in all sectors, the amount of available data is almost unlimited. The challenge is not only to manage this data, but to make it usable. Therefore, data analysis becomes a key success factor for organizations. Especially in the financial sector, data-driven applications are necessary to keep up with the fast-moving financial market and growing competition. The answer to low interest rates and high volatility in the market are automated data-driven investment processes. Data analysis using artificial intelligence (AI) is therefore becoming increasingly important. In this session we will give insights and some practical examples how we worked together with Baloise Asset Management to use some of their data to enhance the investment management process. We will show how the scalability of the learning solution helps to analyze even very complex problems in a short time and what our vision of AI in the financial world looks like.","filename":"msa116s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa169","type":"child","title":"The Challenges of Big Data for a Traditional Insurance Company","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"With a company history of over 150 years, our IT landscape has grown in a highly fragmented way and consists of numerous legacy systems which have evolved over the last couple of decades covering a wide range of computer languages. Therefore a greenfield approach in terms of big data is out of question and the integration of data originating from these systems represents a costly and time-consuming challenge for Baloise. Securing the availability of internal data on one side and meeting the fast growing business requirements in connection with external (big) data integration on the other side is the balancing act of our digital transformation in the domain of business intelligence. How Baloise tackles these challenges and how the company benefits from cooperation with startups using artificial intelligence to boost this transformation will be explained in this session of the minisymposium. In the second part of this session an insight into projected use cases will be delivered to illuminate Baloise\u0027s strategic approaches related to machine learning and big data.","filename":"msa169s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Christoph","last_name":"Geering","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}]},{"id":"msa195","type":"child","title":"Panel Discussion on How Fintech and Big Data Change and Challenge the Insurance Sector","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Join us for a panel discussion on how fintech and big data change and challenge the insurance sector. The panelists are Dr. Gunter Fischer from Brainalyzed, an AI startup in the fintech space, Christoph Geering, responsible for business intelligence at Baloise Switzerland, and Dr. Jean-Michel Benkert, Innovation Manager at Baloise Group.","bio":"","contributors":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa116","type":"child","title":"Artificial Intelligence for Automated Investment Management","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Due to increasing digitization in all sectors, the amount of available data is almost unlimited. The challenge is not only to manage this data, but to make it usable. Therefore, data analysis becomes a key success factor for organizations. Especially in the financial sector, data-driven applications are necessary to keep up with the fast-moving financial market and growing competition. The answer to low interest rates and high volatility in the market are automated data-driven investment processes. Data analysis using artificial intelligence (AI) is therefore becoming increasingly important. In this session we will give insights and some practical examples how we worked together with Baloise Asset Management to use some of their data to enhance the investment management process. We will show how the scalability of the learning solution helps to analyze even very complex problems in a short time and what our vision of AI in the financial world looks like.","filename":"msa116s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}] } Presentation
16:30 - 17:00
The Challenges of Big Data for a Traditional Insurance Company
, Klaus Rieger (Baloise Group, Switzerland)
+ Abstract { "session": {"id":"sess179","title":"MS14 - How Fintech and Big Data Change and Challenge the Insurance Sector","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Nairobi Room","contributors":[{"type":"Session Chair","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics"],"slots":[{"id":"symp120","type":"minisymposia","title":"MS14 - How Fintech and Big Data Change and Challenge the Insurance Sector","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"In the past couple years a large number of fintech and more recently insurtech startups have been founded and are challenging established players in the financial services sector. At Baloise \u2013 a Swiss company providing insurance services in Switzerland, Belgium, Germany and Luxembourg as well as banking services in Switzerland \u2013 we view startups as potential partners on our digital transformation journey rather than competition. This minisymposium aims to demonstrate what problems companies such as Baloise face in terms of digitizing their business and making use of their large amounts of data. To do so, the four sessions will cover the innovation framework Baloise employs in order to rapidly test prototypes, a presentation by Brainalyzed, a startup which aims to optimize and automatize investment decisions at Baloise using AI, a presentation about the challenges arising in the context of data warehouses and legacy systems, and a panel discussion with all the speakers.","bio":"","contributors":[{"type":"Organizer","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true},{"type":"Organizer","first_name":"Michelle","last_name":"Allg\u00f6wer","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":false}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa194","type":"child","title":"Open Innovation at Baloise","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"In recent years a large number of fintech and more recently insurtech startups have been founded and are challenging established players in the financial services sector.\u00a0 At Baloise \u2013 a Swiss company providing insurance services in Switzerland, Belgium, Germany and Luxembourg as well as banking services in Switzerland \u2013 we view startups as potential partners on our digital transformation journey rather than competition. Baloise has developed an open innovation framework with the goal of enabling easy and fast cooperation with startups and other external partners as well as intrapreneurs. In this session we will present this open innovation framework and its evolution over time as we have tailored it to the requirements of startups.","filename":"msa194s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa116","type":"child","title":"Artificial Intelligence for Automated Investment Management","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Due to increasing digitization in all sectors, the amount of available data is almost unlimited. The challenge is not only to manage this data, but to make it usable. Therefore, data analysis becomes a key success factor for organizations. Especially in the financial sector, data-driven applications are necessary to keep up with the fast-moving financial market and growing competition. The answer to low interest rates and high volatility in the market are automated data-driven investment processes. Data analysis using artificial intelligence (AI) is therefore becoming increasingly important. In this session we will give insights and some practical examples how we worked together with Baloise Asset Management to use some of their data to enhance the investment management process. We will show how the scalability of the learning solution helps to analyze even very complex problems in a short time and what our vision of AI in the financial world looks like.","filename":"msa116s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Gunter","last_name":"Fischer","affiliation":"Brainalyzed","country":"Germany","bio":"","order":"1","is_presenter":true}]},{"id":"msa169","type":"child","title":"The Challenges of Big Data for a Traditional Insurance Company","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"With a company history of over 150 years, our IT landscape has grown in a highly fragmented way and consists of numerous legacy systems which have evolved over the last couple of decades covering a wide range of computer languages. Therefore a greenfield approach in terms of big data is out of question and the integration of data originating from these systems represents a costly and time-consuming challenge for Baloise. Securing the availability of internal data on one side and meeting the fast growing business requirements in connection with external (big) data integration on the other side is the balancing act of our digital transformation in the domain of business intelligence. How Baloise tackles these challenges and how the company benefits from cooperation with startups using artificial intelligence to boost this transformation will be explained in this session of the minisymposium. In the second part of this session an insight into projected use cases will be delivered to illuminate Baloise\u0027s strategic approaches related to machine learning and big data.","filename":"msa169s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Christoph","last_name":"Geering","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}]},{"id":"msa195","type":"child","title":"Panel Discussion on How Fintech and Big Data Change and Challenge the Insurance Sector","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Join us for a panel discussion on how fintech and big data change and challenge the insurance sector. The panelists are Dr. Gunter Fischer from Brainalyzed, an AI startup in the fintech space, Christoph Geering, responsible for business intelligence at Baloise Switzerland, and Dr. Jean-Michel Benkert, Innovation Manager at Baloise Group.","bio":"","contributors":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Jean-Michel","last_name":"Benkert","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa169","type":"child","title":"The Challenges of Big Data for a Traditional Insurance Company","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"With a company history of over 150 years, our IT landscape has grown in a highly fragmented way and consists of numerous legacy systems which have evolved over the last couple of decades covering a wide range of computer languages. Therefore a greenfield approach in terms of big data is out of question and the integration of data originating from these systems represents a costly and time-consuming challenge for Baloise. Securing the availability of internal data on one side and meeting the fast growing business requirements in connection with external (big) data integration on the other side is the balancing act of our digital transformation in the domain of business intelligence. How Baloise tackles these challenges and how the company benefits from cooperation with startups using artificial intelligence to boost this transformation will be explained in this session of the minisymposium. In the second part of this session an insight into projected use cases will be delivered to illuminate Baloise\u0027s strategic approaches related to machine learning and big data.","filename":"msa169s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Christoph","last_name":"Geering","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Christoph","last_name":"Geering","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Klaus","last_name":"Rieger","affiliation":"Baloise Group","country":"Switzerland","bio":"","order":"2","is_presenter":true}] } Presentation
Organizer(s):
Roland Lindh (Uppsala University, Sweden)
Track(s):
Emerging Application Domains, Computer Science and Applied Mathematics, Chemistry and Materials, Physics
Machine Learning is right now a booming field of computer science which finds applications in the development of computer-human interfaces, in the analysis of medical data of huge populations, in the maintenance of cars, planes and elevators, and self-driving cars, to mention a few. During the last twenty years the field has gone through a development and refinement which has been spectacular. For some reason the use of the technology in pure science has been lagging behind; however, we are now starting to see the use of machine learning in the field of quantum chemistry. Here, the approach will enhance the performance of standard quantum chemical calculations – improving convergence, could serve as a tool for post-analysis of huge sets of ab initio results, or could simply replace computationally expensive procedures. Machine learning offers practical alternatives where standard quantum chemical simulations would be prohibitive. During the last few years a small number of quantum chemistry groups have explored the potential of machine learning – the results have been extraordinary and spectacular. Here in this minisymposium we would like to inspire by presenting four different applications in which machine learning is fundamental to success.
15:30 - 16:00
Quantum Machine Learning in Chemical Compound Space
, Anders S. Christensen (University of Basel, Switzerland)
+ Abstract { "session": {"id":"sess186","title":"MS15 - Machine Learning and Quantum Chemistry","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Boston 3 Room","contributors":[{"type":"Session Chair","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics","Chemistry and Materials","Physics"],"slots":[{"id":"symp135","type":"minisymposia","title":"MS15 - Machine Learning and Quantum Chemistry","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Machine Learning is right now a booming field of computer science which finds applications in the development of computer-human interfaces, in the analysis of medical data of huge populations, in the maintenance of cars, planes and elevators, and self-driving cars, to mention a few. During the last twenty years the field has gone through a development and refinement which has been spectacular. For some reason the use of the technology in pure science has been lagging behind; however, we are now starting to see the use of machine learning in the field of quantum chemistry. Here, the approach will enhance the performance of standard quantum chemical calculations \u2013 improving convergence, could serve as a tool for post-analysis of huge sets of \u003Cem\u003Eab initio\u003C\/em\u003E results, or could simply replace computationally expensive procedures. Machine learning offers practical alternatives where standard quantum chemical simulations would be prohibitive. During the last few years a small number of quantum chemistry groups have explored the potential of machine learning \u2013 the results have been extraordinary and spectacular. Here in this minisymposium we would like to inspire by presenting four different applications in which machine learning is fundamental to success.","bio":"","contributors":[{"type":"Organizer","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":"Sweden","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":"Sweden","bio":"","order":"1","is_presenter":true}]},{"id":"msa199","type":"child","title":"Quantum Machine Learning in Chemical Compound Space","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Many of the most relevant chemical properties of matter depend explicitly on atomistic and electronic details, rendering a first principles approach to chemistry mandatory. Alas, even when using high-performance computers, brute force high-throughput screening of compounds is beyond any capacity for all but the simplest systems and properties due to the combinatorial nature of chemical space, i.e. all compositional, constitutional, and conformational isomers. Consequently, efficient exploration algorithms need to exploit all implicit redundancies present in chemical space. I will discuss recently developed statistical learning approaches for interpolating quantum mechanical observables in compositional and constitutional space. Results for our models indicate remarkable performance in terms of accuracy, speed, universality, and size scalability.","filename":"msa199s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa269","type":"child","title":"Neural Networks Learning Quantum Chemistry","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The development of accurate and transferable machine learning (ML) potentials for predicting molecular energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this talk, we will present a fully transferable deep learning potential that is applicable to complex and diverse molecular systems well beyond the training dataset. Recently we introduced ANAKIN-ME (Accurate NeurAl networK engINe for Molecular Energies) or ANI in short. [doi: 10.1039\/C6SC05720A] ANI is a new \u003Cem\u003Emethod and sampling procedure\u003C\/em\u003E for training NNPs that utilizes a special kind of symmetry functions to build single-atom atomic environment vectors (AEV) as a molecular representation. To train ANI potential we use fully automated approach for the generation of datasets.[arXiv:1801.09319] It is based on the concept of active learning (AL). We show the use of our proposed AL technique develops a universal ANI potential, which provides very accurate energy and force predictions on the entire COMP6 benchmark. This universal potential achieves a level of accuracy on par with the best ML potentials for single molecule or materials while remaining applicable to the general class of organic molecules comprised of the elements CHNOSFCl.","filename":"msa269s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa237","type":"child","title":"Neural Network Representations of Non-Equilibrium Potential Energy Surfaces Sampled in Virtual Reality","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"I will outline recent developments in our group aimed at developing efficient potential energy surface (PES) representations of molecular geometries which are far from equilibrium. Recent progress developing a framework for interactive molecular dynamics in a multi-user virtual reality environment (combining rigorous cloud-mounted physical atomistic simulation with commodity virtual reality hardware) enables us to visualize and sample, with atomic-level precision, the structures and dynamics of complex molecular structures \u0027on the fly\u0027.(arXiv:1801.02884) From within this framework, we can run real-time molecular dynamics (using density functional and semi-empirical theory), accelerating the sampling of high-energy reaction pathways. Combined, these reactive pathways represent a test set of molecular geometries whose energies and forces we then calculate at higher levels (e.g., explicitly correlated local coupled cluster theory), and fit using neural networks. The resultant PES provides coupled cluster quality energies at the cost of classical force fields, enabling us to run thousands of trajectories and thereby make comparisons with experimental dynamical observables in non-equilibrium regimes. I will illustrate this coupled virtual-reality-machine-learning workflow by focusing on recent applications where we have been studying heterogeneous reaction dynamics wherein cyano radicals (CN) undergo reactive scattering at the surfaces of liquids which are composed of long chain hydrocarbons.","filename":"msa237s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Silvia","last_name":"Amabilino","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Lars","last_name":"Bratholm","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Simon","last_name":"Bennie","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}]},{"id":"msa272","type":"child","title":"Predicting the Stability of Solids with Density Functional Theory and Machine Learning","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We use a combination of machine learning techniques and high-throughput density-functional theory calculations to explore ternary compounds with the AB2C2 composition. We chose the two most common intermetallic prototypes for this composition, namely the tI10-CeAl2Ga 2 and the tP10-FeMo2B2 structures. We find that there may be \u223c10 times more stable compounds in these phases than previously known. These are mostly metallic and non-magnetic. While the use of machine learning reduces the overall calculation cost by around 75%, some limitations still exist, in particular for compounds involving the second-row of the periodic table or magnetic elements.","filename":"msa272s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa199","type":"child","title":"Quantum Machine Learning in Chemical Compound Space","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Many of the most relevant chemical properties of matter depend explicitly on atomistic and electronic details, rendering a first principles approach to chemistry mandatory. Alas, even when using high-performance computers, brute force high-throughput screening of compounds is beyond any capacity for all but the simplest systems and properties due to the combinatorial nature of chemical space, i.e. all compositional, constitutional, and conformational isomers. Consequently, efficient exploration algorithms need to exploit all implicit redundancies present in chemical space. I will discuss recently developed statistical learning approaches for interpolating quantum mechanical observables in compositional and constitutional space. Results for our models indicate remarkable performance in terms of accuracy, speed, universality, and size scalability.","filename":"msa199s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}] } Presentation
16:00 - 16:30
Neural Networks Learning Quantum Chemistry
, Olexandr Isayev (University of North Carolina, United States of America)
+ Abstract { "session": {"id":"sess186","title":"MS15 - Machine Learning and Quantum Chemistry","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Boston 3 Room","contributors":[{"type":"Session Chair","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics","Chemistry and Materials","Physics"],"slots":[{"id":"symp135","type":"minisymposia","title":"MS15 - Machine Learning and Quantum Chemistry","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Machine Learning is right now a booming field of computer science which finds applications in the development of computer-human interfaces, in the analysis of medical data of huge populations, in the maintenance of cars, planes and elevators, and self-driving cars, to mention a few. During the last twenty years the field has gone through a development and refinement which has been spectacular. For some reason the use of the technology in pure science has been lagging behind; however, we are now starting to see the use of machine learning in the field of quantum chemistry. Here, the approach will enhance the performance of standard quantum chemical calculations \u2013 improving convergence, could serve as a tool for post-analysis of huge sets of \u003Cem\u003Eab initio\u003C\/em\u003E results, or could simply replace computationally expensive procedures. Machine learning offers practical alternatives where standard quantum chemical simulations would be prohibitive. During the last few years a small number of quantum chemistry groups have explored the potential of machine learning \u2013 the results have been extraordinary and spectacular. Here in this minisymposium we would like to inspire by presenting four different applications in which machine learning is fundamental to success.","bio":"","contributors":[{"type":"Organizer","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":"Sweden","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":"Sweden","bio":"","order":"1","is_presenter":true}]},{"id":"msa199","type":"child","title":"Quantum Machine Learning in Chemical Compound Space","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Many of the most relevant chemical properties of matter depend explicitly on atomistic and electronic details, rendering a first principles approach to chemistry mandatory. Alas, even when using high-performance computers, brute force high-throughput screening of compounds is beyond any capacity for all but the simplest systems and properties due to the combinatorial nature of chemical space, i.e. all compositional, constitutional, and conformational isomers. Consequently, efficient exploration algorithms need to exploit all implicit redundancies present in chemical space. I will discuss recently developed statistical learning approaches for interpolating quantum mechanical observables in compositional and constitutional space. Results for our models indicate remarkable performance in terms of accuracy, speed, universality, and size scalability.","filename":"msa199s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa269","type":"child","title":"Neural Networks Learning Quantum Chemistry","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The development of accurate and transferable machine learning (ML) potentials for predicting molecular energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this talk, we will present a fully transferable deep learning potential that is applicable to complex and diverse molecular systems well beyond the training dataset. Recently we introduced ANAKIN-ME (Accurate NeurAl networK engINe for Molecular Energies) or ANI in short. [doi: 10.1039\/C6SC05720A] ANI is a new \u003Cem\u003Emethod and sampling procedure\u003C\/em\u003E for training NNPs that utilizes a special kind of symmetry functions to build single-atom atomic environment vectors (AEV) as a molecular representation. To train ANI potential we use fully automated approach for the generation of datasets.[arXiv:1801.09319] It is based on the concept of active learning (AL). We show the use of our proposed AL technique develops a universal ANI potential, which provides very accurate energy and force predictions on the entire COMP6 benchmark. This universal potential achieves a level of accuracy on par with the best ML potentials for single molecule or materials while remaining applicable to the general class of organic molecules comprised of the elements CHNOSFCl.","filename":"msa269s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa237","type":"child","title":"Neural Network Representations of Non-Equilibrium Potential Energy Surfaces Sampled in Virtual Reality","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"I will outline recent developments in our group aimed at developing efficient potential energy surface (PES) representations of molecular geometries which are far from equilibrium. Recent progress developing a framework for interactive molecular dynamics in a multi-user virtual reality environment (combining rigorous cloud-mounted physical atomistic simulation with commodity virtual reality hardware) enables us to visualize and sample, with atomic-level precision, the structures and dynamics of complex molecular structures \u0027on the fly\u0027.(arXiv:1801.02884) From within this framework, we can run real-time molecular dynamics (using density functional and semi-empirical theory), accelerating the sampling of high-energy reaction pathways. Combined, these reactive pathways represent a test set of molecular geometries whose energies and forces we then calculate at higher levels (e.g., explicitly correlated local coupled cluster theory), and fit using neural networks. The resultant PES provides coupled cluster quality energies at the cost of classical force fields, enabling us to run thousands of trajectories and thereby make comparisons with experimental dynamical observables in non-equilibrium regimes. I will illustrate this coupled virtual-reality-machine-learning workflow by focusing on recent applications where we have been studying heterogeneous reaction dynamics wherein cyano radicals (CN) undergo reactive scattering at the surfaces of liquids which are composed of long chain hydrocarbons.","filename":"msa237s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Silvia","last_name":"Amabilino","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Lars","last_name":"Bratholm","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Simon","last_name":"Bennie","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}]},{"id":"msa272","type":"child","title":"Predicting the Stability of Solids with Density Functional Theory and Machine Learning","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We use a combination of machine learning techniques and high-throughput density-functional theory calculations to explore ternary compounds with the AB2C2 composition. We chose the two most common intermetallic prototypes for this composition, namely the tI10-CeAl2Ga 2 and the tP10-FeMo2B2 structures. We find that there may be \u223c10 times more stable compounds in these phases than previously known. These are mostly metallic and non-magnetic. While the use of machine learning reduces the overall calculation cost by around 75%, some limitations still exist, in particular for compounds involving the second-row of the periodic table or magnetic elements.","filename":"msa272s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa269","type":"child","title":"Neural Networks Learning Quantum Chemistry","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The development of accurate and transferable machine learning (ML) potentials for predicting molecular energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this talk, we will present a fully transferable deep learning potential that is applicable to complex and diverse molecular systems well beyond the training dataset. Recently we introduced ANAKIN-ME (Accurate NeurAl networK engINe for Molecular Energies) or ANI in short. [doi: 10.1039\/C6SC05720A] ANI is a new \u003Cem\u003Emethod and sampling procedure\u003C\/em\u003E for training NNPs that utilizes a special kind of symmetry functions to build single-atom atomic environment vectors (AEV) as a molecular representation. To train ANI potential we use fully automated approach for the generation of datasets.[arXiv:1801.09319] It is based on the concept of active learning (AL). We show the use of our proposed AL technique develops a universal ANI potential, which provides very accurate energy and force predictions on the entire COMP6 benchmark. This universal potential achieves a level of accuracy on par with the best ML potentials for single molecule or materials while remaining applicable to the general class of organic molecules comprised of the elements CHNOSFCl.","filename":"msa269s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}] } Presentation
16:30 - 17:00
Neural Network Representations of Non-Equilibrium Potential Energy Surfaces Sampled in Virtual Reality
, David Glowacki (University of Bristol, United Kingdom)
+ Abstract { "session": {"id":"sess186","title":"MS15 - Machine Learning and Quantum Chemistry","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Boston 3 Room","contributors":[{"type":"Session Chair","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics","Chemistry and Materials","Physics"],"slots":[{"id":"symp135","type":"minisymposia","title":"MS15 - Machine Learning and Quantum Chemistry","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Machine Learning is right now a booming field of computer science which finds applications in the development of computer-human interfaces, in the analysis of medical data of huge populations, in the maintenance of cars, planes and elevators, and self-driving cars, to mention a few. During the last twenty years the field has gone through a development and refinement which has been spectacular. For some reason the use of the technology in pure science has been lagging behind; however, we are now starting to see the use of machine learning in the field of quantum chemistry. Here, the approach will enhance the performance of standard quantum chemical calculations \u2013 improving convergence, could serve as a tool for post-analysis of huge sets of \u003Cem\u003Eab initio\u003C\/em\u003E results, or could simply replace computationally expensive procedures. Machine learning offers practical alternatives where standard quantum chemical simulations would be prohibitive. During the last few years a small number of quantum chemistry groups have explored the potential of machine learning \u2013 the results have been extraordinary and spectacular. Here in this minisymposium we would like to inspire by presenting four different applications in which machine learning is fundamental to success.","bio":"","contributors":[{"type":"Organizer","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":"Sweden","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":"Sweden","bio":"","order":"1","is_presenter":true}]},{"id":"msa199","type":"child","title":"Quantum Machine Learning in Chemical Compound Space","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Many of the most relevant chemical properties of matter depend explicitly on atomistic and electronic details, rendering a first principles approach to chemistry mandatory. Alas, even when using high-performance computers, brute force high-throughput screening of compounds is beyond any capacity for all but the simplest systems and properties due to the combinatorial nature of chemical space, i.e. all compositional, constitutional, and conformational isomers. Consequently, efficient exploration algorithms need to exploit all implicit redundancies present in chemical space. I will discuss recently developed statistical learning approaches for interpolating quantum mechanical observables in compositional and constitutional space. Results for our models indicate remarkable performance in terms of accuracy, speed, universality, and size scalability.","filename":"msa199s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa269","type":"child","title":"Neural Networks Learning Quantum Chemistry","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The development of accurate and transferable machine learning (ML) potentials for predicting molecular energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this talk, we will present a fully transferable deep learning potential that is applicable to complex and diverse molecular systems well beyond the training dataset. Recently we introduced ANAKIN-ME (Accurate NeurAl networK engINe for Molecular Energies) or ANI in short. [doi: 10.1039\/C6SC05720A] ANI is a new \u003Cem\u003Emethod and sampling procedure\u003C\/em\u003E for training NNPs that utilizes a special kind of symmetry functions to build single-atom atomic environment vectors (AEV) as a molecular representation. To train ANI potential we use fully automated approach for the generation of datasets.[arXiv:1801.09319] It is based on the concept of active learning (AL). We show the use of our proposed AL technique develops a universal ANI potential, which provides very accurate energy and force predictions on the entire COMP6 benchmark. This universal potential achieves a level of accuracy on par with the best ML potentials for single molecule or materials while remaining applicable to the general class of organic molecules comprised of the elements CHNOSFCl.","filename":"msa269s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa237","type":"child","title":"Neural Network Representations of Non-Equilibrium Potential Energy Surfaces Sampled in Virtual Reality","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"I will outline recent developments in our group aimed at developing efficient potential energy surface (PES) representations of molecular geometries which are far from equilibrium. Recent progress developing a framework for interactive molecular dynamics in a multi-user virtual reality environment (combining rigorous cloud-mounted physical atomistic simulation with commodity virtual reality hardware) enables us to visualize and sample, with atomic-level precision, the structures and dynamics of complex molecular structures \u0027on the fly\u0027.(arXiv:1801.02884) From within this framework, we can run real-time molecular dynamics (using density functional and semi-empirical theory), accelerating the sampling of high-energy reaction pathways. Combined, these reactive pathways represent a test set of molecular geometries whose energies and forces we then calculate at higher levels (e.g., explicitly correlated local coupled cluster theory), and fit using neural networks. The resultant PES provides coupled cluster quality energies at the cost of classical force fields, enabling us to run thousands of trajectories and thereby make comparisons with experimental dynamical observables in non-equilibrium regimes. I will illustrate this coupled virtual-reality-machine-learning workflow by focusing on recent applications where we have been studying heterogeneous reaction dynamics wherein cyano radicals (CN) undergo reactive scattering at the surfaces of liquids which are composed of long chain hydrocarbons.","filename":"msa237s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Silvia","last_name":"Amabilino","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Lars","last_name":"Bratholm","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Simon","last_name":"Bennie","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}]},{"id":"msa272","type":"child","title":"Predicting the Stability of Solids with Density Functional Theory and Machine Learning","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We use a combination of machine learning techniques and high-throughput density-functional theory calculations to explore ternary compounds with the AB2C2 composition. We chose the two most common intermetallic prototypes for this composition, namely the tI10-CeAl2Ga 2 and the tP10-FeMo2B2 structures. We find that there may be \u223c10 times more stable compounds in these phases than previously known. These are mostly metallic and non-magnetic. While the use of machine learning reduces the overall calculation cost by around 75%, some limitations still exist, in particular for compounds involving the second-row of the periodic table or magnetic elements.","filename":"msa272s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa237","type":"child","title":"Neural Network Representations of Non-Equilibrium Potential Energy Surfaces Sampled in Virtual Reality","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"I will outline recent developments in our group aimed at developing efficient potential energy surface (PES) representations of molecular geometries which are far from equilibrium. Recent progress developing a framework for interactive molecular dynamics in a multi-user virtual reality environment (combining rigorous cloud-mounted physical atomistic simulation with commodity virtual reality hardware) enables us to visualize and sample, with atomic-level precision, the structures and dynamics of complex molecular structures \u0027on the fly\u0027.(arXiv:1801.02884) From within this framework, we can run real-time molecular dynamics (using density functional and semi-empirical theory), accelerating the sampling of high-energy reaction pathways. Combined, these reactive pathways represent a test set of molecular geometries whose energies and forces we then calculate at higher levels (e.g., explicitly correlated local coupled cluster theory), and fit using neural networks. The resultant PES provides coupled cluster quality energies at the cost of classical force fields, enabling us to run thousands of trajectories and thereby make comparisons with experimental dynamical observables in non-equilibrium regimes. I will illustrate this coupled virtual-reality-machine-learning workflow by focusing on recent applications where we have been studying heterogeneous reaction dynamics wherein cyano radicals (CN) undergo reactive scattering at the surfaces of liquids which are composed of long chain hydrocarbons.","filename":"msa237s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Silvia","last_name":"Amabilino","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Lars","last_name":"Bratholm","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Simon","last_name":"Bennie","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Silvia","last_name":"Amabilino","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Lars","last_name":"Bratholm","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Simon","last_name":"Bennie","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}] } Presentation
17:00 - 17:30
Predicting the Stability of Solids with Density Functional Theory and Machine Learning
, Miguel A. L. Marques (Martin Luther University Halle-Wittenberg, Germany)
+ Abstract { "session": {"id":"sess186","title":"MS15 - Machine Learning and Quantum Chemistry","date":"Monday, July 2nd 2018","begin_time":"15:30","end_time":"17:30","room":"Boston 3 Room","contributors":[{"type":"Session Chair","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":""}],"view_type":"VIII","view_type_id":"evtt110","tracks":["Emerging Application Domains","Computer Science and Applied Mathematics","Chemistry and Materials","Physics"],"slots":[{"id":"symp135","type":"minisymposia","title":"MS15 - Machine Learning and Quantum Chemistry","has_begin_end_time":true,"has_just_one_minute":true,"is_parent":true,"abstract":"Machine Learning is right now a booming field of computer science which finds applications in the development of computer-human interfaces, in the analysis of medical data of huge populations, in the maintenance of cars, planes and elevators, and self-driving cars, to mention a few. During the last twenty years the field has gone through a development and refinement which has been spectacular. For some reason the use of the technology in pure science has been lagging behind; however, we are now starting to see the use of machine learning in the field of quantum chemistry. Here, the approach will enhance the performance of standard quantum chemical calculations \u2013 improving convergence, could serve as a tool for post-analysis of huge sets of \u003Cem\u003Eab initio\u003C\/em\u003E results, or could simply replace computationally expensive procedures. Machine learning offers practical alternatives where standard quantum chemical simulations would be prohibitive. During the last few years a small number of quantum chemistry groups have explored the potential of machine learning \u2013 the results have been extraordinary and spectacular. Here in this minisymposium we would like to inspire by presenting four different applications in which machine learning is fundamental to success.","bio":"","contributors":[{"type":"Organizer","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":"Sweden","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Organizer","first_name":"Roland","last_name":"Lindh","affiliation":"Uppsala University","country":"Sweden","bio":"","order":"1","is_presenter":true}]},{"id":"msa199","type":"child","title":"Quantum Machine Learning in Chemical Compound Space","begin_time":"15:30","end_time":"16:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"Many of the most relevant chemical properties of matter depend explicitly on atomistic and electronic details, rendering a first principles approach to chemistry mandatory. Alas, even when using high-performance computers, brute force high-throughput screening of compounds is beyond any capacity for all but the simplest systems and properties due to the combinatorial nature of chemical space, i.e. all compositional, constitutional, and conformational isomers. Consequently, efficient exploration algorithms need to exploit all implicit redundancies present in chemical space. I will discuss recently developed statistical learning approaches for interpolating quantum mechanical observables in compositional and constitutional space. Results for our models indicate remarkable performance in terms of accuracy, speed, universality, and size scalability.","filename":"msa199s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Anders S.","last_name":"Christensen","affiliation":"University of Basel","country":"Switzerland","bio":"","order":"1","is_presenter":true}]},{"id":"msa269","type":"child","title":"Neural Networks Learning Quantum Chemistry","begin_time":"16:00","end_time":"16:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"The development of accurate and transferable machine learning (ML) potentials for predicting molecular energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this talk, we will present a fully transferable deep learning potential that is applicable to complex and diverse molecular systems well beyond the training dataset. Recently we introduced ANAKIN-ME (Accurate NeurAl networK engINe for Molecular Energies) or ANI in short. [doi: 10.1039\/C6SC05720A] ANI is a new \u003Cem\u003Emethod and sampling procedure\u003C\/em\u003E for training NNPs that utilizes a special kind of symmetry functions to build single-atom atomic environment vectors (AEV) as a molecular representation. To train ANI potential we use fully automated approach for the generation of datasets.[arXiv:1801.09319] It is based on the concept of active learning (AL). We show the use of our proposed AL technique develops a universal ANI potential, which provides very accurate energy and force predictions on the entire COMP6 benchmark. This universal potential achieves a level of accuracy on par with the best ML potentials for single molecule or materials while remaining applicable to the general class of organic molecules comprised of the elements CHNOSFCl.","filename":"msa269s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Olexandr","last_name":"Isayev","affiliation":"University of North Carolina","country":"United States of America","bio":"","order":"1","is_presenter":true}]},{"id":"msa237","type":"child","title":"Neural Network Representations of Non-Equilibrium Potential Energy Surfaces Sampled in Virtual Reality","begin_time":"16:30","end_time":"17:00","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"I will outline recent developments in our group aimed at developing efficient potential energy surface (PES) representations of molecular geometries which are far from equilibrium. Recent progress developing a framework for interactive molecular dynamics in a multi-user virtual reality environment (combining rigorous cloud-mounted physical atomistic simulation with commodity virtual reality hardware) enables us to visualize and sample, with atomic-level precision, the structures and dynamics of complex molecular structures \u0027on the fly\u0027.(arXiv:1801.02884) From within this framework, we can run real-time molecular dynamics (using density functional and semi-empirical theory), accelerating the sampling of high-energy reaction pathways. Combined, these reactive pathways represent a test set of molecular geometries whose energies and forces we then calculate at higher levels (e.g., explicitly correlated local coupled cluster theory), and fit using neural networks. The resultant PES provides coupled cluster quality energies at the cost of classical force fields, enabling us to run thousands of trajectories and thereby make comparisons with experimental dynamical observables in non-equilibrium regimes. I will illustrate this coupled virtual-reality-machine-learning workflow by focusing on recent applications where we have been studying heterogeneous reaction dynamics wherein cyano radicals (CN) undergo reactive scattering at the surfaces of liquids which are composed of long chain hydrocarbons.","filename":"msa237s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Silvia","last_name":"Amabilino","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"1","is_presenter":false},{"type":"Author","first_name":"Lars","last_name":"Bratholm","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"2","is_presenter":false},{"type":"Author","first_name":"Simon","last_name":"Bennie","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"3","is_presenter":false},{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"David","last_name":"Glowacki","affiliation":"University of Bristol","country":"United Kingdom","bio":"","order":"4","is_presenter":true}]},{"id":"msa272","type":"child","title":"Predicting the Stability of Solids with Density Functional Theory and Machine Learning","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We use a combination of machine learning techniques and high-throughput density-functional theory calculations to explore ternary compounds with the AB2C2 composition. We chose the two most common intermetallic prototypes for this composition, namely the tI10-CeAl2Ga 2 and the tP10-FeMo2B2 structures. We find that there may be \u223c10 times more stable compounds in these phases than previously known. These are mostly metallic and non-magnetic. While the use of machine learning reduces the overall calculation cost by around 75%, some limitations still exist, in particular for compounds involving the second-row of the periodic table or magnetic elements.","filename":"msa272s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}]}]}, "slot": {"id":"msa272","type":"child","title":"Predicting the Stability of Solids with Density Functional Theory and Machine Learning","begin_time":"17:00","end_time":"17:30","has_begin_end_time":true,"has_just_one_minute":false,"is_parent":false,"abstract":"We use a combination of machine learning techniques and high-throughput density-functional theory calculations to explore ternary compounds with the AB2C2 composition. We chose the two most common intermetallic prototypes for this composition, namely the tI10-CeAl2Ga 2 and the tP10-FeMo2B2 structures. We find that there may be \u223c10 times more stable compounds in these phases than previously known. These are mostly metallic and non-magnetic. While the use of machine learning reduces the overall calculation cost by around 75%, some limitations still exist, in particular for compounds involving the second-row of the periodic table or magnetic elements.","filename":"msa272s1.pdf","bio":"","contributors":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}],"has_presenters":true,"presenters":[{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}]}, "slotContributors": [{"type":"Author","first_name":"Miguel A. L.","last_name":"Marques","affiliation":"Martin Luther University Halle-Wittenberg","country":"Germany","bio":"","order":"1","is_presenter":true}] } Presentation
Organizer(s):
Sharlee Climer (University of Missouri - St. Louis, United States of America)
, Daniel Jacobson (Oak Ridge National Laboratory, United States of America)
Track(s):
Life Sciences, Engineering, Emerging Application Domains, Computer Science and Applied Mathematics
Difficult combinatorial problems permeate virtually every area of the sciences, business, and government and many of these problems can be cast as mixed-integer programs (MIPs). A MIP is a mathematical definition of a problem that is comprised of a set of constraints and an objective function. In general, MIPs are NP-hard and require exponential amounts of computation time in the worst case. However, search strategies, such as branch-and-bound, branch-and-cut, and cut-and-solve have evolved to provide optimal solutions for many instances.
Although there has been great progress in this field and computational power has dramatically increased over the years, many important MIPs remain intractable and the use of massive parallelization appears to be a promising means to address this great need. However, many challenges lie ahead. This minisymposium will elucidate some of these challenges, while highlighting progress in this field. It includes a round table discussion with Michael Chan, Sharlee Climer, Daniel Jacobson, Sarah Powers, and Daniel Rehfeldt, and is open to conference participants. The goal of the discussions will be to explore and integrate high-performance expertise with domain-specific insights with an aim to identify strategies that may resolve these pressing challenges.
Although there has been great progress in this field and computational power has dramatically increased over the years, many important MIPs remain intractable and the use of massive parallelization appears to be a promising means to address this great need. However, many challenges lie ahead. This minisymposium will elucidate some of these challenges, while highlighting progress in this field. It includes a round table discussion with Michael Chan, Sharlee Climer, Daniel Jacobson, Sarah Powers, and Daniel Rehfeldt, and is open to conference participants. The goal of the discussions will be to explore and integrate high-performance expertise with domain-specific insights with an aim to identify strategies that may resolve these pressing challenges.
16:00 - 16:30
Parallel Cut-and-Solve: A Method for Solving Mixed-Integer Programs Utilizing Distributed Computational Power
, Michael Chan (University of Missouri - St. Louis, United States of America)