DeepModeling

Define the future of scientific computing together

Pre-trained models are sweeping through the AI field by extracting representative information from large-scale unlabeled data and then performing supervised learning on small-scale labeled downstream tasks, becoming the de facto solution in many application scenarios. In drug design, there is still no consensus on the "best way to represent molecules." In the field of materials chemistry, predicting molecular properties is equally important. Mainstream molecular pre-training models typically start from one-dimensional sequences or two-dimensional graph structures, but molecular structures are inherently represented in three-dimensional space. Therefore, directly constructing pre-trained models from three-dimensional information to achieve better molecular representations has become an important and meaningful problem. To further promote research on molecular representation and pre-trained models, Uni-Mol will join the DeepModeling community to work with community developers to advance the development of a three-dimensional molecular representation pre-training framework.

Read more »

On the journey toward developing a Large Atomic Model (LAM), the core Deep Potential development team has launched the OpenLAM initiative for the community. OpenLAM’s slogan is "Conquer the Periodic Table!" The project aims to create an open-source ecosystem centered on microscale large models, providing new infrastructure for microscopic scientific research and driving transformative advancements in microscale industrial design across fields such as materials, energy, and biopharmaceuticals.

Read more »

From the development of software ecosystems in fields such as electronic structure calculations and molecular dynamics, to the systematic evaluation of large models like OpenLAM, and gradually addressing scientific and industrial R&D problems such as biological simulations, drug design, and molecular property prediction, a series of AI4Science scientific computing software and models are rapidly advancing. This progress is closely linked to better research infrastructure, with the Dflow project being a key component.

Read more »

The tight-binding model based on second quantization is a widely used theoretical model in condensed matter physics. In this model:

  • Atoms in a lattice are represented as discrete points with a specific number of electrons.
  • Each electron occupies a corresponding atomic orbital.
  • Using creation and annihilation operators, electron transitions between atomic orbitals are described in the second quantization framework.
  • The Hamiltonian comprises:
    • Transition terms between atomic orbitals.
    • Energy levels of the orbitals.

Project on GitHub: https://github.com/deepmodeling/tbplas

Read more »

The slogan for OpenLAM is "Conquer the Periodic Table!" We hope to provide a new infrastructure for microscale scientific research and drive the transformation of microscale industrial design in fields such as materials, energy, and biopharmaceuticals by establishing an open-source ecosystem around large microscale models. Relevant models, data, and workflows will be consolidated around the AIS Square; related software development will take place in the DeepModeling open-source community. At the same time, we welcome open interaction from different communities in model development, data sharing, evaluation, and testing.

See AIS Square for more details.

Read more »

"The integration of machine learning and physical modeling is revolutionizing the paradigm of scientific research. People aiming to push the boundaries of science and solve challenging problems through computational modeling are coming together in unprecedented ways." Recently, the DeepModeling open-source community has welcomed a new member in the field of macro-scale computation. To further advance the development of the JAX-FEM project, a differentiable finite element method library, JAX-FEM will join the DeepModeling community. Together with developers and users in the community, it aims to expand the frontiers of finite element methods in the AI4Science era.

Community project homepage:
https://github.com/deepmodeling/jax-fem

Read more »

The slogan for OpenLAM is "Conquer the Periodic Table!" We hope to provide a new infrastructure for microscale scientific research and drive the transformation of microscale industrial design in fields such as materials, energy, and biopharmaceuticals by establishing an open-source ecosystem around large microscale models. Relevant models, data, and workflows will be consolidated around the AIS Square; related software development will take place in the DeepModeling open-source community. At the same time, we welcome open interaction from different communities in model development, data sharing, evaluation, and testing.

See AIS Square for more details.

Read more »

Peter Thiel once said, "We wanted flying cars, instead we got 140 characters (Twitter)." Over the past decade, we have made great strides at the bit level (internet), but progress at the atomic level (cutting-edge technology) has been relatively slow.

The accumulation of linguistic data has propelled the development of machine learning and ultimately led to the emergence of Large Language Models (LLMs). With the push from AI, progress at the atomic level is also accelerating. Methods like Deep Potential, by learning quantum mechanical data, have increased the space-time scale of microscopic simulations by several orders of magnitude and have made significant progress in fields like drug design, material design, and chemical engineering.

The accumulation of quantum mechanical data is gradually covering the entire periodic table, and the Deep Potential team has also begun the practice of the DPA pre-training model. Analogous to the progress of LLMs, we are on the eve of the emergence of a general Large Atom Model (LAM). At the same time, we believe that open-source and openness will play an increasingly important role in the development of LAM.

Read more »

In the field of magnetic material science, many frontier challenges require considering the coupling between lattices and spins at the atomic scale. For example, areas such as ultrafast magnetization dynamics, terahertz spintronics, and magnetocaloric materials rely on simulations of energy transfer between lattice and spin subsystems. Additionally, the interplay between magnetism and the lattice directly impacts the performance of materials critical to high-voltage power transmission, the automotive industry, and high-performance batteries.

Read more »

The AI for Science approach, exemplified by DeePMD-kit, has demonstrated immense potential in precise molecular/atomic-scale simulations. At the same time, a robust, efficient, and automated model evaluation system plays a vital role in the ongoing development and practical applications of these methods.

For alloy systems, researchers are particularly interested in the calculation of equilibrium structures and energies, energy-volume relationships, elastic constants, vacancy formation energy, interstitial defect formation energy, and surface formation energy. Comparing these properties with Density Functional Theory (DFT) or experimental results for the same alloy system helps evaluate model accuracy. For machine learning potentials like Deep Potential (DP), such evaluations can also guide optimization of training strategies and parameter settings during preliminary training.

To address these needs, developers from the DeepModeling community, including @kevinwenminion and @ZLI-afk, leveraged the capabilities of the maturing dflow cloud-native workflow for scientific computing. They released an open-source software package called APEX (Alloy Properties EXplorer using simulations). The project repository is available at: APEX GitHub.

1 Historical Background:

The early workflow for alloy property calculations was released in 2019 as part of the DP-GEN code and functioned as a submodule, dpgen autotest. Despite multiple optimizations and iterations, this setup faced challenges:

  • Significant redundancy in development efforts when adding new property calculation modules or simulation methods, making maintenance difficult.
  • Weak error detection and exception handling capabilities.

In response, developers rebuilt the alloy property workflow using dflow's comprehensive and user-friendly workflow development tools, along with tools like the first-principles operator library fpop. The restructured workflow was named Alloy Properties EXplorer using simulations (APEX). APEX aims to provide the community with an efficient, user-friendly, and highly compatible workflow for alloy property testing.


2 Current Features of APEX

The current APEX alloy property testing workflow supports:

  • First-principles calculations (VASP, ABACUS)
  • Molecular dynamics calculations (LAMMPS)

Currently supported alloy properties include:

  • Energy-volume relationship curves (EOS)
  • Elastic constants
  • Surface energy
  • Interstitial formation energy
  • Vacancy formation energy
  • Stacking fault energy (Gamma Line)

New property calculation functionalities will be added in future updates.

2.1 Key Highlights of APEX:

  1. Efficiency in Workflow Management: Enhanced by the integration with dflow, ensuring smooth process control.
  2. Simplicity and User-Friendliness: Easy to install and operate, with intuitive interfaces and intelligent interactions.
  3. Parallel Task Execution: Enables parallel submission of multiple configurations and properties, with one-click execution and result retrieval.
  4. High Extensibility: Facilitates expansion to support additional properties and computational software.

2.2 APEX Workflow Overview

APEX maintains a two-step structure for alloy property calculations:

  1. Relaxation
  2. Property Calculation

These steps are encapsulated into two independent workflows, allowing users to execute either step independently. Alternatively, APEX provides a joint workflow that merges both steps for a streamlined, one-click property testing process.


2.3 Example Outputs:

Full Workflow for Relaxation + Testing (Joint Workflow)

Elastic Constants and Modulus

Equation of State (EOS)

By combining advanced AI methods, user-friendly design, and open collaboration, APEX establishes itself as a powerful tool for alloy property exploration in scientific research and engineering applications.

The full code for APEX is available on GitHub at https://github.com/deepmodeling/APEX.

Looking ahead, APEX will expand to support functionalities such as phonon spectrum calculations, dislocation structure calculations, and finite-temperature property calculations. It will also enhance post-processing capabilities to enable automatic extraction of results, plotting, and preparation of final reports, making it easier for users to intuitively analyze test results.

We welcome everyone to provide feedback by submitting issues on APEX's GitHub repository or contribute code via pull requests.

0%