add Trinity model and debug the lem #256


Open: wants to merge 9 commits into main

Conversation

@floatingCatty (Member) commented on May 29, 2025

This pull request introduces a new ScalarMLPFunction module for modular MLP functionality, integrates the Trinity embedding method, and refactors existing code to enhance modularity and maintainability. Key changes include the addition of new functionality for two-body predictions in the e3tb method, improved handling of onsite shifts in loss computation, and cleanup of duplicate code.

New Features and Modules:

  • Added ScalarMLPFunction to dptb/nn/base.py, which implements a modular MLP with configurable dimensions, nonlinearity, initialization, and optional dropout and batch normalization. This replaces duplicate implementations in other files.

  • Introduced the Trinity embedding method in dptb/nn/embedding/__init__.py and integrated it into the e3tb prediction workflow in dptb/nn/deeptb.py. This enables additional embedding options for specific use cases.
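A minimal sketch of what a module like ScalarMLPFunction might look like (the class body, constructor arguments, and defaults here are assumptions for illustration, not the actual dptb/nn/base.py signature):

```python
import torch
import torch.nn as nn

class ScalarMLPFunction(nn.Module):
    """Sketch of a modular MLP: configurable hidden dimensions,
    nonlinearity, and optional dropout / batch normalization."""
    def __init__(self, in_features, hidden_features, out_features,
                 nonlinearity=nn.SiLU, dropout=0.0, batchnorm=False):
        super().__init__()
        dims = [in_features] + list(hidden_features) + [out_features]
        layers = []
        for i in range(len(dims) - 1):
            layers.append(nn.Linear(dims[i], dims[i + 1]))
            if i < len(dims) - 2:  # no activation after the output layer
                if batchnorm:
                    layers.append(nn.BatchNorm1d(dims[i + 1]))
                layers.append(nonlinearity())
                if dropout > 0:
                    layers.append(nn.Dropout(dropout))
        self.net = nn.Sequential(*layers)

    def forward(self, x):
        return self.net(x)

mlp = ScalarMLPFunction(8, [32, 32], 4)
out = mlp(torch.randn(5, 8))
print(tuple(out.shape))  # (5, 4)
```

Centralizing a module like this is what lets the duplicate copies in lem.py and slem.py be deleted: every caller imports one implementation, and configuration differences stay in the constructor arguments.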

Enhancements to e3tb Predictions:

  • Added functionality for two-body predictions in e3tb by introducing edge_prediction_h2 and h2miltonian modules in dptb/nn/deeptb.py. These handle additional edge attributes and Hamiltonian calculations.

Refactoring and Code Cleanup:

  • Removed duplicate ScalarMLPFunction implementations from dptb/nn/embedding/lem.py and dptb/nn/embedding/slem.py, replacing them with the centralized implementation in dptb/nn/base.py. This reduces redundancy and simplifies maintenance.

  • Updated lem.py to use the new ScalarMLPFunction and fixed dimension mismatches in the latent and output scalars.

Loss Computation Improvements:

  • Enhanced the onsite shift calculation in dptb/nnops/loss.py to account for overlap weights and avoid overflow issues. This ensures more accurate loss adjustment for batch data.

@Franklalalala (Collaborator) commented:

Kindly suggesting the flag 'init2b' be renamed to 'only2b':
'only2b'=true: train the two-body part only and skip the heavy message passing.
'only2b'=false: freeze the two-body part and train the message passing.
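The proposed switch could be wired up roughly as below. This is a hedged sketch: the Toy class, the set_trainable helper, and the submodule names embedding / edge_prediction_h2 are illustrative stand-ins, not the actual dptb API.

```python
import torch.nn as nn

class Toy(nn.Module):
    """Stand-in model with a message-passing part and a two-body head."""
    def __init__(self):
        super().__init__()
        self.embedding = nn.Linear(4, 4)           # heavy message passing
        self.edge_prediction_h2 = nn.Linear(4, 4)  # two-body head

def set_trainable(model, only2b: bool):
    # only2b=True: train the two-body head, freeze message passing.
    # only2b=False: freeze the two-body head, train message passing.
    for p in model.embedding.parameters():
        p.requires_grad = not only2b
    for p in model.edge_prediction_h2.parameters():
        p.requires_grad = only2b

m = Toy()
set_trainable(m, only2b=True)
print(m.embedding.weight.requires_grad)  # False
```

Flipping requires_grad is the usual way to express this two-stage schedule; the optimizer then only updates the unfrozen stage.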

@QG-phy (Collaborator) commented on Jun 5, 2025

If we use the two-center (two-body) part first and then add the E3 part, do we still need the statistical analysis of the data?

@floatingCatty (Member, Author) commented:

> If we use the two-center (two-body) part first and then add the E3 part, do we still need the statistical analysis of the data?

No longer needed; I'll leave a comment about it.

    ref_data[AtomicDataDict.NODE_FEATURES_KEY] = ref_data[AtomicDataDict.NODE_FEATURES_KEY] + mu * ref_data[AtomicDataDict.NODE_OVERLAP_KEY]
    ref_data[AtomicDataDict.EDGE_FEATURES_KEY] = ref_data[AtomicDataDict.EDGE_FEATURES_KEY] + mu * ref_data[AtomicDataDict.EDGE_OVERLAP_KEY]
elif batch.max() >= 1:
    slices = [data["__slices__"]["pos"][i] - data["__slices__"]["pos"][i-1] for i in range(1, len(data["__slices__"]["pos"]))]
    slices = [0] + slices
    ndiag_batch = torch.stack([i.sum() for i in self.idp.mask_to_ndiag[data[AtomicDataDict.ATOM_TYPE_KEY].flatten()].split(slices)])      # before this PR
    ndiag_batch = torch.stack([i.shape[0] for i in self.idp.mask_to_ndiag[data[AtomicDataDict.ATOM_TYPE_KEY].flatten()].split(slices)])  # after this PR
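A simplified toy reproduction of this per-structure counting (the mask values and slice sizes are invented, and the leading-zero slice of the original is omitted). It also shows the pitfall discussed in the review: i.sum() yields a 0-d tensor that torch.stack accepts, while i.shape[0] is a plain int and must be wrapped in torch.tensor first.

```python
import torch

# Flat per-atom mask across a batch of two structures (3 and 4 atoms).
mask_to_ndiag = torch.tensor([1, 1, 0, 1, 0, 1, 1])
slices = [3, 4]  # atoms per structure, as derived from data["__slices__"]["pos"]

# i.sum() returns a 0-d tensor, so torch.stack works directly:
ndiag_batch = torch.stack([i.sum() for i in mask_to_ndiag.split(slices)])
print(ndiag_batch)  # tensor([2, 3])

# i.shape[0] is a plain Python int; it must be wrapped before stacking:
natom_batch = torch.stack([torch.tensor(i.shape[0]) for i in mask_to_ndiag.split(slices)])
print(natom_batch)  # tensor([3, 4])
```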
Collaborator: This has to be torch.tensor(i.shape[0]), otherwise it raises an error, because torch.stack operates on tensors, not on ints. The previous i.sum() was fine, but after changing it to i.shape[0] it breaks.

Member Author: Done

floatingCatty and others added 4 commits June 6, 2025 08:30
…lements

Initialize full_mask_to_diag tensor to track diagonal orbital pairs in the reduced matrix. This helps in identifying diagonal elements during further processing.
…alculation

The shift_mu function was extracted to avoid code duplication across multiple loss classes. The onsite shift calculation was simplified by using a more accurate formula that accounts for both node and edge features.
for orbs, islice in self.orbpair_maps.items():
    fio, fjo = orbs.split('-')
    if fio == fjo:
        self.full_mask_to_diag[islice] = True

Collaborator: I added a new attribute here. It collects the indices in the full-basis features whose two orbitals are identical, which is exactly the part that needs the *0.5 factor in feature-to-block!
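A self-contained toy version of that loop (the orbpair_maps contents are invented; the slice layout mimics an s/p basis where the 2p-2p block spans a 3x3 = 9-element feature slice):

```python
import torch

# Hypothetical orbital-pair -> feature-slice map for a tiny s/p basis.
orbpair_maps = {
    "2s-2s": slice(0, 1),    # 1 element,  same orbitals -> diagonal
    "2s-2p": slice(1, 4),    # 3 elements, mixed pair    -> off-diagonal
    "2p-2p": slice(4, 13),   # 9 elements, same orbitals -> diagonal
}

full_mask_to_diag = torch.zeros(13, dtype=torch.bool)
for orbs, islice in orbpair_maps.items():
    fio, fjo = orbs.split("-")
    if fio == fjo:
        full_mask_to_diag[islice] = True

print(full_mask_to_diag.sum().item())  # 10
```

The resulting boolean mask can then index feature tensors directly, as in the EDGE_OVERLAP_KEY snippet below, selecting only the same-orbital columns.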

norm_ss_e = (ref_data[AtomicDataDict.EDGE_OVERLAP_KEY] * ref_data[AtomicDataDict.EDGE_OVERLAP_KEY]).sum(dim=-1)
norm_ss_e_diag = (ref_data[AtomicDataDict.EDGE_OVERLAP_KEY][:,idp.full_mask_to_diag] * ref_data[AtomicDataDict.EDGE_OVERLAP_KEY][:,idp.full_mask_to_diag]).sum(dim=-1)

return mu_n, mu_e, mu_e_diag, norm_ss_n, norm_ss_e, norm_ss_e_diag
Collaborator: I added this function; it computes each of these components.


mu = mu_n + 2 * mu_e - mu_e_diag
ss = norm_ss_n + 2 * norm_ss_e - norm_ss_e_diag
mu = mu / ss
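A worked numeric check of the combination above, assuming mu_* and norm_ss_* are the scalar components returned by the helper. The values are made up for illustration; the structure (node term once, edge terms doubled, diagonal edge part subtracted) follows the snippet itself.

```python
# Invented component values for illustration only.
mu_n, mu_e, mu_e_diag = 1.0, 2.0, 0.5
norm_ss_n, norm_ss_e, norm_ss_e_diag = 4.0, 3.0, 1.0

mu = mu_n + 2 * mu_e - mu_e_diag                  # 1 + 4 - 0.5 = 4.5
ss = norm_ss_n + 2 * norm_ss_e - norm_ss_e_diag   # 4 + 6 - 1   = 9.0
mu = mu / ss
print(mu)  # 0.5
```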
Collaborator: Optimized the logic of this part!
