AlloyBERT

This research presents AlloyBERT, a transformer encoder model tailored for predicting properties like elastic modulus and yield strength of alloys based on textual inputs.

Getting Started

Clone the repository

$ git clone https://github.com/cakshat/AlloyBERT.git
cd AlloyBERT

Datasets

For this research, we utilized two primary datasets to explore the performance of transformer models compared to shallow machine learning models in predicting target property values with text inputs.

Multi Principal Elemental Alloys (MPEA) dataset: This dataset, sourced from Citrine Informatics, contains mechanical properties of several alloys. We focused on predicting the experimental Young’s modulus, and the dataset comprises 1546 entries.
Refractory Alloy Yield Strength (RAYS) dataset: This dataset includes experimental yield strength values for refractory alloys. With 813 entries, it provides alloy composition, testing temperature from previous literature, and data from the MPEA30–32 dataset. The dataset offers average yield strength values obtained from various processing methods.

Both the datasets can be found in the data folder as : cd data/MPEA/MPEA.csv and cd data/ys_clean/ys_clean.csv.

How to use

Update the config.py file with desired parameters.
Run python main.py to train the model.
While pretraining make sure to set the configuration to pretrain.
After pretraining, update the path of pretrained model and change mode to finetune.
Our custom trained tokenizer which was used for training can be found in tokenizer folder and can be used if required.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
data		data
model		model
tokenizer		tokenizer
.gitignore		.gitignore
CleanMPEA.ipynb		CleanMPEA.ipynb
LICENSE		LICENSE
MPEA_dataset.csv		MPEA_dataset.csv
README.md		README.md
config.yaml		config.yaml
main.py		main.py
requirements.txt		requirements.txt
shallowML.py		shallowML.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AlloyBERT

Getting Started

Datasets

How to use

About

Releases

Packages

Contributors 2

Languages

License

cakshat/AlloyBERT

Folders and files

Latest commit

History

Repository files navigation

AlloyBERT

Getting Started

Datasets

How to use

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages