Graph-Induced Sum-Product Networks (GSPN)

Official Repository of the ICLR 2024 paper "Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product Networks".

Citing us

Please consider citing us if you find the code and paper useful:

@inproceedings{errica_tractable_2024,
  title={Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product Networks},
  author={Errica, Federico and Niepert, Mathias},
  booktitle={The 12th International Conference on Learning Representations (ICLR)},
  year={2024},
}

Requirements

An environment with PyTorch (>=2.0.0), PytorchGeometric (>=2.3.0) and PyDGN (==1.5.0) installed.

You can install PyDGN using pip install pydgn==1.5.0

How to reproduce the results

Remove the --debug option to run experiments in parallel. Please refer to the PyDGN tutorial for an in-depth explanation.

Scarce supervision Experiments

Prepare data (e.g., for benzene)

pydgn-dataset --config-file DATA_CONFIGS/config_benzene.yml

Run the same command for different data configuration files to create the datasets.

Launch Exp (e.g., for benzene)

First, build unsupervised embeddings

pydgn-train  --config-file WEAK_SUP_MODEL_CONFIGS/unsup_model_embedding_generation_categorical.yml --debug

Then, train a classifier on top of them

pydgn-train  --config-file WEAK_SUP_MODEL_CONFIGS/unsup_model_embedding_regression_mlp_weak_supervision.yml --debug

Modify the configuration files accordingly (dataset_name and data_splits_file fields) to run experiments on different datasets. Note that OGBG-molpcba has different configuration files (unsup_model_embedding_generation_multicategorical.yml and unsup_model_embedding_regression_mlp_weak_supervision_ogbg.yml).

Graph Classification Experiments

Prepare data (e.g., for NCI1)

pydgn-dataset --config-file DATA_CONFIGS/config_NCI1.yml

Run the same command for different data configuration files to create the datasets.

Launch Exp (e.g., for NCI1)

Unsupervised GSPN

First, build unsupervised embeddings

pydgn-train  --config-file MODEL_CONFIGS/unsup_model_embedding_generation_categorical.yml --debug

Then, train a classifier on top of them

pydgn-train  --config-file MODEL_CONFIGS/unsup_model_embedding_classification_CHEMICAL.yml --debug

Supervised GSPN

pydgn-train  --config-file MODEL_CONFIGS/sup_model_embedding_classification_CHEMICAL.yml --debug

Modify the configuration files accordingly (dataset_name and data_splits_file fields) to run experiments on different datasets.

Missing Data Experiments

Prepare data (e.g., for benzene)

Run the first part of the Dataset Creation and Model Analysis notebook using jupyter to generate the raw dataset.

Then

pydgn-dataset --config-file DATA_CONFIGS/config_benzene_missing_data.yml

Launch Exp (e.g., for benzene)

pydgn-train  --config-file MODEL_CONFIGS/missing_gaussian_molecular.yml --debug

Modify the configuration files accordingly (dataset_name and data_splits_file fields) to run experiments on different datasets.

Remarks

Once more, these commands show how to run experiments for GPSN, but not all of them. By easily changing the path of the configuration files, you can run all experiments (please have a look at the folders) and reproduce the results for all baselines and datasets.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
DATA_CONFIGS		DATA_CONFIGS
DATA_SPLITS		DATA_SPLITS
DGI_DATA_CONFIGS		DGI_DATA_CONFIGS
LAYERING_CONFIGS		LAYERING_CONFIGS
MODEL_CONFIGS		MODEL_CONFIGS
WEAK_SUP_MODEL_CONFIGS		WEAK_SUP_MODEL_CONFIGS
.gitignore		.gitignore
Dataset Creation and Model Analysis.ipynb		Dataset Creation and Model Analysis.ipynb
Hyper-Parameter Analysis (Supervised Version).ipynb		Hyper-Parameter Analysis (Supervised Version).ipynb
LICENSE.txt		LICENSE.txt
OGBG SMILES Likelihood.ipynb		OGBG SMILES Likelihood.ipynb
README.md		README.md
baseline_mask.py		baseline_mask.py
baselines.py		baselines.py
dataset.py		dataset.py
gspn.png		gspn.png
metric.py		metric.py
model.py		model.py
optimizer.py		optimizer.py
provider.py		provider.py
readout.py		readout.py
sup_model.py		sup_model.py
supervised_embedding_classification.py		supervised_embedding_classification.py
transform.py		transform.py
unsupervised_embedding_classification.py		unsupervised_embedding_classification.py
unsupervised_embedding_generation.py		unsupervised_embedding_generation.py
weakly_supervised_task.py		weakly_supervised_task.py

License

nec-research/graph-sum-product-networks

Folders and files

Latest commit

History

Repository files navigation

Graph-Induced Sum-Product Networks (GSPN)

Citing us

Requirements

How to reproduce the results

Scarce supervision Experiments

Prepare data (e.g., for benzene)

Launch Exp (e.g., for benzene)

Graph Classification Experiments

Prepare data (e.g., for NCI1)

Launch Exp (e.g., for NCI1)

Unsupervised GSPN

Supervised GSPN

Missing Data Experiments

Prepare data (e.g., for benzene)

Launch Exp (e.g., for benzene)

Remarks

About

Resources

License

Stars

Watchers

Forks

Languages