pyHGT/ogbn-mag at master · ofvjet8/pyHGT

History

Name		Name	Last commit message	Last commit date
parent directory ..
pyHGT		pyHGT
README.md		README.md
eval_ogbn_mag.py		eval_ogbn_mag.py
preprocess_ogbn_mag.py		preprocess_ogbn_mag.py
train_ogbn_mag.py		train_ogbn_mag.py

README.md

Usage

This experiment is based on stanford OGB (1.2.1) benchmark. The description of ogbn-mag is avaiable here. The steps are:

run python preprocess_ogbn_mag.py to turn the dataset into our own data structure. As the MAG dataset only have input attributes (features) for paper nodes, for all the other types of nodes (author, affiliation, topic), we simply take the average of their connected paper nodes as their input features.
train the model by python train_ogbn_mag.py --data_dir PATH_OF_DATASET --model_dir PATH_OF_SAVED_MODEL --n_layers 4 --prev_norm --last_norm --use_RTE. Remember to specify your own data and model path.
evaluate the model by python eval_ogbn_mag.py --data_dir PATH_OF_DATASET --model_dir PATH_OF_SAVED_MODEL --task_type variance_reduce. We use mini-batch sampling to get node representation and prediction. Based on it, we provide two evaluation type: - 'sequential': Run the sampling for each batch of test nodes only once, and get one set of prediction results. - 'variance_reduce': Run the sampling for each batch of test nodes multiple times, and get the average prediction score for them as prediction results.

Detailed hyperparameter is:

  --conv_name                      STR     Name of GNN filter (model)                           hgt
  --n_hid                          INT     Number of hidden dimension                           512
  --n_heads                        INT     Number of attention head                             8
  --n_layers                       INT     Number of GNN layers                                 4
  --prev_norm                      BOOL    Whether to use layer-norm on previous layers.        True
  --last_norm                      BOOL    Whether to use layer-norm on the last layer.         True
  --use_RTE                        BOOL    Whether to use RTE                                   True

Reference performance numbers for the ACM dataset:

Model	Accuracy (VR)	Accuracy (Seq)	# Parameter	Hardware
4-layer HGT	0.5007	0.4940	21,173,389	Tesla K80 (12GB)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ogbn-mag

ogbn-mag

README.md

Usage

Files

ogbn-mag

Directory actions

More options

Directory actions

More options

Latest commit

History

ogbn-mag

Folders and files

parent directory

README.md

Usage