Extending RotatE: Interpretability and Multi-Relational Modeling

Introduction

This is an extended PyTorch implementation of the RotatE model for knowledge graph embedding (KGE). We provide a toolkit that aims to improve performance of several popular KGE models. There is also some new models (RotateCT, MRotatE, MRotatECT) in addition to main models provided in the baseline. The toolkit has also used a new method for negative sampling phase that uses temprature annealing to make balance between the exploration and exploitation.

A faster multi-GPU implementation of RotatE and other KGE models is available in GraphVite.

This project is a fork of Repository that belongs to RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space paper.

Execution Instructions

A jupyter notebook is provided to make execution of model easier in google colab enviroment. You can access it here.

Implemented features

Models:

Evaluation Metrics:

MRR, MR, HITS@1, HITS@3, HITS@10 (filtered)
AUC-PR (for Countries data sets)

Loss Function:

Uniform Negative Sampling
Self-Adversarial Negative Sampling
Temperature-Annealed Self-Adversarial Negative Sampling (new)

Usage

Knowledge Graph Data:

entities.dict: a dictionary map entities to unique ids
relations.dict: a dictionary map relations to unique ids
train.txt: the KGE model is trained to fit this data set
valid.txt: create a blank file if no validation data is available
test.txt: the KGE model is evaluated on this data set

Reproducing the best results

To reprocude the results in the ICLR 2019 paper RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space, you can run the bash commands in best_config.sh to get the best performance of RotatE, TransE, and ComplEx on five widely used datasets (FB15k, FB15k-237, wn18, wn18rr, Countries).

The run.sh script provides an easy way to search hyper-parameters:

bash run.sh train RotatE FB15k 0 0 1024 256 1000 24.0 1.0 0.0001 150000 16 1.0 0 -de

Speed

The KGE models usually take about half an hour to run 10000 steps on a single GeForce GTX 1080 Ti GPU with default configuration. And these models need different max_steps to converge on different data sets:

Dataset	FB15k	FB15k-237	wn18	wn18rr	Countries S*
MAX_STEPS	150000	100000	80000	80000	40000
TIME	9 h	6 h	4 h	4 h	2 h

Results of the RotatE model

Dataset	FB15k	FB15k-237	wn18	wn18rr
MRR	.797 ± .001	.337 ± .001	.949 ± .000	.477 ± .001
MR	40	177	309	3340
HITS@1	.746	.241	.944	.428
HITS@3	.830	.375	.952	.492
HITS@10	.884	.533	.959	.571

Using the library

The python libarary is organized around 3 objects:

TrainDataset (dataloader.py): prepare data stream for training
TestDataSet (dataloader.py): prepare data stream for evluation
KGEModel (model.py): calculate triple score and provide train/test API

The run.py file contains the main function, which parses arguments, reads data, initilize the model and provides the training loop.

Add your own model to model.py like:

def TransE(self, head, relation, tail, mode): if mode == 'head-batch': score = head + (relation - tail) else: score = (head + relation) - tail score = self.gamma.item() - torch.norm(score, p=1, dim=2) return score

Citation

If you use the codes, please cite the following paper:

@inproceedings{sun2018rotate, title={RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space}, author={Zhiqing Sun and Zhi-Hong Deng and Jian-Yun Nie and Jian Tang}, booktitle={International Conference on Learning Representations}, year={2019}, url={https://openreview.net/forum?id=HkgEQnRqYQ}, }

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
codes		codes
data		data
logs		logs
presentation		presentation
visualizations		visualizations
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SEN_Project.ipynb		SEN_Project.ipynb
auto-runner.sh		auto-runner.sh
best_config.sh		best_config.sh
requirements.txt		requirements.txt
run.sh		run.sh
visualizer.ipynb		visualizer.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Extending RotatE: Interpretability and Multi-Relational Modeling

Execution Instructions

About

Uh oh!

Releases

Packages

Languages

License

aminhalvaei/SEN-Project

Folders and files

Latest commit

History

Repository files navigation

Extending RotatE: Interpretability and Multi-Relational Modeling

Execution Instructions

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages