AlchemistZoro
diff --git a/‎1.sh‎
Lines changed: 8 additions & 0 deletions b/‎1.sh‎
Lines changed: 8 additions & 0 deletions
diff --git a/‎2.sh‎
Lines changed: 4 additions & 0 deletions b/‎2.sh‎
Lines changed: 4 additions & 0 deletions
diff --git a/‎3.sh‎
Lines changed: 5 additions & 0 deletions b/‎3.sh‎
Lines changed: 5 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 26 additions & 80 deletions b/‎README.md‎
Lines changed: 26 additions & 80 deletions
diff --git a/‎greaselm.py‎ ‎conceptlm.py‎greaselm.py renamed to conceptlm.py
Lines changed: 1 addition & 1 deletion b/‎greaselm.py‎ ‎conceptlm.py‎greaselm.py renamed to conceptlm.py
Lines changed: 1 addition & 1 deletion
diff --git a/‎eval_greaselm.sh‎ ‎eval_concept.sh‎eval_greaselm.sh renamed to eval_concept.sh b/‎eval_greaselm.sh‎ ‎eval_concept.sh‎eval_greaselm.sh renamed to eval_concept.sh
diff --git a/‎exp.csv‎
Lines changed: 0 additions & 19 deletions b/‎exp.csv‎
Lines changed: 0 additions & 19 deletions
diff --git a/‎figs/greaselm.png‎
-134 KB b/‎figs/greaselm.png‎
-134 KB
diff --git a/‎modeling/modeling_greaselm.py‎ ‎modeling/modeling_conceptlm.py‎modeling/modeling_greaselm.py renamed to modeling/modeling_conceptlm.py b/‎modeling/modeling_greaselm.py‎ ‎modeling/modeling_conceptlm.py‎modeling/modeling_greaselm.py renamed to modeling/modeling_conceptlm.py
diff --git a/‎run_greaselm.sh‎ ‎run_conceptlm.sh‎run_greaselm.sh renamed to run_conceptlm.sh b/‎run_greaselm.sh‎ ‎run_conceptlm.sh‎run_greaselm.sh renamed to run_conceptlm.sh
@@ -0,0 +1,8 @@
+# CUDA_VISIBLE_DEVICES=5 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True -k 1
+# CUDA_VISIBLE_DEVICES=5 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True -k 3
+# CUDA_VISIBLE_DEVICES=5 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True -k 7
+# CUDA_VISIBLE_DEVICES=5 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 2
+# CUDA_VISIBLE_DEVICES=5 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 3
+CUDA_VISIBLE_DEVICES=5 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 5
+# CUDA_VISIBLE_DEVICES=6 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 10 
+# CUDA_VISIBLE_DEVICES=6 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 20
@@ -0,0 +1,4 @@
+CUDA_VISIBLE_DEVICES=5 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True 
+CUDA_VISIBLE_DEVICES=5 ./run_conceptlm.sh csqa --data_dir data/ --emp False --use_wandb True 
+CUDA_VISIBLE_DEVICES=6 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --gnn_dim 100
+CUDA_VISIBLE_DEVICES=6 ./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --gnn_dim 300
@@ -0,0 +1,5 @@
+CUDA_VISIBLE_DEVICES=7 ./run_conceptlm.sh obqa --data_dir data/ --emp True --use_wandb True -k 1
+CUDA_VISIBLE_DEVICES=7 ./run_conceptlm.sh obqa --data_dir data/ --emp True --use_wandb True -k 3
+CUDA_VISIBLE_DEVICES=7 ./run_conceptlm.sh obqa --data_dir data/ --emp True --use_wandb True -k 5
+CUDA_VISIBLE_DEVICES=7 ./run_conceptlm.sh obqa --data_dir data/ --emp True --use_wandb True -k 7
+CUDA_VISIBLE_DEVICES=7 ./run_conceptlm.sh csqa --data_dir data/ --emp True --use_wandb True -k 5
@@ -1,18 +1,4 @@
-# ConceptLM: Graph REASoning Enhanced Language Models for Question Answering
-
-This repo provides the source code & data of our paper [ConceptLM: Graph REASoning Enhanced Language Models for Question Answering](https://arxiv.org/abs/2201.08860) (ICLR 2022 spotlight). If you use any of our code, processed data or pretrained models, please cite:
-```bib
-@inproceedings{zhang2021conceptlm,
-  title={ConceptLM: Graph REASoning Enhanced Language Models},
-  author={Zhang, Xikun and Bosselut, Antoine and Yasunaga, Michihiro and Ren, Hongyu and Liang, Percy and Manning, Christopher D and Leskovec, Jure},
-  booktitle={International Conference on Learning Representations},
-  year={2021}
-}
-```
-
-<p align="center">
-  <img src="./figs/conceptlm.png" width="600" title="ConceptLM model architecture" alt="">
-</p>
+# ConceptLM
 
 ## 1. Dependencies
 
@@ -66,92 +52,52 @@ The script to download and preprocess the [MedQA-USMLE](https://github.com/jind1
 ### Directly download preprocessed data
 For your convenience, if you don't want to preprocess the data yourself, you can download all the preprocessed data [here](https://drive.google.com/drive/folders/1T6B4nou5P3u-6jr0z6e3IkitO8fNVM6f?usp=sharing). Download them into the top-level directory of this repo and unzip them. Move the `medqa_usmle` and `ddb` folders into the `data/` directory.
 
-### Resulting file structure
 
-The resulting file structure should look like this:
-
-```plain
-.
-├── README.md
-├── data/
-    ├── cpnet/                 (prerocessed ConceptNet)
-    ├── csqa/
-        ├── train_rand_split.jsonl
-        ├── dev_rand_split.jsonl
-        ├── test_rand_split_no_answers.jsonl
-        ├── statement/             (converted statements)
-        ├── grounded/              (grounded entities)
-        ├── graphs/                (extracted subgraphs)
-        ├── ...
-    ├── obqa/
-    ├── medqa_usmle/
-    └── ddb/
-```
 
 ## 3. Training ConceptLM
 To train ConceptLM on CommonsenseQA, run
 ```
 CUDA_VISIBLE_DEVICES=0 ./run_conceptlm.sh csqa --data_dir data/
 ```
-CSQA with pool
-```
-CUDA_VISIBLE_DEVICES=3 ./run_conceptlm.sh csqa --data_dir data/ --use_wandb True --emp False -mbs 4
-```
-You can specify up to 2 GPUs you want to use in the beginning of the command `CUDA_VISIBLE_DEVICES=...`.
 
-Similarly, to train ConceptLM on OpenbookQA, run
+Debug on OBQA
 ```
-CUDA_VISIBLE_DEVICES=0 ./run_conceptlm.sh obqa --data_dir data/
+CUDA_VISIBLE_DEVICES=0 ./run_conceptlm.sh obqa --data_dir data/  --emp True --debug True
 ```
 
-Embedding pool experiment test
-```
-CUDA_VISIBLE_DEVICES=1 ./run_conceptlm.sh obqa --data_dir data/ --use_wandb True --emp False  
-```
 
-Debug
+## 4. Experimental expansion
+### BASE MODEL
 ```
-CUDA_VISIBLE_DEVICES=0 ./run_conceptlm.sh obqa --data_dir data/  --emp True --debug True
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True 
+./run_conceptlm.sh csqa --data_dir data/ --emp False --use_wandb True 
 ```
 
-To train ConceptLM on MedQA-USMLE, run
+### Different number of mixed coding layers
 ```
-CUDA_VISIBLE_DEVICES=0 ./run_conceptlm__medqa_usmle.sh
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True -k 1
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True -k 3
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True -k 7
 ```
-
-## 4. Pretrained model checkpoints
-You can download a pretrained ConceptLM model on CommonsenseQA [here](https://drive.google.com/file/d/1QPwLZFA6AQ-pFfDR6TWLdBAvm3c_HOUr/view?usp=sharing), which achieves an IH-dev acc. of `79.0` and an IH-test acc. of `74.0`.
-
-You can also download a pretrained ConceptLM model on OpenbookQA [here](https://drive.google.com/file/d/1-QqyiQuU9xlN20vwfIaqYQ_uJMP8d7Pv/view?usp=sharing), which achieves an test acc. of `84.8`.
-
-You can also download a pretrained ConceptLM model on MedQA-USMLE [here](https://drive.google.com/file/d/1j0QxiBiGbv0s9PhseSly6V6uiHWU5IEt/view?usp=sharing), which achieves an test acc. of `38.5`.
-
-## 5. Evaluating a pretrained model checkpoint
-To evaluate a pretrained ConceptLM model checkpoint on CommonsenseQA, run
+### Entity encoding node
 ```
-CUDA_VISIBLE_DEVICES=0 ./eval_conceptlm.sh csqa --data_dir data/ --load_model_path /path/to/checkpoint
+./run_conceptlm.sh obqa --data_dir data/ --emp True --use_wandb True -k 1
+./run_conceptlm.sh obqa --data_dir data/ --emp True --use_wandb True -k 3
+./run_conceptlm.sh obqa --data_dir data/ --emp True --use_wandb True -k 5
+./run_conceptlm.sh obqa --data_dir data/ --emp True --use_wandb True -k 7
+./run_conceptlm.sh csqa --data_dir data/ --emp True --use_wandb True -k 5
 ```
-Again you can specify up to 2 GPUs you want to use in the beginning of the command `CUDA_VISIBLE_DEVICES=...`.
-
-Similarly, to evaluate a pretrained ConceptLM model checkpoint on OpenbookQA, run
+### Different number of interaction nodes
 ```
-CUDA_VISIBLE_DEVICES=0 ./eval_conceptlm.sh obqa --data_dir data/ --load_model_path /path/to/checkpoint
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 2
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 3
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 5
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 10 
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --mix_number 20
 ```
-To evaluate a pretrained ConceptLM model checkpoint on MedQA-USMLE, run
+### Subgraphs of different number of nodes
 ```
-INHERIT_BERT=1 CUDA_VISIBLE_DEVICES=0 ./eval_conceptlm.sh medqa_usmle --data_dir data/ --load_model_path /path/to/checkpoint
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --gnn_dim 100
+./run_conceptlm.sh obqa --data_dir data/ --emp False --use_wandb True --gnn_dim 300
 ```
 
-## 6. Use your own dataset
-- Convert your dataset to  `{train,dev,test}.statement.jsonl`  in .jsonl format (see `data/csqa/statement/train.statement.jsonl`)
-- Create a directory in `data/{yourdataset}/` to store the .jsonl files
-- Modify `preprocess.py` and perform subgraph extraction for your data
-- Modify `utils/parser_utils.py` to support your own dataset
-
-## 7. Acknowledgment
-This repo is built upon the following work:
-```
-QA-GNN: Question Answering using Language Models and Knowledge Graphs
-https://github.com/michiyasunaga/qagnn
-```
-Many thanks to the authors and developers!
@@ -577,7 +577,7 @@ def main(args):
 
     args.hf_version = transformers.__version__
 
-    with wandb.init(project="KG-LM", config=args, name=args.run_name, resume="allow", id=wandb_id, settings=wandb.Settings(start_method="fork"), mode=wandb_mode):
+    with wandb.init(project="CLM-TEST", config=args, name=args.run_name, resume="allow", id=wandb_id, settings=wandb.Settings(start_method="fork"), mode=wandb_mode):
         print(socket.gethostname())
         print ("pid:", os.getpid())
         print ("screen: %s" % subprocess.check_output('echo $STY', shell=True).decode('utf'))