GitXplorerGitXplorer
t

infoshare

public
6 stars
0 forks
0 issues

Commits

List of commits on branch main.
Unverified
cc349d7f9ea2fb5cf562c5aa3330baf0efb4865c

add citation

tthesofakillers committed a year ago
Verified
59c2c2cb7bc8dfdbfe7a9a9388d5f2283272e88b

add paper arxiv link

tthesofakillers committed a year ago
Verified
1d61e23526b99c5743a98a624aac1b8ca16a1384

Update README.md

tthesofakillers committed a year ago
Unverified
fbb11f63b2bc937ef16f1c382c3adaaf45ffa0ec

update lisa scripts

tthesofakillers committed a year ago
Unverified
38b27efce53cf6799f18639b4025bb2c8b7d4cb2

use store_true because argparse cant fkn handle booleans

tthesofakillers committed a year ago
Unverified
f32b91f609b31c5c0735dd2458d89ebb3ae53cd7

remove random print statement

tthesofakillers committed a year ago

README

The README file for this repository.

Probing LLMs for Joint Encoding of Linguistic Categories

Paper

Official repository for the paper: "Probing LLMs for Joint Encoding of Linguistic Categories." Findings of EMNLP 2023.

https://arxiv.org/abs/2310.18696

Requirements and Setup

Details such as python and package versions can be found in the generated pyproject.toml and poetry.lock files.

We recommend using an environment manager such as conda. After setting up your environment with the correct python version, please proceed with the installation of the required packages. We provide a requirements.txt file for this.

pip install -r requirements.txt

This requirements.txt file is generated by running the following

sh gen_pip_reqs.sh

Repository contents

.
├── data/                            # Where data is kept
├── experiments/                     # arrays of images
├── images/                          # more individual images
├── lisa/                            # SLURM jobs and configs
├── infoshare/
│   ├── datamodules/                 # handle data loading, processing
│   ├── models/                      # Model implementations
│   ├── run
│   │   ├── test.py                  # run testing
│   │   ├── test_xlingual.py         # run testing across languages
│   │   └── train.py                 # run training
│   ├── __init__.py
│   └── utils.py                     # general utils
├── notebooks/                       # see notebooks/README.md
├── reports/                         # LaTeX and more
├── README.md                        # you are here
├── lswsd_lemmas.txt                 # lemmas used for LSWSD
├── poetry.lock                      # dependencies metadata
├── pyproject.toml                   # project metadata
├── gen_pip_reqs.sh                  # script for generating requirements.txt
└── requirements.txt                 # required packages for PIP

The above was generated with

tree . -L 3 --dirsfirst -I "*.eps|*.png|*.pdf|lightning_logs|*pycache*|backup"

followed by some manual edits.

Citation

If you use this code or find our work otherwise useful, please consider citing our paper:

@inproceedings{starace2023probing,
  title={Probing LLMs for Joint Encoding of Linguistic Categories},
  author={Starace, Giulio and Papakostas, Konstantinos and Choenni, Rochelle and Panagiotopoulos, Apostolos and Rosati, Matteo and Leidinger, Alina and Shutova, Ekaterina},
  booktitle={Findings of the Association for Computational Linguistics: EMNLP 2023},
  pages={7158--7179},
  year={2023}
}