ImpScore: A Metric To Calculate The Implicitness Level of Sentences

This is the repository for the ICLR 2025 paper ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences.

Package To do list

🔨 We have built a Python package that anyone can install: https://pypi.org/project/implicit-score/.

✅ ~~Make ImpScore a package~~ (current version 0.1.3).
🔲 Add functions to enable customized training of ImpScore.

How To Use ImpScore

Download through pip.

First, install the package with pip.

$ pip install implicit-score

Then import the package in your Python code.

import impscore

Use ImpScore to calculate the implicitness scores of single English sentences:

# this downloads the latest model version and loads it onto the GPU
model = impscore.load_model(load_device="cuda")  # or "cpu"

# sentence list
sentences = ["I have to leave now. Talk to you later.",
             "I can't believe we've talked for so long."]

# calculate implicitness scores, prag_embs, and sem_embs. The returned variables are lists of results.
imp_scores, prag_embs, sem_embs = model.infer_single(sentences)

print(imp_scores)
# The outputs will be a tensor list ([0.6709, 1.0984])
# higher score indicates higher level of implicitness.
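
Because a higher score indicates a more implicit sentence, the scores can be used to order sentences. The sketch below is illustrative only: `rank_by_implicitness` is a hypothetical helper, not part of the implicit-score package, and it uses the two scores printed above as plain floats so that it runs without downloading the model. With real model output, the scores would first need to be converted to a Python list (assumed here, not shown).

```python
from typing import List, Tuple


def rank_by_implicitness(sentences: List[str], scores: List[float]) -> List[Tuple[str, float]]:
    """Return (sentence, score) pairs sorted from most to least implicit.

    Hypothetical helper for illustration; not provided by implicit-score.
    """
    if len(sentences) != len(scores):
        raise ValueError("sentences and scores must have the same length")
    return sorted(zip(sentences, scores), key=lambda item: item[1], reverse=True)


sentences = ["I have to leave now. Talk to you later.",
             "I can't believe we've talked for so long."]
# Example scores copied from the output shown above (stand-ins for model output).
example_scores = [0.6709, 1.0984]

ranked = rank_by_implicitness(sentences, example_scores)
for sentence, score in ranked:
    print(f"{score:.4f}  {sentence}")
```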

Use ImpScore to calculate the implicitness scores of English sentence pairs, together with the pragmatic distance between the two sentences in each pair:

model = impscore.load_model(load_device="cuda")

sentence_pairs = [
    ["I have to leave now. Talk to you later.", "I can't believe we've talked for so long."],
    ["You must find a new place and move out by the end of this month.",
     "Maybe exploring other housing options could benefit us both?"]
]

s1_list = [pair[0] for pair in sentence_pairs]  # list of the first sentence in pairs
s2_list = [pair[1] for pair in sentence_pairs]  # list of the second sentence in pairs

# imp_score1 is the implicitness score list for s1 sentences,
# imp_score2 is the implicitness score list for s2 sentences.
# prag_distance is the pragmatic distance list, where prag_distance[i] is the pragmatic distance between s1[i] and s2[i].
imp_score1, imp_score2, prag_distance = model.infer_pairs(s1_list, s2_list)

print(imp_score1, imp_score2, prag_distance)
# the outputs: tensor([0.6709, 0.9273]) tensor([1.0984, 1.3642]) tensor([0.6660, 0.7115])
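
One way to read these outputs is to look, pair by pair, at how much more implicit the second sentence is than the first, next to the pragmatic distance. The following is a minimal sketch under stated assumptions: `summarize_pairs` is a hypothetical helper that is not part of the package, and the inputs are the values printed above, written as plain floats so the example runs without the model.

```python
from typing import Dict, List


def summarize_pairs(score1: List[float], score2: List[float],
                    distance: List[float]) -> List[Dict[str, float]]:
    """Combine per-pair results into one record per sentence pair.

    Hypothetical helper for illustration; not provided by implicit-score.
    """
    if not (len(score1) == len(score2) == len(distance)):
        raise ValueError("all three lists must have the same length")
    return [
        {
            "imp_score1": a,
            "imp_score2": b,
            "imp_gap": round(b - a, 4),  # positive when s2 is more implicit than s1
            "prag_distance": d,
        }
        for a, b, d in zip(score1, score2, distance)
    ]


# Values copied from the example output above (stand-ins for model output).
summary = summarize_pairs([0.6709, 0.9273], [1.0984, 1.3642], [0.6660, 0.7115])
for record in summary:
    print(record)
```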

How To Train ImpScore

ImpScore is also open for customized training, where you can:

change the model hyperparameter settings,
use more training data,
etc.

I plan to incorporate this feature into a future version of implicit-score. For now, you can download the source code and data in this repository for training. The data is the same as what we introduced in the paper.

The repository structure:

├── all_data.csv // training data
├── load_dataset.py
├── train.py  // the training main function
├── model.py // model implementation
└── utils.py

About The Training Data: The training data consists of 112,580 sentence pairs in the form (implicit sentence, explicit sentence). In the file, the first row is the header, and each following row contains two sentence pairs: a positive pair (implicit sentence, explicit sentence) and a negative pair (implicit sentence, explicit sentence). The implicit sentence is the same in both pairs, as described in the paper.
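
The row layout described above could be read roughly as follows. This is a sketch under loud assumptions: the column order (positive implicit, positive explicit, negative implicit, negative explicit), the header names, and the `read_pair_rows` helper are all hypothetical and should be checked against the actual header of all_data.csv. The sample row is made up for illustration and is read from memory rather than from the real file.

```python
import csv
import io
from typing import Iterator, NamedTuple, TextIO


class PairRow(NamedTuple):
    # Assumed column order; verify against the real header of all_data.csv.
    pos_implicit: str
    pos_explicit: str
    neg_implicit: str
    neg_explicit: str


def read_pair_rows(handle: TextIO) -> Iterator[PairRow]:
    """Yield one PairRow per data row, skipping the header row."""
    reader = csv.reader(handle)
    next(reader, None)  # the first row is the header
    for fields in reader:
        if not fields:
            continue  # ignore blank lines
        if len(fields) != 4:
            raise ValueError(f"expected 4 columns, got {len(fields)}")
        row = PairRow(*fields)
        # The implicit sentence is expected to be shared by both pairs.
        if row.pos_implicit != row.neg_implicit:
            raise ValueError("implicit sentence differs between the two pairs")
        yield row


# Made-up sample with hypothetical header names, standing in for all_data.csv.
sample = io.StringIO(
    "pos_implicit,pos_explicit,neg_implicit,neg_explicit\n"
    '"Maybe exploring other housing options could benefit us both?",'
    '"You must find a new place and move out by the end of this month.",'
    '"Maybe exploring other housing options could benefit us both?",'
    '"I have to leave now. Talk to you later."\n'
)
rows = list(read_pair_rows(sample))
print(len(rows), rows[0].pos_explicit)
```

To try it on the real data, replace `sample` with an opened file handle, for example `open("all_data.csv", newline="", encoding="utf-8")`, after confirming the column layout.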

Key packages required to run the code are listed below. Ensure that you have a GPU.

openai==1.34.0
tqdm==4.65.0
pandas==2.1.4
numpy==1.26.4
torch==2.3.0
transformers==4.36.2
datasets==2.19.1
sentence-transformers==3.0.1
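
Before training, it may help to compare the installed versions against this list. The sketch below uses the standard-library `importlib.metadata` module; `check_versions` and `PINNED` are hypothetical names introduced for illustration. The pins are copied from the list above, with PyTorch looked up under its PyPI distribution name `torch`. A mismatch is only reported, since nearby versions may also work.

```python
from importlib import metadata
from typing import Dict, Optional, Tuple

# Pins copied from the package list above.
PINNED: Dict[str, str] = {
    "openai": "1.34.0",
    "tqdm": "4.65.0",
    "pandas": "2.1.4",
    "numpy": "1.26.4",
    "torch": "2.3.0",  # PyTorch is distributed on PyPI as "torch"
    "transformers": "4.36.2",
    "datasets": "2.19.1",
    "sentence-transformers": "3.0.1",
}


def check_versions(pins: Dict[str, str]) -> Dict[str, Tuple[str, Optional[str]]]:
    """Map each package name to (expected version, installed version or None)."""
    report: Dict[str, Tuple[str, Optional[str]]] = {}
    for name, expected in pins.items():
        try:
            installed: Optional[str] = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed in this environment
        report[name] = (expected, installed)
    return report


for name, (expected, installed) in check_versions(PINNED).items():
    # Drop a local build suffix such as "+cu121" before comparing.
    matches = installed is not None and installed.split("+")[0] == expected
    status = "ok" if matches else "check"
    print(f"{name:24s} expected {expected:8s} installed {installed}  [{status}]")
```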

To run the code, simply enter:

$ python train.py

When training is completed, the code automatically generates plots for:

training loss over epochs
implicitness score distribution on test samples
pragmatics and implicitness score distribution on test samples

Feel free to modify or extend the data in the all_data.csv file and train your own metric.

Please cite our metric if you use it in your work:

@inproceedings{wang2025impscore,
    title={ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences},
    author={Yuxin Wang and Xiaomeng Zhu and Weimin Lyu and Saeed Hassanpour and Soroush Vosoughi},
    booktitle={The Thirteenth International Conference on Learning Representations},
    year={2025},
    url={https://openreview.net/forum?id=gYWqxXE5RJ}
}
