Passage-based BM25 Hard Negatives: A Simple and Effective Negative Sampling Strategy For Dense Retrieval
Thanh-Do Nguyen, Chi Minh Bui, Thi-Hai-Yen Vuong, Xuan-Hieu Phan
- Anthology ID: 2023.paclic-1.59
- Volume: Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation
- Month: December
- Year: 2023
- Address: Hong Kong, China
- Editors: Chu-Ren Huang, Yasunari Harada, Jong-Bok Kim, Si Chen, Yu-Yin Hsu, Emmanuele Chersoni, Pranav A, Winnie Huiheng Zeng, Bo Peng, Yuxi Li, Junlin Li
- Venue: PACLIC
- Publisher: Association for Computational Linguistics
- Pages: 591–599
- URL: https://aclanthology.org/2023.paclic-1.59
- Cite (ACL): Thanh-Do Nguyen, Chi Minh Bui, Thi-Hai-Yen Vuong, and Xuan-Hieu Phan. 2023. Passage-based BM25 Hard Negatives: A Simple and Effective Negative Sampling Strategy For Dense Retrieval. In Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation, pages 591–599, Hong Kong, China. Association for Computational Linguistics.
- Cite (Informal): Passage-based BM25 Hard Negatives: A Simple and Effective Negative Sampling Strategy For Dense Retrieval (Nguyen et al., PACLIC 2023)
- PDF: https://aclanthology.org/2023.paclic-1.59.pdf
Export citation
@inproceedings{nguyen-etal-2023-passage,
    title = "Passage-based {BM}25 Hard Negatives: A Simple and Effective Negative Sampling Strategy For Dense Retrieval",
    author = "Nguyen, Thanh-Do and Bui, Chi Minh and Vuong, Thi-Hai-Yen and Phan, Xuan-Hieu",
    editor = "Huang, Chu-Ren and Harada, Yasunari and Kim, Jong-Bok and Chen, Si and Hsu, Yu-Yin and Chersoni, Emmanuele and A, Pranav and Zeng, Winnie Huiheng and Peng, Bo and Li, Yuxi and Li, Junlin",
    booktitle = "Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation",
    month = dec,
    year = "2023",
    address = "Hong Kong, China",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.paclic-1.59",
    pages = "591--599",
}
<?xml version="1.0" encoding="UTF-8"?> <modsCollection xmlns="http://www.loc.gov/mods/v3"> <mods ID="nguyen-etal-2023-passage"> <titleInfo> <title>Passage-based BM25 Hard Negatives: A Simple and Effective Negative Sampling Strategy For Dense Retrieval</title> </titleInfo> <name type="personal"> <namePart type="given">Thanh-Do</namePart> <namePart type="family">Nguyen</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Chi</namePart> <namePart type="given">Minh</namePart> <namePart type="family">Bui</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Thi-Hai-Yen</namePart> <namePart type="family">Vuong</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Xuan-Hieu</namePart> <namePart type="family">Phan</namePart> <role> <roleTerm authority="marcrelator" type="text">author</roleTerm> </role> </name> <originInfo> <dateIssued>2023-12</dateIssued> </originInfo> <typeOfResource>text</typeOfResource> <relatedItem type="host"> <titleInfo> <title>Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation</title> </titleInfo> <name type="personal"> <namePart type="given">Chu-Ren</namePart> <namePart type="family">Huang</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Yasunari</namePart> <namePart type="family">Harada</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Jong-Bok</namePart> <namePart type="family">Kim</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Si</namePart> <namePart 
type="family">Chen</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Yu-Yin</namePart> <namePart type="family">Hsu</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Emmanuele</namePart> <namePart type="family">Chersoni</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Pranav</namePart> <namePart type="family">A</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Winnie</namePart> <namePart type="given">Huiheng</namePart> <namePart type="family">Zeng</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Bo</namePart> <namePart type="family">Peng</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Yuxi</namePart> <namePart type="family">Li</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <name type="personal"> <namePart type="given">Junlin</namePart> <namePart type="family">Li</namePart> <role> <roleTerm authority="marcrelator" type="text">editor</roleTerm> </role> </name> <originInfo> <publisher>Association for Computational Linguistics</publisher> <place> <placeTerm type="text">Hong Kong, China</placeTerm> </place> </originInfo> <genre authority="marcgt">conference publication</genre> </relatedItem> <identifier type="citekey">nguyen-etal-2023-passage</identifier> <location> <url>https://aclanthology.org/2023.paclic-1.59</url> </location> <part> <date>2023-12</date> <extent unit="page"> <start>591</start> <end>599</end> </extent> </part> </mods> </modsCollection>
%0 Conference Proceedings
%T Passage-based BM25 Hard Negatives: A Simple and Effective Negative Sampling Strategy For Dense Retrieval
%A Nguyen, Thanh-Do
%A Bui, Chi Minh
%A Vuong, Thi-Hai-Yen
%A Phan, Xuan-Hieu
%Y Huang, Chu-Ren
%Y Harada, Yasunari
%Y Kim, Jong-Bok
%Y Chen, Si
%Y Hsu, Yu-Yin
%Y Chersoni, Emmanuele
%Y A, Pranav
%Y Zeng, Winnie Huiheng
%Y Peng, Bo
%Y Li, Yuxi
%Y Li, Junlin
%S Proceedings of the 37th Pacific Asia Conference on Language, Information and Computation
%D 2023
%8 December
%I Association for Computational Linguistics
%C Hong Kong, China
%F nguyen-etal-2023-passage
%U https://aclanthology.org/2023.paclic-1.59
%P 591-599
Markdown (Informal)
[Passage-based BM25 Hard Negatives: A Simple and Effective Negative Sampling Strategy For Dense Retrieval](https://aclanthology.org/2023.paclic-1.59) (Nguyen et al., PACLIC 2023)