Write a Blog >>
ESEC/FSE 2022
Mon 14 - Fri 18 November 2022 Singapore
Wed 16 Nov 2022 12:00 - 12:07 at SRC Auditorium 2 - Mining Software Repositories Chair(s): Timofey Bryksin

We introduce WikiDoMiner – a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (DOI: 10.5281/zenodo.6671357).

Wed 16 Nov

Displayed time zone: Beijing, Chongqing, Hong Kong, Urumqi change

11:00 - 12:30
Mining Software RepositoriesResearch Papers / Demonstrations at SRC Auditorium 2
Chair(s): Timofey Bryksin JetBrains Research
11:00
15m
Talk
An Exploratory Study on the Predominant Programming Paradigms in Python Code
Research Papers
Robert Dyer University of Nebraska-Lincoln, Jigyasa Chauhan University of Nebraska-Lincoln
DOI Pre-print Media Attached
11:15
15m
Talk
An Empirical Study of Blockchain System Vulnerabilities: Modules, Types, and Patterns
Research Papers
Xiao Yi Chinese University of Hong Kong, Daoyuan Wu Chinese University of Hong Kong, Lingxiao Jiang Singapore Management University, Yuzhou Fang Chinese University of Hong Kong, Kehuan Zhang Chinese University of Hong Kong, Wei Zhang Nanjing University of Posts and Telecommunications
DOI
11:30
15m
Talk
How to Better Utilize Code Graphs in Semantic Code Search?
Research Papers
Yucen Shi Northeastern University, Ying Yin Northeastern University, Zhengkui Wang Singapore Institute of Technology, David Lo Singapore Management University, Tao Zhang Macau University of Science and Technology, Xin Xia Huawei, Yuhai Zhao Northeastern University, Bowen Xu Singapore Management University
DOI
11:45
15m
Talk
23 Shades of Self-Admitted Technical Debt: An Empirical Study on Machine Learning Software
Research Papers
David OBrien Iowa State University, Sumon Biswas Carnegie Mellon University, Sayem Mohammad Imtiaz Iowa State University, Rabe Abdalkareem Carleton University, Emad Shihab Concordia University, Hridesh Rajan Iowa State University
DOI
12:00
7m
Talk
WikiDoMiner: Wikipedia Domain-specific Miner
Demonstrations
Saad Ezzini University of Luxembourg, Sallam Abualhaija University of Luxembourg, Mehrdad Sabetzadeh University of Ottawa
12:08
7m
Talk
RegMiner: Mining Replicable Regression Dataset from Code Repositories
Demonstrations
Xuezhi Song Fudan University, Yun Lin Shanghai Jiao Tong University; National University of Singapore, Yijian Wu Fudan University, Yifan Zhang National University of Singapore, Siang Hwee Ng National University of Singapore, Xin Peng Fudan University, Jin Song Dong National University of Singapore, Hong Mei Peking University