A Modular Hierarchical Model for Paper Quality Evaluation

Xi Deng1, Shasha Li1, Jie Yu1, Jun Ma1, Bin Ji1, Wuhang Lin1, Shezheng Song1 and Zibo Yi2, 1National University of Defense Technology, China, 2Information Research Center of Military Science PLA Academy of Military Science, China; Xi Deng1, Shasha Li1, Jie Yu1, Jun Ma1, Bin Ji1, Wuhang Lin1, Shezheng Song1 and Zibo Yi2, 1National University of Defense Technology, China, 2Information Research Center of Military Science PLA Academy of Military Science, China

A Modular Hierarchical Model for Paper Quality Evaluation

Authors

Xi Deng¹, Shasha Li¹, Jie Yu¹, Jun Ma¹, Bin Ji¹, Wuhang Lin¹, Shezheng Song¹ and Zibo Yi², ¹National University of Defense Technology, China, ²Information Research Center of Military Science PLA Academy of Military Science, China

Abstract

Paper quality evaluation is of great significance as it helps to select high quality papers from the massive amount of academic papers. However, existing models needs improvement on the interaction and aggregation of the hierarchical structure. These models also ignore the guiding role of the title and abstract in the paper text. To address above two issues, we propose a well-designed modular hierarchical model (MHM) for paper quality evaluation. Firstly, the input to our model is most of the paper text, and no additional information is needed. Secondly, we fully exploit the inherent hierarchy of the text with three encoders with attention mechanisms: a word-to-sentence(WtoS) encoder, a sentence-to-paragraph(StoP) encoder, and a paper encoder. Specifically, the WtoS encoder uses the pre-trained language model SciBERT to obtain the sentence representation from the word representation. The StoP encoder lets sentences in the same paragraph interact and aggregates them to get paragraph embeddings based on importance scores. The paper encoder does interaction among different hierarchical structures of three modules of a paper text: the paper title, abstract sentences, and body paragraphs. Then this encoder aggregates new representations generated into a compact vector. In addition, the paper encoder models the guiding role of the title and abstract, respectively, generating another two compact vectors. We concatenate the above three compact vectors and additional four manual features to obtain the paper representation. This representation is then fed into a classifier to obtain the acceptance decision, which is a proxy for papersÃ¢â‚¬â„¢ quality. Experimental results on a large-scale dataset built by ourselves show that our model consistently outperforms the previous strong baselines in four evaluation metrics. Quantitative and qualitative analyses further validate the superiority of our model.

Keywords

Paper quality evaluation, Modular, Hierarchical, Attention mechanisms, Interact.

CS&IT Conference Proceedings

A Modular Hierarchical Model for Paper Quality Evaluation