ВИЗНАЧЕННЯ ТЕМПОРАЛЬНОЇ СПРЯМОВАНОСТІ В ТЕКСТАХ: НЕЙРОМЕРЕЖЕВИЙ ПІДХІД ДЛЯ ХРОНОЛОГІЧНОГО ВПОРЯДКУВАННЯ НА ОСНОВІ АНАЛІЗУ ПАР СЛІВ

B. S. Biletskyi; V. B. Mokin

doi:10.31649/1997-9266-2024-177-6-121-128

Authors

B. S. Biletskyi Vinnytsia National Technical University
V. B. Mokin Vinnytsia National Technical University

DOI:

https://doi.org/10.31649/1997-9266-2024-177-6-121-128

Keywords:

intelligent technologies, machine learning, artificial intelligence, neural networks, natural language processing, temporal directionality, information technology

Abstract

The article presents a neural network approach for determining the temporal orientation in texts, which allows reconstructing the chronology of events even in the absence of explicit time markers. This approach determines the probabilistic order of words in texts, taking into account their statistical and linguistic relationships. In contrast to traditional approaches that rely on explicit temporal expressions or publication dates, the proposed approach allows to estimate the order of events based on the identified relationships between pairs of words in documents, describing events.

To analyze the temporal orientation, neural networks are used to model the relationships between words by comparing their occurrence in texts in pairs. Formulas have been developed to calculate temporal orientation indicators based on the frequency of occurrence of words in dated texts. The obtained indicators are normalized, this provides a better interpretation of the results.

Based on these indicators, a set of features was formed to train machine learning models according to various criteria. To test the effectiveness, we created a Ukrainian-language corpus of 127,000 social media news and applied several models: Gradient Boosting Classifier, Random Forest Classifier, Decision Tree, and Logistic Regression. As an example, 48 features that characterize the news, were selected. The experiments revealed that the Gradient Boosting Classifier model showed the best result with an accuracy of 89.76 % on the validation dataset, which exceeded the accuracy of other models such as Random Forest (74.81%) and Decision Tree (68.97 %).

The proposed approach proved to be effective in modeling the chronological relationships between events, which is important for text automation tasks. The approach can be used to analyze news, chronologically organize historical events, and work with text data in large arrays.

Author Biographies

B. S. Biletskyi, Vinnytsia National Technical University

Post-Graduate Student of the Chair of System Analysis and Information Technologies

V. B. Mokin, Vinnytsia National Technical University

Dr. Sc. (Eng.), Professor, Head of the Chair of System Analysis and Information Technologies

References

W. Xiang, and B. Wang, “A survey of event extraction from text,” IEEE Access. vol. 7, pp. 173111-173137, 2019.

S. Zhang, L. Huang, and Q. Ning, “Extracting Temporal Event Relation with Syntactic-Guided Temporal Graph Transformer,” arXiv: 2104, 09570, 2021.

X. Xu, T. Gao, Y. Wang, and X. Xuan, “Event temporal relation extraction with attention mechanism and graph neural network,” Tsinghua Sci. Technol., vol. 27, pp. 79-90, 2021.

M. Ballesteros, O. Papadopoulou, and N. Goyal, “Severing the edge between before and after: Neural architectures for temporal ordering of events,” in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020, pp. 5068-5079.

Q. Ning, Z. Feng, and D. Roth, “A Structured Learning Approach to Temporal Relation Extraction,” in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), Copenhagen, Denmark, 2017, pp. 1027-1037.

T. Goyal, and G. Durrett, “Embedding time expressions for deep temporal ordering models,” Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 4401-4411, 2019.

Y. Liu, J. Ma, and P. Li, “Predicting higher-order patterns in temporal networks,” in Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM), 2021, pp. 3219-3228.

W. Xia, Y. Li, and S. Li, “Graph neural point process for temporal interaction prediction,” in Proceedings of the 39th International Conference on Machine Learning (ICML), 2023, pp. 1-10.

Q. Ning, S. Subramanian, and D. Roth, “An Improved Neural Baseline for Temporal Relation Extraction,” in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019, pp. 6203-6209.

A. Naik, L. Breitfeller, and C. Rose, “TDDiscourse: A dataset for discourse-level temporal ordering of events,” Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019, pp. 239-249.

В. Б. Мокін, i М. В. Дратований, Наука про дані: машинне навчання та інтелектуальний аналіз даних, електр. навч. посіб. комбінованого (локального та мережевого) використання. Вінниця, Україна: ВНТУ, 2024, 258 с. [Електронний ресурс]. Режим доступу: https://docs.vntu.edu.ua/card.php?id=8163.

Determining of Temporal Directionality in Texts: a Neural Network-Based Approach for Chronological Ordering Based on Pairwise Word Analysis

Authors

DOI:

Keywords:

Abstract

Author Biographies

B. S. Biletskyi, Vinnytsia National Technical University

V. B. Mokin, Vinnytsia National Technical University

References

Downloads

Published

How to Cite

Issue

Section

Metrics

Downloads

License

Most read articles by the same author(s)

Language

Make a Submission

Information

Visitors

Current Issue