Determining of Temporal Directionality in Texts: a Neural Network-Based Approach for Chronological Ordering Based on Pairwise Word Analysis
DOI:
https://doi.org/10.31649/1997-9266-2024-177-6-121-128Keywords:
intelligent technologies, machine learning, artificial intelligence, neural networks, natural language processing, temporal directionality, information technologyAbstract
The article presents a neural network approach for determining the temporal orientation in texts, which allows reconstructing the chronology of events even in the absence of explicit time markers. This approach determines the probabilistic order of words in texts, taking into account their statistical and linguistic relationships. In contrast to traditional approaches that rely on explicit temporal expressions or publication dates, the proposed approach allows to estimate the order of events based on the identified relationships between pairs of words in documents, describing events.
To analyze the temporal orientation, neural networks are used to model the relationships between words by comparing their occurrence in texts in pairs. Formulas have been developed to calculate temporal orientation indicators based on the frequency of occurrence of words in dated texts. The obtained indicators are normalized, this provides a better interpretation of the results.
Based on these indicators, a set of features was formed to train machine learning models according to various criteria. To test the effectiveness, we created a Ukrainian-language corpus of 127,000 social media news and applied several models: Gradient Boosting Classifier, Random Forest Classifier, Decision Tree, and Logistic Regression. As an example, 48 features that characterize the news, were selected. The experiments revealed that the Gradient Boosting Classifier model showed the best result with an accuracy of 89.76 % on the validation dataset, which exceeded the accuracy of other models such as Random Forest (74.81%) and Decision Tree (68.97 %).
The proposed approach proved to be effective in modeling the chronological relationships between events, which is important for text automation tasks. The approach can be used to analyze news, chronologically organize historical events, and work with text data in large arrays.
References
W. Xiang, and B. Wang, “A survey of event extraction from text,” IEEE Access. vol. 7, pp. 173111-173137, 2019.
S. Zhang, L. Huang, and Q. Ning, “Extracting Temporal Event Relation with Syntactic-Guided Temporal Graph Transformer,” arXiv: 2104, 09570, 2021.
X. Xu, T. Gao, Y. Wang, and X. Xuan, “Event temporal relation extraction with attention mechanism and graph neural network,” Tsinghua Sci. Technol., vol. 27, pp. 79-90, 2021.
M. Ballesteros, O. Papadopoulou, and N. Goyal, “Severing the edge between before and after: Neural architectures for temporal ordering of events,” in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020, pp. 5068-5079.
Q. Ning, Z. Feng, and D. Roth, “A Structured Learning Approach to Temporal Relation Extraction,” in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP), Copenhagen, Denmark, 2017, pp. 1027-1037.
T. Goyal, and G. Durrett, “Embedding time expressions for deep temporal ordering models,” Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 4401-4411, 2019.
Y. Liu, J. Ma, and P. Li, “Predicting higher-order patterns in temporal networks,” in Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM), 2021, pp. 3219-3228.
W. Xia, Y. Li, and S. Li, “Graph neural point process for temporal interaction prediction,” in Proceedings of the 39th International Conference on Machine Learning (ICML), 2023, pp. 1-10.
Q. Ning, S. Subramanian, and D. Roth, “An Improved Neural Baseline for Temporal Relation Extraction,” in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019, pp. 6203-6209.
A. Naik, L. Breitfeller, and C. Rose, “TDDiscourse: A dataset for discourse-level temporal ordering of events,” Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019, pp. 239-249.
В. Б. Мокін, i М. В. Дратований, Наука про дані: машинне навчання та інтелектуальний аналіз даних, електр. навч. посіб. комбінованого (локального та мережевого) використання. Вінниця, Україна: ВНТУ, 2024, 258 с. [Електронний ресурс]. Режим доступу: https://docs.vntu.edu.ua/card.php?id=8163.
Downloads
-
pdf (Українська)
Downloads: 1
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).