Information Technology for Verification Answers in the Intellectual Automated Knowledge Control System
DOI: https://doi.org/10.31649/1997-9266-2020-151-4-58-66
Keywords: intellectual system, test control of knowledge, answers in arbitrary text form, length of the longest common subsequence, Hirschberg algorithm
Abstract
The process of verifying answers to "open" questions (short text answers, essays) in testing systems is investigated. Automated systems that implement this process are analyzed, and the limitations of applying them to assess knowledge in technical disciplines are identified. The relevance of research into "fuzzy" string comparison methods for verifying answers submitted in arbitrary text form is justified. The functional structure of a module that verifies such answers in an intelligent automated knowledge control system is proposed in the form of a step-by-step structural diagram of how an input answer is checked. Computational algorithms are given for each processing stage and were implemented in C#. The first step "normalizes" the words of the answer and of the etalon (the correct answer to the question, stored in the database). The result of this step is two string arrays: the first holds the words of the answer, the second the words of the etalon (words shorter than four characters are excluded). The second step repeatedly calls a function that finds the length of the longest common subsequence (LCS) for pairs of words drawn from the answer array and the etalon array. A block diagram of the LCS calculation procedure, based on the recursive algorithm proposed by Hirschberg, is presented. Because each word of the input answer is compared with every word of the etalon, including synonyms, the LCS length can be found even when the word order in the answer differs from that in the etalon, which is an advantage of the proposed approach. In the third step, the total similarity indicator of the answer and the etalon is calculated as the sum of the LCS lengths of the individual words.
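The first three steps can be illustrated with a short sketch. It is written in Python rather than the authors' C#, and it is not their implementation. The function names, the regex-based reading of "normalization", the choice of the best etalon match per answer word, and the percentage scaling in `similarity_percent` are all assumptions made for illustration; synonym handling is omitted.

```python
import re

MIN_WORD_LEN = 4  # the abstract excludes words shorter than four characters


def normalize(text: str) -> list[str]:
    """Assumed normalization: lowercase, split on non-word characters, drop short words."""
    return [w for w in re.findall(r"\w+", text.lower()) if len(w) >= MIN_WORD_LEN]


def lcs_last_row(a: str, b: str) -> list[int]:
    """Row r with r[j] = LCS length of a and b[:j], kept in O(len(b)) memory."""
    prev = [0] * (len(b) + 1)
    for ch in a:
        curr = [0]
        for j, bj in enumerate(b, start=1):
            if ch == bj:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev


def hirschberg_lcs(a: str, b: str) -> str:
    """Recursive divide-and-conquer LCS in linear space (after Hirschberg, 1975)."""
    if not a or not b:
        return ""
    if len(a) == 1:
        return a if a in b else ""
    mid = len(a) // 2
    left = lcs_last_row(a[:mid], b)
    right = lcs_last_row(a[mid:][::-1], b[::-1])
    # Split b where the two halves together give the longest subsequence.
    k = max(range(len(b) + 1), key=lambda j: left[j] + right[len(b) - j])
    return hirschberg_lcs(a[:mid], b[:k]) + hirschberg_lcs(a[mid:], b[k:])


def total_lcs(answer_words: list[str], etalon_words: list[str]) -> int:
    """Sum, over answer words, of the best LCS length against any etalon word."""
    return sum(
        max((len(hirschberg_lcs(a, e)) for e in etalon_words), default=0)
        for a in answer_words
    )


def similarity_percent(answer: str, etalon: str) -> float:
    """Assumed scaling: total LCS divided by the larger total word length."""
    answer_words, etalon_words = normalize(answer), normalize(etalon)
    denom = max(sum(map(len, answer_words)), sum(map(len, etalon_words)))
    if denom == 0:
        return 0.0
    return 100.0 * total_lcs(answer_words, etalon_words) / denom
```

Under these assumptions, `similarity_percent("memory linear algorithm", "algorithm linear memory")` evaluates to 100.0, which reflects the order independence described above. A different scaling of the summed LCS lengths would change the percentages but not the matching logic.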
The last step formulates the verification result from the value of the similarity indicator (the threshold is set according to requirements: 50 % for a high level of coincidence, 30 % for sufficient, 10 % for low). The proposed information technology was tested on answers in arbitrary text form, and the results were compared with those of Levenshtein distance and latent semantic analysis. The proposed technology gives the best result when checking answers that use synonymous words.
For 50 answer variants of different lengths (from 10 to 200 characters), the share of false results was 4 %. Based on the test, a recommended maximum length of 200 characters was set for the answer and the etalon; this length provides higher accuracy. Directions for future research are outlined: increasing the efficiency of the algorithm by introducing an additional processing stage that determines the "general" degree of similarity of the answer and the etalon based on the Jaccard coefficient; implementing an automated intelligent knowledge control system based on client-server technology; and mapping the test result onto a relative scale for assessing the level of knowledge.
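The verdict step and the Jaccard-based stage named as future work might look like the sketch below. The threshold values are those quoted in the abstract. The dictionary keys, the `>=` acceptance rule, the function names, and the word-set form of the Jaccard coefficient are assumptions made for illustration, not the authors' design.

```python
# Threshold values come from the abstract; the level names used as keys
# and the ">=" acceptance rule are assumptions of this sketch.
THRESHOLDS = {"high": 50.0, "sufficient": 30.0, "low": 10.0}


def is_accepted(similarity_percent: float, level: str = "high") -> bool:
    """Accept the answer if its similarity reaches the configured threshold."""
    return similarity_percent >= THRESHOLDS[level]


def jaccard(answer: str, etalon: str) -> float:
    """Jaccard coefficient |A ∩ B| / |A ∪ B| over the sets of lowercase words."""
    a = set(answer.lower().split())
    b = set(etalon.lower().split())
    union = a | b
    if not union:
        return 0.0  # convention chosen here for two empty inputs
    return len(a & b) / len(union)
```

For example, `jaccard("linear space algorithm", "algorithm with linear space")` shares three of four distinct words and gives 0.75. Such a coarse score could serve as a cheap first check before the per-word LCS comparison, though the abstract does not say how the two would be combined.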
References
Modular Object Oriented Dynamic Learning Environment [Online]. Available: https://moodle.org.
SunRav TestOfficePro. Software for creating tests [Online]. Available: http://sunrav.ru/testofficepro.html.
WriteToLearn is a web-based tool for building writing skills and reading comprehension in grades 4-12. [Online]. Available: https://www.pearsonassessments.com/professional-assessments/products/programs/write-to-learn.html.
Versant. Automated language tests for recruitment, training and student placement [Online]. Available: https://www.pearson.com/english/versant.html.
O. I. Komarnytska, “A method for semantic comparison of fuzzy text information,” Collection of Scientific Works of the Military Institute of Taras Shevchenko National University of Kyiv, iss. 47, pp. 127-132, 2014. (in Ukrainian). [Online]. Available: http://nbuv.gov.ua/UJRN/Znpviknu_2014_47_22. Accessed: Apr. 21, 2020.
Yu. Yu. Cherepanova, “Knowledge control with answers in natural language,” Eastern-European Journal of Enterprise Technologies. Information Technologies, no. 4/2(40), pp. 32-36, 2009. (in Russian). [Online]. Available: http://journals.uran.ua/eejet/article/view/20825/18401. Accessed: Apr. 21, 2020.
Yu. P. Kondratenko and S. O. Volkova, “A software package for automated testing of students' knowledge,” Tekhnichni visti (Technical News), no. 1(22), 2(23), pp. 32-36, 2006. (in Ukrainian).
I. D. Rudinsky, Fundamentals of Formal-Structural Modeling of Learning Systems and Automation of Knowledge Testing (monograph). Moscow, Russia: Goryachaya Liniya-Telekom, 2004, 204 p. (in Russian).
A. Barlybayev and A. Sharipbay, “An intelligent system for learning, controlling and assessment knowledge,” Information, vol. 18, no. 5(A), pp. 1817-1827, 2015.
Heinz Dreher, “Interactive On-line Formative Evaluation of Student Assignments,” Issues in Informing Science and Information Technology, vol. 3, pp. 189-197, 2006. https://doi.org/10.28945/882.
R. Williams, and H. Dreher, “Formative assessment visual feedback in computer graded essays,” Issues in Informing Science and Information Technology, vol. 2, pp. 23-32, 2005. https://doi.org/10.28945/808.
Olga C. Santos, Jesus G. Boticario, and Diana Pérez-Marín, “Extending web-based educational systems with personalised support through User Centred Designed recommendations along the e-learning life cycle,” Science of Computer Programming, vol. 88, pp. 92-109, 2014. https://doi.org/10.1016/j.scico.2013.12.004.
K. T. Kuzma, “Analysis of methods for verifying an answer submitted in text form in testing systems,” Scientific Notes of V. I. Vernadsky Taurida National University. Series: Technical Sciences, vol. 29 (68), no. 1, part 1, pp. 163-167, 2019. (in Ukrainian). [Online]. Available: http://www.tech.vernadskyjournals.in.ua/journals/2018/1_2018/part_1/30.pdf. Accessed: Apr. 21, 2020.
K. T. Kuzma and O. V. Melnyk, “Computational technology for verifying answers in testing systems,” Scientific Notes of V. I. Vernadsky Taurida National University. Series: Technical Sciences, vol. 31 (70), no. 1, part 1, pp. 85-88, 2020. (in Ukrainian). https://doi.org/10.32838/2663-5941/2020.1-1/15.
B. Smyth, Methods and Algorithms for Computing on Strings (Russian translation from English). Moscow, Russia: Williams Publishing House, 2006, 496 p. (in Russian).
D. S. Hirschberg, “A linear space algorithm for computing maximal common subsequences,” Communications of the ACM, vol. 18, no. 6, pp. 341-343, June, 1975. https://doi.org/10.1145/360825.360861.
Orgad Keller, Tsvi Kopelowitz, and Moshe Lewenstein, “On the longest common parameterized subsequence,” Theoretical Computer Science, vol. 410, no. 51, pp. 5347-5353, November, 2009. https://doi.org/10.1016/j.tcs.2009.09.011.