TY - JOUR KW - data science KW - machine learning KW - NLP KW - semantic similarity search KW - text similarity KW - thesaurus KW - triples AU - Giavid Valiyev AU - Marcello Piraino AU - Arvid Kok AU - Michael Street AU - Ivana Mestric AU - Retzius Birger AB -

This paper describes initial exploitation of Natural Language Processing (NLP) techniques applied to a specific set of related NATO documents. In particular, the text similarity technique was applied to document sets with the aim of capturing the relationships between documents or sections of documents from semantic and syntactic perspectives. Thesaurus and triple extraction techniques allowed the understanding of the sentences beyond the syntactic structure, thus improving the accuracy in capturing similar content across documents with diverse syntactic structures. The objective is to assess whether Natural Language Processing tools can retrieve relationships and gaps between such kinds of textual data. This work improves interoperability in NATO by enhancing the development and application of policies, directives and other documents, which dictate how Consultation, Command and Control (C3) systems across the Alliance interoperate and support NATO's operational needs.

BT - Information & Security: An International Journal DA - 2020 DO - https://doi.org/10.11610/isij.4713 IS - 2 LA - eng N2 -

This paper describes initial exploitation of Natural Language Processing (NLP) techniques applied to a specific set of related NATO documents. In particular, the text similarity technique was applied to document sets with the aim of capturing the relationships between documents or sections of documents from semantic and syntactic perspectives. Thesaurus and triple extraction techniques allowed the understanding of the sentences beyond the syntactic structure, thus improving the accuracy in capturing similar content across documents with diverse syntactic structures. The objective is to assess whether Natural Language Processing tools can retrieve relationships and gaps between such kinds of textual data. This work improves interoperability in NATO by enhancing the development and application of policies, directives and other documents, which dictate how Consultation, Command and Control (C3) systems across the Alliance interoperate and support NATO's operational needs.

PY - 2020 SE - 187 SP - 187 EP - 202 T2 - Information & Security: An International Journal TI - Initial Exploitation of Natural Language Processing Techniques on NATO Strategy and Policies VL - 47 ER -