Skip to main content
PLOS One logoLink to PLOS One
. 2024 Mar 22;19(3):e0299582. doi: 10.1371/journal.pone.0299582

Application of a text mining method in navigation and communication for enhancing maritime safety

Paulina Hatłas-Sowińska 1,*, Leszek Misztal 2
Editor: Sheraz Aslam3
PMCID: PMC10959359  PMID: 38517917

Abstract

This paper introduces a model for the translation of natural language into ontology and vice versa in an autonomous navigation system of a sea-going vessel. The system comprehensively executes communication tasks at sea. The authors use machine learning methods in the field of text mining and basic and additional properties of ontologies. The newly developed ontology is applicable in shipping. The key elements of the prototype are the sequence of communication commands given from the ship’s bridge, decomposition, extraction of the communication sequence and the rule base. The presented model has been implemented and verified in selected scenarios of collision situations at sea. The test results confirm that the assumptions, the designed system architecture and the algorithms in the prototype are correct.

Introduction

The process of ship conduct requires continuous exchange and processing of navigational information. Whether the decisions made are correct depends on the extent, accuracy and reliability, as well as the appropriate perception of information. Ship navigators are required to use all available means to assess the navigational situation, including ship’s equipment and systems, voice communication and other methods. Voice communication thus provides a channel of communication for obtaining additional information and, where appropriate, agreements. An analysis of maritime court decisions shows that, in cases of collisions, failure to establish voice communication with the other vessel was one of the allegations made against vessels involved in an accident. Wrong decisions can be caused by the failure to establish voice communication, improperly conducted communication or the misunderstanding of the information thus transmitted. Disadvantages of communicating orally include the problem of decoding a message on a semantic level, polarisation (the tendency to express extreme opinions), labelling (noticing problems by naming them rather than analysing them), mixing facts and conclusions, and static judgement (i.e. opinions are not verified even if elements of reality constantly change). The primary task of navigation is to ensure safety by avoiding hazards at sea. Direct communication established between ships followed by automated communication processes can minimize bad decisions and, consequently, wrong actions resulting in accidents at sea.

The majority of navigational accidents occurs due to human error. One classification of these errors was introduced by Reason [1]. Another publication [2] indicated the ’80–20’ rule, which states that up to 80% of accidents are due to human error and 20% are technical accidents.

This analysis of maritime accidents was made using the Maritime Statistical Yearbooks for the years: 2017–2022 [3] and the reports of the State Commission for the Investigation of Marine Accidents 2017–2022 [4].

The percentage of maritime accidents by type (Table 1) between 2017 and 2021 is comparable, i.e. the differences in the categories: serious accident, accident and incident range from 22% to 30%. In contrast, the difference between very serious accidents and the others is significant, ranging from 140% to 230%.

Table 1. Marine accidents by type.

Accident types 2017 2018 2019 2020 2021 SUMA
Very serious accident 11 7 12 4 5 39
Serious accident 53 21 14 11 4 103
Accident 28 21 25 21 36 131
Incident or event 22 17 22 14 18 93
TOTAL 114 66 73 50 63

Source: Data from the State Commission for the Investigation of Marine Accidents

Table 2 illustrates the causes of marine accidents and incidents by year, with various factors responsible for the occurrence of these accidents: mechanical, human, external, other (e.g. organisational).

Table 2. Factors leading to marine accidents and incidents 2017–2020.

Factor 2017 2018 2019 2020 2021
human 55 26 29 26 28
mechanical 52 23 20 15 17
external 7 3 2 3 2
other 3 14 22 6 16

source: [3]

Table 2 shows that:

  • In 2017, in 114 marine accidents and incidents, as many as 52 were influenced by the human factor. In seven cases the external factor was involved and in 52—the mechanical factor. Note that the human factor alone accounted for about 46%;

  • In 2018, of 66 marine accidents and incidents, 26 were due to the human factor. In three cases there was an external factor and in 23 the mechanical factor was at play. This time the human factor accounted for about 40%;

  • in 2019, of 73 marine accidents and incidents as many as 29 were influenced by the human factor. An external factor affected two cases, while 20 other cases were impacted by the mechanical factor. It was noted that the human factor alone accounted for about 39%;

  • in 2020, of 50 marine accidents and incidents as many as 26 were caused by the human factor. Only three cases occurred due to the external factor, and 15 were due to the mechanical factor. It was noted that the human factor alone accounted for about 52%;

  • In 2021, 63 marine accidents and incidents were recorded, of which 28 were caused by the human factor. In two cases the external factor was to blame, and 17 cases had mechanical causes. In this group the human factor was associated with about 45% of all accidents.

The trend for human factor to occur continues to be on the rise (even if the total number of accidents decreases). The research concept for solving the human error problem is the application of artificial intelligence algorithms and machine learning solutions from the field of text mining [5] also referred to as natural language processing [6]. This aims to automate communication between ships that have with an implemented autonomous navigation solution, as well as to automate communication between ships steered by the human navigator and a ship with an autonomous navigation system. The application of models for communication for the solution described in this work takes into account the two-way communication: messages sent by the human navigator to the autonomous system and vice versa. In the former case, voice commands are translated into natural language using speech recognition techniques [7]. Then they are subjected to text data processing and analysis techniques [8] related to tokenisation and word categorisation part-of-speech and command type recognition [9], sentence meaning analysis and message type classification [10] for the corresponding categories in the ontology. Finally, a command sequence expressed in the ontology is created, providing an unambiguous transfer of information to the autonomous system. In the case where the autonomous system initiates communication, the command sequence expressed in the ontology is subjected to the decomposition of the ontology sequence for one of the communication directions, which is then transformed according to the lexical rule base [11] into natural language. The transformation is based on the proposed mathematical model using ontology classes with a generating function and an interpreting function. In the final step, it is possible to utter a navigation-related message using a natural language produced by a speech synthesiser.

Material and methods

Analysis of the present state

One step towards increasing the level of safety is the introduction of innovative transport solutions. Efficiency of transport can be provided by improving transport processes through the use of appropriate IT tools to support these processes. Examples include e-navigation, monitoring and collision detection systems for road and rail traffic, road models for autonomous vehicles or ship systems and equipment. The latest methods in the field of communication are based on the introduction of intelligent conversational systems, i.e. computer programmes designed to simulate intelligent conversation through textual or verbal methods.

Today’s maritime communication is based on convention requirements for ship equipment as well as personnel training that includes the knowledge of the International Convention for Preventing Collisions at Sea, often referred to as the Colllision Regulations, or COLREGs. Detailed communication procedures are set out in the International Radio Regulations issued by the International Telecommunication Union ITU). However, the process of conducting communications itself requires an extensive analysis of the situation based on existing procedures as well as the ship’s equipment and systems.

One problem that arises in this context is that the communication process as understood in the field of maritime transport must be precisely defined. Its main task is to convey a message from the sender in such a way that it is unambiguously understood by the receiver. The large amount of information, the diversity of its type and scope results in the need for processing, integration and selection, and most importantly, an unambiguous form and interpretation.

In radio communication, in addition to lexical and pronunciation problems, there are also technical issues, such as equipment failure. Numerous publications point to the communication system that is ineffective in certain situations. Communication errors are presented in [12].

There are current communication procedures recommended by the ITU, but they are complex and difficult for operators to follow.

In the analysis of the literature, particular attention was paid to these publications: [1315]. Their main highlights include: failures in maritime communication implemented so far, indication that currently there is no ontology ready to be implemented in maritime communication, and presentation of benefits from the implementation of an ontology into the communication process.

A review of numerous publications in various scientific fields shows that the use of ontologies provides tangible benefits and translates into improvements in the quality of operation of a given system or industry. The current state of knowledge confirms that the research to date in the field of maritime transport shows a lack of an unambiguous and functioning ontology-based communication system. Applying ontologies and operating on semantic models extends capabilities of information gathering and processing.

To date, ontology has been used in the construction of several semantic models to describe the state and behaviour of ships at sea. Wen et al. proposed using ontology to build a semantic ship behaviour model (SMSB) based on a dynamic Bayesian network (DBN) to help represent and understand ship behaviour [16]. Van Hage et al. used ontology to build a Simple Event Model (SEM) to infer ship behaviour at different levels of abstraction, integrating knowledge from the network. That case is particularly interesting as it shows that ship position data is not sufficient for a navigator to fully understand the situation at sea [17]. Hagaseth et al. used existing semantic tools to extract the original meaning from maritime regulations texts to enhance the consistency of regulations, and to support compliance and enforcement by actors on ships and in ports [18].

The use of ontology-based information access techniques has also been used to enhance security and cyber security in ports [19]. Other applications of ontologies include underwater robots [20], decision support system in healthcare [21], ontology-based ship modelling [22], and ontology-based approaches to semantic sensors [23].

Communication process

The specific character of maritime transport called for the construction of a separate, unique ontology allowing its application to automatic communication. Apparent deficiencies in verbal communication indicate the need to create a method that allows the inclusion of two parallel areas: navigation and communication. The elements contained in the navigation and communication ontologies are linked by models created using techniques from the field of text data mining for bidirectional communication between the human navigator and the autonomous navigation system. The way the models work and the steps used in them are described in the next section of the article. It also presents the translation between natural language and sequence in the communication ontology and vice versa. A prototype software with a graphical user interface for communication between an autonomous system and a human-operated system is also described. The prototype is based on data from the collision situation scenarios presented in the paper and performs the task of translation between the parties. The implemented prototype generates a natural language output, which can be used as input to a speech synthesiser library containing functions for voice generation. This makes it possible to imitate human speech and convert the autonomous vessel communications to those understood by the traditional system (ship’s personnel).

A method for building an ontology for maritime transport has been based on a model of the communication process, lexical analysis and the so-called loop method.

Communication process model

The creation of a communication model, which forms the basis of the operations performed in the ontology, was based on the given literature, and on expert knowledge of maritime communication. There are many ready-made communication models, such as Lasswell’s persuasive act model [24], the Shannon and Weaver model of signal transmission [25], Newcomb’s triangular model [26], Schramm’s community of experience model [27], the ’subcutaneous sting’ model, the two-stage communication model, the agenda-setting process model of McCombs and Showa and others. None of the above-mentioned models is suitable enough to build a communication ontology for maritime transport on its basis.

The work of developing a customised communication process model was done in several steps: analysis of available communication models, identification of communication participants/ objects, definition of information content, definition of rules and techniques for communication, visualisation of the communication model. The created communication process model (Fig 1) contains the two-way communication activity as well as the ontology. A message sent from the sender to the receiver must pass through an ontology block: encoding, transmission and decoding. Encoding involves linking the type of the message to its content (function f), transmission delivers the message to function g, responsible for decoding the message and passing it to the receiver in a readable from. The reverse action, when the receiver intends to respond to the sender, i.e. feedback, is defined as the receiver’s response to the message after decoding it.

Fig 1. Communication model.

Fig 1

Source: authors’ work.

Lexical analysis

In order to create a method for building an ontology for maritime transport, we need a vocabulary collection scheme for this domain (Fig 2). It consists in collecting data, then analysing it, selecting symbols for the word (if necessary) and implementation. The latter will show whether the concept is well placed in the overall ontology, or whether it should be changed or removed.

Fig 2. Vocabulary collection scheme.

Fig 2

Source: authors’ work.

Lexical analysis is important in developing an ontology-based message structure, as this will allow data with a specific syntax to be read unambiguously. When loading such data, the syntax must be recognised before it can be processed. The coding action is to first divide the loaded string of words into smaller syntactic elements called lexemes, and only further analyse the string of lexemes. This constitutes a separate module dealing with this task, which aims to increase the efficiency of communication. During the lexical analysis of the ontology, a string of words is loaded and broken down into lexemes. However, what is passed on is not exactly lexemes. It is information representing the meaning of the lexeme. It is represented by a symbol and an optional attribute. The symbol represents information about the type of the lexeme. If the lexemes of a given type carry a certain ’’value’’, an attribute is attached to the symbol and is equal to this value.

Loop method

This method involves searching for a given word or pair of words in the order as contained in the ontology. Path finding in an ontology is based on two principles (Fig 3):

Fig 3. Schematic of the loop method.

Fig 3

Source: authors’ work.

  1. Calling up the addressee of the message, selecting the message type and category.

  2. 2—An algorithm searches according to the order of ontology classes: selection of the first class, then subclasses, until the word from the message text is found;

checking for the word you are looking for in the main classes, if the word in question is not found in the body of the message, the search moves on to a subclass.

The loop terminates when the entire ’path’ of the message has been obtained.

The method of building an ontology for maritime transport has been developed so that it ensures that the branch relating to a particular domain—in this case, the navigation ontology—can be continuously extended.

The present research work is based on transforming the ontological notation into natural language, using the text mining method.

Theory/Calculation

Ontology

By definition, ontology is a theory about any domain, describing concepts hierarchically in order to establish semantic relations. It is characterised by a logical theory that imposes constraints on logical models. Ontologies are created for various purposes [28]:

  • to disseminate a common understanding of information structure among people or agent applications,

  • to enable the knowledge in a particular field to be reused many times,

  • to openly clarify assumptions about the chosen domain,

  • to separate knowledge of the domain from the knowledge associated with operating the domain,

  • to analyse knowledge of a specific domain.

The first definition was formulated by [29]: "an ontology is a formal, explicit specification of a shared conceptualisation".

An ontology can be written as a formula [30], where the set O defines the structure of concepts, the relations between them, as well as the theory about the model being defined.

For any broad domain, we can create different ontologies to describe the domain in different ways, so this ontology has been described using a new, proprietary, detailed formula to facilitate the collection of words for maritime transport.

O={A,K,R,f,g} (1)

where

O - ontology;

A - axiom of choice;

K - class of abstraction;

R - relationship

f - message-generating function;

g - message interpretation function.

The basic rules of each element in the set O are given below [3133].

Axiom of choice

One of the axioms of multiplicity theory stating that it is possible to construct a set (called a selector) containing exactly one element from each set belonging to a family of non-empty disjoint sets;

Class abstraction

If X is a non-empty set and ≈ is an equivalence relation on that set, then the sets [x] for xϵX are called abstraction classes of relations ≈ w X. More precisely: the class [x] is called the equivalence (abstraction) class of relations ≈ in X determined by x or with representation of x if it satisfies the conditions:

[x]={yX:xy}
x,yX(y[x](xy).

The set of all equivalence classes of relations ≈ in X is denoted as X/. For any elements: x,x1,x2X, we have:

x[x]
[x1]=[x2]x1x2
[x1][x2][x1][x2]=.

An equivalence relation ≈ defined on the set X, establishes the division of this set into non-empty and pairwise disjoint subsets, i.e. into classes of abstractions in this relation, in such a way that two elements x,yX belong to the same class if and only if xy.

Relation

Let there be given non-empty sets X and Y. A relation on the set X×Y is any subset of the Cartesian product of X×Y. We shall denote the relation as ρ. If we consider a relation ρ between elements of set X and elements of set Y, then ρX×Y,

We write then that for xϵX, yϵY the following occurs:

  • (x,y)∈ρ and we say that the couple (x,y) belongs to a relationship ρ or

  • xρy and then we say that the element x is in a relation ρ with element y.

If X is an n-element set and Y is an m-element set, then there are 2nm of all relations in the set X×Y.

The relation ρX×X is called an equivalence relation if it is reflexive (⋁xXxρx), symmetric (x,yX(xρyyρx)) and transitive x,y,zX[(xρyandyρz)xρz]. We will denote equivalence relations as: ≈:

xyx=y,x,yR
klkl>0,k,lZ{0}
xyxyϵZ,x,yR

The relation is reflexive because: xXxx=0ϵZ

The relationship is symmetric because: x,yRxyϵZyx=(xy)ϵZ)

The relationship is transitive because: x,y,zRxyϵZiyzϵZxz=(xy)+(yz)

The relation ρ is a partial order relation if it is reflexive ⋁xXxρx, antisymmetric x,y,zX[(xρyandyρx)x=y] and transitive x,y,zX[(xρyandyρz)xρz].

The relation ρ is a relation of linear order if it is reflexive ⋁xXxρx, antisymmetric x,y,zX[(xρyandyρx)x=y], transitive x,y,zX[(xρyandyρz)xρz], and consistent x,yX(xρyoryρx).

An example for a linear order relation: xρyxy,x,yR

  • reflexivity: xx

  • antisymmetry: (xyandyx)x=y

  • transitivity: (xyandyz)xz

  • connectivity: xy or yx

Generating function f and interpreting function g

The function f connects one element from the set Y with a selected element or elements from the set X, forming K, the so-called message body. The function g establishes the meaning of the information sent and the actions to be performed in relation to the message received. This function assigns a combination of elements from sets Y and X to the received message Ki.

The functions f and g are expressed by the formulae:

f:Y×XKf(yn,Xk)=Ki (2)
g:KY×Xg(Ki)=(yn,Xk) (3)

where

Y = {y1,y2,…,yl} - a set of message types with associated category, (l∈N),
ynY - selected type with an associated message category, (1≤nl)
X = {x1,x2,…,xm} - a set of navigational concepts (entities contained in the navigation ontology), (m ∈ N),
Xk={xk1,xk2,,xkj}X - a set of entities in the k-th message,
k - the number of the transmitted message (kN),
K - set of messages,
K i - i-th message from the set K,
Ki={sin,si1,si2,,sij} - individual words that appear in a message, (i,j,nN),
where:
sin—n-th word in the i-th message Ki from the set Y,
sij—j-th word in the i-th message Ki from the set X.

The proof of the correct formulation of the above functions consists of three parts: part one covers the need to use the Cartesian product and formulates the properties of the function f, part two describes the set of values of the function f, part three presents the inverse function (continuous transformations).

There exists an ordered pair (y,x) of two elements, assuming that: yY and xX. The Cartesian product of Y×X of the set Y,X is the set of all ordered pairs (y,x) such that:

yY and xX. Hence Y×X={(y,x):yYxX).

The Cartesian product was used to ensure that the words in the message were ordered.

Example:

The Cartesian product of the sets Y = {A_intention, T_information} and X = {course, position, passing} contains 6 ordered pairs:

Y×X = {(A_intention, course), (A_intention, position), (A_intention, passing), (T_information, course), (T_information, position), (T_information, passing)}.

The Cartesian product of the set Y cannot be formed with itself, i.e. Y×Y, since the message K being constructed is of one type only. In contrast, one can form the Cartesian product of a set X with itself, i.e. X×X for the same word path, e.g. X = {course, alter course, to port, collision course} whereby, according to the definition given above, the ordered pairs are: X×X = {(course, alter course,), (course, collision course), (alter course, to port)}.

The proposed software research prototype uses the class sequences present in the ontology to execute communication at sea. After decomposing the entire ontology sequence into one-way communication, the prototype uses a proposed rule base that transforms the decomposed class sequences into sentences formulated in natural language. This goal is achieved by using the proposed mathematical definition of the Cartesian product for the communication and navigation ontology classes in the model and in the rule base.

The function f defined on non-empty sets Y, X such that:

(Y≠∅∧X≠∅) with values in a non-empty set K (K≠∅), the relation: f:Y×XK satisfies the following conditions:

yYx1,x2X[(y,x1)f(y,x2)f]y=y,

e.g: {T_information, course}—specify course and {T_information, position}- specify position;

the domain is Y×X={(yn,Xk):ynYXkX} (i.e. the set of all elements belonging to the set X that are in a relation with at least one element from the set Y); the set of values of the function forms the set K.

Text mining

The increase of computer performance and the capacity of data storage systems has led to an extend in the use of artificial intelligence and machine learning algorithms [34] in science, administration, business, including transport applications [35]. One important methods are text mining algorithms [36], which use various types of data mining techniques, lexical analysis [37], rule induction methods [38], information extraction and other techniques [39]. In this way, it is possible to obtain new information, to understand the communicated intentions on the basis of information gathered from websites, books, opinions, various textual sources and information directly communicated by humans [40]. Practical natural language processing solutions [41] include, for example, machine translations from one language into another (for example, the commonly used Google Translator), understanding intentions expressed in natural language (e.g. car navigation systems), human dialogue systems (used in the ChatGPT conversation system), written text recognition systems (used in OCR applications), speech recognition systems used, for example, in intelligent home control systems, a system solutions for automatic analysis of the type of user feedback used by large sales networks.

The use of text mining algorithms based on ontologies enables the automation of maritime communication for the transmission of information between the captain on board and the vessel equipped with an autonomous control system. The automated prototype described herein is designed to avoid collisions and ensure safe navigation. The solution is implemented using an ontology, i.e. a set of hierarchical rules consisting of classes that, using information sequence records concerning maritime communication and navigation, enable reliable and safe manoeuvres of ships at sea. Therefore, for two-way communication using a secure ontology-based protocol, two models have been proposed for the translation of information passed from an autonomous ship to a traditional ship. This is first model, which, using an ontology-based rule base, creates natural language sentences that can be communicated to the captain using a speech synthesiser. The other model aims to perform a natural language comprehension task (commands spoken on the navigational bridge) using a multi-step method including tokenisation, lexical analysis, part-of-speech recognition [9], command type identification, and, in the final step, generation of a class sequence expressed in the ontology based on the analysed sentences. The information transformed from natural language to a sequence of commands expressed in the ontology, executed in the described manner, carries out the transmission of maritime commands that are unambiguous and comprehensible to the autonomous system. A prototype application has been created for the first of the said models, which translates the sequence of classes in the ontology into natural language following the described steps of the model.

Results

Model of the two-way translation of natural language into/from ontology: Prototype

The proposed prototype of the ontology-based event/command sequence conversion model into natural language is presented in Fig 4 (below). The concept incorporates the conversion of command sequences from the navigation bridge exchanged with a ship with automatic communication into a natural language that will be understood by the navigator on a ship with traditional communication. The model carries out decomposition (when necessary) and extracts the communication sequence expressed in an ontology for one side of the communication. The next step is to create a natural language sentence using the rules defined for the sequence of events in the ontology. The mathematical method herein developed and presented is based on the Cartesian product using the generating function Fg and the interpreting function Fi. The final step, not presented in the present model, will be the ability to utter the created sentence by one of speech synthesis solutions [42].

Fig 4. Model for converting instruction sequences in an ontology into natural language.

Fig 4

Source: authors’ work.

For the implementation of the described model, a prototype application written in the Python programming language [43] was created (Fig 5). It implements the individual steps of the model for the scenarios presented in the article, including the decomposition of event sequences in the ontology and the use of the Rule_gen class with programmed rules [40] to translate events in the ontology into natural language. Additionally, to visualise how the method works, the Tkinter library [44] was used to create a user-friendly prototype interface. The generated natural language sentence sequence based on the ontology sequence written in the scenario of Table 3 (Communication scenario (category: negotiation) with ontology notation for the relative bearing 112°) is presented below.

Fig 5. Prototype operation for translating ontology event sequences into natural language.

Fig 5

Source: authors’ work.

Table 3. Communication scenario (category: Negotiations) with ontological notation for the encounter situation with 112 degree relative bearing.

Source: authors’ work.

Communication scenario:
negotiation
Communication through ontologies Natural language communication (text mining)
A to B: what are your intentions? {ontology of communication, message structures, body, Question, Q_intention} What are your intentions?
B to A: I have right-of-way, I will maintain course and speed {ontology of communication, message structures, body, Answer, A_information},{ontology of navigation, identification of situation, priority, I_have},
{ontology of navigation, identification of situation, keep_the_course_and_speed}.
I have right-of-way, I will keep the course and speed.
A to B: no, you do not have right-of-way; you are the overtaking vessel; you give way to me {ontology of communication, phrase, negative}
{ontology of communication, message structures, body, Tell, T_demand},
{ontology of navigation, identification of situation, priority, I_have},{ontology of navigation, Identification of situation,
Give way}, {ontology of communication, message structures, body, Tell, T_information}, {ontology of navigation, Identification of situation, Over taking}.
No. You do not have right-of-way, you give way to me. You are the overtaking vessel.
B to A: you are the overtaking vessel, you should give way {ontology of communication, phrase, negative},
{ontology of communication, message structures, body, Answer, A_information},
{ontology of navigation, Identification of situation, Over taking}.
{ontology of communication, message structures, body, Tell, T_expectation},
{ontology of navigation, Identification of situation, Give way}.
No. You are the overtaking vessel. You should give way.
A to B: then alter course slightly to port {ontology of communication, message structures, body, Tell, T_information},
{ontology of navigation, Features information, Navigation information, course, Alter course, To port}.
Then alter course slightly to port.
B to A: OK, I am altering course to port to give you more space {ontology of communication, roger},
{ontology of communication, message structures, body, Answer, A_information},{ontology of navigation, Features information, Navigation information, course, Alter course, To port}.{ontology of communication, message structures, body, Tell, T_information},{ontology of navigation, Features information, Ship maneuvering, more space}.
Confirmation. I am altering course to port to make more space.

The next prototype presents an implementation model for converting sentences from natural language to instruction sequences in an ontology.

The model consists of a number of steps, where the input are captain commands in the form of a sequence of natural language sentences, while the output is a sequence of commands in the form of an ontology (Fig 6). At the start of the operation, the model organises the data by tokenising it into individual words and sentences. It then discards words that are not relevant to the spoken commands, but are typical of human interactions (e.g. please, however, but, etc.). The next step is stemming, i.e. reducing the number of similar words that derive from a single parent (root). Lemmatisation, in turn, aims to reduce the number of words by using a base word, thus reducing words with, for example, different endings. The information reduction steps are followed by the step of labelling the types of sentence parts such as nouns, verbs, adjectives and other, as well as identifying the types of commands. This is an important step before using the classifier to recognise the types of messages sent from the ship’s bridge, such as information or request. Then, making use of the knowledge base in the form of rules, an operation will be carried out to create a sequence of commands in the form of classes from an ontology set. This form will be unambiguous and readable for the autonomous system.

Fig 6. A model for converting commands given on the navigational bridge from natural language to a sequence of commands in an ontology.

Fig 6

Source: authors’ work.

The completed prototype communication model (Fig 7) has the following structure:

Fig 7. Extended model of communication.

Fig 7

Source: authors’ work.

Examples of collision situations and the model prototype

To achieve the objectives of this work, these authors employed the method of simulation tests conducted with the use of an ECDIS simulator. Some of the scenarios selected from previously prepared scenarios of ship encounters were executed. The ECDIS simulator consists of eight independent stations (ships) NaviTrainer 4000 from Transas cooperating with eight ECDIS NaviSailor 3000i stations. The simulator, installed at the Maritime University of Technology in Szczecin, makes it possible to simulate practically any ship encounter scenario in the selected shipping area, offering access to more than 10 ship models. Each ship model allows full use of the ship’s equipment. The water area can be visually observed, and the navigator can also use any of the systems typically installed on the modern commercial ship bridge, including those used in ARPA and AIS scenarios. The ship models have full course and speed altering capabilities. The tests were conducted in the following configuration: two stations manned by trained navigators, who have full access to ship’s equipment, with the ability to manoeuvre by altering course and speed, and are in charge of communication; each test participant sees a target ship in sight (visualisation), as well as by means of the ship’s systems and equipment—radar, ARPA, AIS, ECDIS; at the start of the test, the relevant data is recorded from the AIS systems of two independent ships: "own" and "target".

Three encounter situations and two vessel models (non-autonomous, real-time) were selected for the study:

  • crossing courses: ship A on course 270°, and ship B on course 000°; the meeting ships are of medium size (length ca. 170 m), both proceeding full ahead, in sight if each other;

  • reciprocal or almost reciprocal courses, ship A on course 180°, ship B on course 000°; the meeting ships are of medium size (length ca. 170 m), both proceeding at full ahead and seeing each other;

  • encounter with a vessel on relative bearing 112°: vessel A proceeds on course 312°, and vessel B on course 000°; the meeting vessels are of medium size (length approx. 170 m), both proceeding full ahead and sea each other.

The selected scenarios are given as standard ship meeting situations as described in the COLREGs. Nevertheless, they raise many interpretative doubts about the safety of the manoeuvre being performed. The article presents a scenario of a collision situation of an encounter with relative bearing of 112°, because similar situations are often misinterpreted, and one of two rules are taken as applicable: overtaking right-of-way or intersecting courses. Each situation was simulated several times (depending on how the situation developed). Each time, different communication exchanges and related manoeuvres by the ships were recorded. For these, appropriate scenarios were prepared and recorded, allowing the initial situation to be replayed several times with the possibility of any individual subsequent action to be taken by navigators.

Scenario: Ship B course: 000°, ship A course: 312°. Good visibility. Wind force: 2, sea state 1. Ship A sends an enquiry to B about intentions. Vessel B answers, sending information that it has the right of way and will maintain course and speed. Vessel A disagrees and sends a request for ship B to give way. Vessel B also responds and requests ship A to give way, arguing that vessel A is the overtaking vessel. Vessel A informs vessel B that it alters course by 18° to starboard (to the right). Vessel B also alters course, but by 15° to port (to the left) to give more passage space ahead for vessel A (Fig 8). The vessels passed each other at a pre-fixed 0.6 Nm (Fig 9). The trajectory is shown in Fig 10.

Fig 8. Scenario: Relative bearing 112°, negotiation.

Fig 8

Situation during a manoeuvre. Source: Transas ECDIS NaviSailor 3000i.

Fig 9. Scenario: Relative bearing 112°, negotiation.

Fig 9

Situation after the manoeuvre. Source: Transas ECDIS NaviSailor 3000i.

Fig 10. Actual trajectories for the relative bearing 112° (scenario: Negotiation).

Fig 10

Source: authors’ work.

Discussion and future work

For each scenario, transformation of the communication using the ontology into natural language was carried out, based on the decomposition of the communication into individual commands in the ontology and constructing, according to the model, natural language sentences based on the rules contained in the model’s rule base. The sentences built in this way can be successfully communicated to the navigator by generating the human voice using speech synthesis techniques and transmitting it using radio voice communication techniques available on board ships. The prototype software successfully implemented the translation for communication scenarios uploaded for testing.

In the future research work, efforts will be made to extend the prototype of automatic translation between the autonomous system and the human navigator based on the extension of the existing knowledge base of the model with further cases occurring in real scenarios that occur at sea. Another research step will be the cases of collision scenarios for communication in the other direction (feedback), where it will be necessary to realise a prototype based on the proposed model using text mining techniques to implement other scenarios actually occurring at sea. Besides, the authors intend to increase the accuracy of the proposed model for further natural language input used by ship captains. Ultimately, the work will be aimed to devise a fully functional two-way communication model that will enable accurate command recognition and the execution of the full scope of safe collision avoidance manoeuvres as they take place on waterways worldwide.

Proposed models for controlling commands from the captain’s bridge to the autonomous system and automatically launched using empirical data from the presented test scenarios. Due to the operation of the first model—the article as a concept added to semi-autonomous maritime communication.

The target models will be tested based on a significant data set, where:

  • for the first model, the input data will be a set of sequences of events in the ontology, while the output will be sentences in a natural language;

  • for the second model, the input will be commands in a natural language, and the output will be a sequence of events in the ontology.

The test method will use developed models for each pair of elements from the set, where the input to the model will be input data, and the generated output data from the models will be compared with the collected output data (correct data). In this way, it will be possible to compare the correctness of the results and determine the accuracy coefficients of the method. Work on the proposed models is aimed at achieving the highest possible accuracy based on the collected data set.

Conclusions

The automation of communication processes, including verbal communication between navigators on ships, may be one way to reduce collisions at sea.

This article presents assumptions, modes and forms of communication including automatic maritime communication systems as well as selected issues of the generation of outgoing messages, which are the result of automatically performed processes of analysis and interpretation of incoming messages. The models proposed for this purpose, based on text mining techniques using a rule base, made it possible to automatically translate natural language into ontology and vice versa, and consequently to carry out safe sea manoeuvres without the occurrence of collisions. These methods of automating communication processes, including verbal communication, are equally important for manned and unmanned, autonomous ships. The above also applies to other modes of transport.

The authors demonstrated the positive impact of using ontologies in the process of conducting ship-to-ship communication with a text mining approach. The article shows that the application of appropriate ontologies in a communication system could lead to the avoidance of collisions (or possibly to a significant reduction of their consequences). However, the application of the proposed solutions goes beyond collision or close-quarters situations. Sufficiently early communication in the indicated manner will allow potentially dangerous situations to be resolved much earlier, give navigators more time to analyse and observe how the situation develops, and reduce the stress associated with conducting last-minute manoeuvres.

Although correctly applied Collision Regulations and navigation procedures should be sufficient in the cases discussed, practice and the actual collision statistics show that the existing regulations are not always correctly interpreted. This is due to fatigue, haste, various pressures on navigating officers and, sometimes, inexperience and poor knowledge of the professional English language used in maritime navigation. In order to improve the automatic communication system and extend the way navigators communicate automatically, a model prototype was used for translating natural language into ontology (working both ways).

Supporting information

S1 Datasets

(ZIP)

pone.0299582.s001.zip (1.7KB, zip)

Data Availability

Hatłas-Sowińska Paulina, Misztal Leszek (2023). Data Set of ontology and communication scenario for method of enhancing maritime safety. SEANOE. https://doi.org/10.17882/97041.

Funding Statement

The authors received no specific funding for this work.

References

  • 1.Reason J. (1990). Human Error. Cambridge University Press. [Google Scholar]
  • 2.Harrald J., Mazzuchi T., Spahn J., Van Dorp R., Merrick J., Shrestha S., et al. (1998). Using system simulation to model the impact of human error in a maritime system. Safety Science, 30(1–2), 235–247. doi: 10.1016/S0925-7535(98)00048-4 [DOI] [Google Scholar]
  • 3.Statistical Yearbook of Maritime Economy 2017–2022. Accessed: 01.06.2022r, https://stat.gov.pl/obszary-tematyczne/roczniki-statystyczne /roczniki-statystyczne/rocznik-statystyczny-gospodarki-morskiej-2017,11,10.html
  • 4.GDPWM 2017–2022 Annual analysis: marine incidents and accidents. Accessed:12.12.2022 http://pkbwm.gov.pl/images/sampledata/analizy/ Accessed:12.12.2022, http://pkbwm.gov.pl/images/sampledata/analizy/Analiza_roczna_2017.pdf
  • 5.Feldman R, Sanger J, The text mining handbook. Advanced approaches in analysing unstructured data. Cambridge University Press, Cambridge, 2007. [Google Scholar]
  • 6.Dale Robert, Moisl Herman, Somers Harold, editors, Handbook of Natural Language Processing. Marcel Dekker, 2000. [Google Scholar]
  • 7.Jurafsky Daniel, Martin James H., Speech and Language Processing. Prentice Hall, second edition, 2008. [Google Scholar]
  • 8.Weiss Sholom, Indurkhya Nitin, Zhang Tong, Demarau Fred, Text mining: Predictive Methods for Analyzing Unstructured Information. Springer, 2004. [Google Scholar]
  • 9.Indurkhya Nintin, Damerau Fred J., editors, Handbook of Natural Language Processing, second edition, Morgan and Claypool, 2010. [Google Scholar]
  • 10.Berry Michael W., Castellanos Matu, Survey of text mining II clustering, classification and retrival, Springer; 2008. [Google Scholar]
  • 11.Blackburn Patrick, Bos Johan, Representation and inference for natural language: A first course in computational semantics, CSLI Publications, Stanford, 2005. [Google Scholar]
  • 12.Żurek J., Niski R., Radziwanowski M., Jacek Stefański. (2005). Current state and main directions of development of maritime radiocommunication. Stage 1 Paper No. 08300025.
  • 13.Verbal Communication failures and Safety at Sea. (2002). Accessed: 29.01.2022, https://scinapse.io/papers/2289303803.
  • 14.Malyankar R., 1999. Creating a Navigation Ontology. 6. [Google Scholar]
  • 15.Kopacz Z., et al., 2004. Information of maritime navigation, its kinds, components and users. European Journal of Navigation, 3, 53–60. [Google Scholar]
  • 16.Wen Yuanqiao, Huang Yamin, Zhong Shubin, Cheng Xiaodong and Huang Liang, 2019 Ontological Ship Behavior Modeling Based on COLREGs for Knowledge Reasoning, J. Mar. Sci. Eng. 2022, 10(2), 203; 10.3390/jmse10020203 [DOI] [Google Scholar]
  • 17.Willem van Hage Véronique Malaisé, Segers Roxane, Hollink Laura, Schreiber Guus, 2012. the Simple Event Model Ontology, doi: 10.1007/978-1-4614-6230-9_10, In book: Situation Awareness with Systems of Systems. Chapter: 10, Publisher: Springer; 2012. [DOI] [Google Scholar]
  • 18.Hagaseth Marianne, Ceolin Davide, Ontology Based Integration of Ship Inspection, 2012, doi: 10.1007/978-1-4614-6230-9_10, In book: Situation Awareness with Systems of Systems. Chapter: 10, Publisher: Springer. [DOI] [Google Scholar]
  • 19.Pileggi Salvatore F., Indorf Marius, Nagi Ayman and Kersten Wolfgang, CoRiMaS-An Ontological Approach to Cooperative Risk Management in Seaports, Sustainability 2020, 12(11), 4767; 10.3390/su12114767. [DOI] [Google Scholar]
  • 20.Li Xin, Bilbao Sonia, Tamara Martín-Wanton Joaquim Bastos and Rodriguez Jonathan 2017, SWARMs Ontology: A Common Information Model for the Cooperation of Underwater Robots, Sensors (Basel). 2017 Mar; 17(3): 569. Published online 2017 Mar 11. doi: 10.3390/s17030569 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Spoladore Daniele, Pessot Elena, Collaborative Ontology Engineering Methodologies for the Development of Decision Support Systems: Case Studies in the Healthcare Domain, Electronics 2021, 10(9), 1060; doi: 10.3390/electronics10091060 [DOI] [Google Scholar]
  • 22.Zhong Shubin, Wen Yuanqiao, Huang Yamin, Cheng Xiaodong and Huang Liang. Ontological Ship Behavior Modeling Based on COLREGs for Knowledge Reasoning, J. Mar. Sci. Eng. 2022, 10(2), 203; 10.3390/jmse10020203 [DOI] [Google Scholar]
  • 23.Liu Jin, Li Yunhui, Tian Xiaohu, Sangaiah Arun Kumar and Wang Jin; Towards Semantic Sensor Data: An Ontology Approach. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lasswell H. D. 1948. the structure and function of communication in society. In Bryson L. (Ed.), The communication of ideas (pp. 37–51). New York: Harper and Row. [Google Scholar]
  • 25.Shannon C., Weaver W. (1948). The Mathematical Theory of Communication. Bell System Technical Journal, 27, 379–423, 623–656. [Google Scholar]
  • 26.Newcomb T. M. 1953. An approach to the study of communicative acts. Psychological Review, 60(6), 393–404. doi: 10.1037/h0063098 [DOI] [PubMed] [Google Scholar]
  • 27.Littlejohn S. W., Foss K. A. (2009). Encyclopedia of Communication Theory. SAGE. [Google Scholar]
  • 28.Paulina Hatłas, Zbigniew Pietrzykowski, Paweł Banaś Piotr Wołejsza; 2016. Subontology of communication in the automation of negotiating processes in maritime navigation. Zeszyty Naukowe Akademii Morskiej w Szczecinie, No. 46 (118). 10.17402/139 [DOI] [Google Scholar]
  • 29.Gruber T. R. (1993). A translational approach to portable ontology specifications. Elsevier, Knowledge Acquisition, 5(2), 199–220, doi: 10.1006/knac.1993.1008 [DOI] [Google Scholar]
  • 30.Maedche A. 2002 Ontology Learning for the Semantic Web. The Springer International Series in Engineering and Computer Science. Dostęp: 30.11.2017. http://www.springer.com/la/book/9780792376569
  • 31.Hilbert D. Fundamentals of geometry. Appendix VIII On infinity. 1st ed. 1948. [Google Scholar]
  • 32.Weyl H. Das Kontinuum. Kritische Untersuchungen über die Grundlagen der Analysis. Leipzig, 1918. [Google Scholar]
  • 33.Kuratowski K A. Mostowski: Teoria mnogości. 2nd ed. PWN, 1966. [Google Scholar]
  • 34.Alpaydin Ethem, Introduction to Machine Learning. MIT; Press, 2004. [Google Scholar]
  • 35.Han Jiawei, Kamber Micheline, Data Mining: Concepts and Techniques second edition, Morgan Kaufmann Publishers, 2006. [Google Scholar]
  • 36.Steven Abney Semisupervised Learning for Computational Linguistics. Chapman and Hall. 2008. [Google Scholar]
  • 37.Manning Christopher, Schutze Hinrich, Foundations of statistical natural language processing, MIT Press, Cambridge, 1999. [Google Scholar]
  • 38.Triantaphyllou E., Felici G.: Data Mining & Knowledge Discovery based in Rule Induction, Springer Science, 2006. [Google Scholar]
  • 39.Olson D., Delen D., Advanced Data Mining Techniques, Springer, 2008. [Google Scholar]
  • 40.Witten Ian H., Eibe Frank: Data Mining Practical Machine Learning Tools and Techniques, Morgan Kaufmann, 2005. [Google Scholar]
  • 41.Farghaly Ali, editor, Handbook for Language Engineers, CSLI Publications, Stanford, 2003. [Google Scholar]
  • 42.Paul Taylor, Text to speech synthesis, Cambridge University Press, 2009. [Google Scholar]
  • 43.Lutz Mark, Ascher David. Learning Python. O’Reilly, second edition, 2003. [Google Scholar]
  • 44.Moore Alan D. Python GUI Programming with Tkinter, O’Reilly, 2018. [Google Scholar]

Decision Letter 0

Sheraz Aslam

24 Oct 2023

PONE-D-23-28679Application of a text mining methods in navigation and communication for enhancing maritime safety.PLOS ONE

Dear Dr. Hatłas-Sowińska,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Dec 08 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Sheraz Aslam

Academic Editor

PLOS ONE

Journal Requirements:

1. When submitting your revision, we need you to address these additional requirements.

Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

Additional Editor Comments:

Please consider the comments given by two reviewers in order to consider it for the next round.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Dear Author(s),

Topic of the article is interesting. However, following comments should be addressed prior to further processing of the article.

1) Refer to whole article: Similarity is very high i.e. 32%

2) Refer to whole article: Too many formatting issues.

3) Refer to whole article: Authors have claimed that auto generated ontology will lead to safe sea communication. What do they think about machine error?

4) Refer to whole article: Language grammars are usually very complicated. Which language is targeted by the authors and how do they address grammar issue which may lead to communication of false signal?

5) Refer to Whole article: Scope of this study seems limited as the proposed model focuses on a single language. What about communication across different languages?

6) Refer to figure 2 & 3: Article is written in English however another language is used in figures.

7) Refer to sub-section 1.2.3: Authors need to include a flow chart for easier understanding of the process to the novice reader.

8) Refer to all figures: Figures quality is very low.

9) Refer to table 2: Content is replicated in table 2 and figure 5. Further, most of the references are very old. No reference from 2023. Authors need to include recent references.

Good luck.

Reviewer #2: The manuscript introduces an intriguing model for the translation of natural language into ontology and vice versa within the context of an autonomous navigation system for sea-going vessels. The authors aim to address the challenges associated with the oral communication of navigational information. The identified drawbacks of oral communication, including semantic decoding issues, polarization of extreme viewpoints, labeling problems, confusion between facts and conclusions, and static judgments, are well-highlighted.The topic is highly interesting and addresses a relevant issue in navigational communication.

Areas for Improvement:

1. The manuscript's formatting is subpar and does not adhere to standard conventions. Clear and organized presentation is crucial for effective communication of research findings.

2. Several important figures lack clarity, and their formats are not in accordance with the required standards. Clear and well-presented visuals are essential for readers to grasp the key concepts and findings.

3. The simulation validation process in the manuscript is deemed too simplistic. In real maritime communication scenarios, numerous pronunciation and technical issues are prevalent, yet these challenges are not adequately reflected in the simulated settings. A more comprehensive simulation that mimics the complexities of actual maritime communication should be considered.

The manuscript requires substantial revision, focusing on improving its formatting, enhancing the clarity of figures, and conducting a more realistic simulation validation. Addressing these issues will significantly enhance the overall quality of the manuscript.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Syed Muhammad Mohsin

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: Review comments - Authors.docx

pone.0299582.s002.docx (19.5KB, docx)
PLoS One. 2024 Mar 22;19(3):e0299582. doi: 10.1371/journal.pone.0299582.r002

Author response to Decision Letter 0


13 Nov 2023

Dear Reviewers,

thank you very much for your thorough substantive assessment

and editorial of our article. Thank you for your positive opinions as well as the critical comments. They constitute important tips to improve the quality of our research work. Below we provide answers to the questions asked and responses to the comments included in the review.

1) Refer to whole article: Similarity is very high i.e. 32%

This article is a continuation of research on automatic communication in maritime transport. The authors continue their original research topic. There is little similarity between the activities in the studies. But this article introduces something new - the text mining method

2) Refer to whole article: Too many formatting issues.

Thank you for this attention. The article has been corrected

3) Refer to whole article: Authors have claimed that auto generated ontology will lead to safe sea communication. What do they think about machine error?

The phrase "machine error" can be understood as communication errors, user errors, calculation errors. The proposed communication model takes into account the occurrence of the mentioned errors. The authors state that machine errors may occur. Occurring in unusual, unpredictable and typical situations. These errors can come in two forms. The first one is – machine and computational errors. The second one is for processing errors: errors regarding data or numerical quantities.

With current research, the explanation for the above errors is non-existent. However, the authors do not deny that this is impossible.

4) Refer to whole article: Language grammars are usually very complicated. Which language is targeted by the authors and how do they address grammar issue which may lead to communication of false signal?

The analysis of maritime case law indicates that in the event of a lack of connection between a voice call and a second call, this was one of the basic charges against the ships that took part in the occurrence. Decision errors may be caused by failure to initiate voice communication, its effects, or misunderstanding of the information conveyed in this way. These errors may be related to stress, which in turn affects the use of mental control and personal safety, self-assessment scores and situational awareness, disturbance of characteristic features, prolongation of decision duration. Errors may also be missing when it comes to using the English language. Disadvantages are determined in an oral form, including: problems with decoding the message at the semantic level, polarization (tendency to express extreme opinions), labeling (noticing problems by naming them, rather than analyzing them), mixing facts and occurrence as well as static assessment ( i.e. lack of verification of opinions regarding changes in elements of reality).

The primary task of navigation is to ensure safe navigation by avoiding dangers during a sea voyage. The goal of establishing direct communication between ships and automating communication processes can reduce wrong decisions and, as a result, wrong actions resulting in maritime accidents. This mainly concerns dangerous situations requiring decisive action to avoid a collision, in particular excessive approach situations (a situation in which avoiding a collision requires concerted action by the navigators of the meeting ships).

Currently, communication at sea is based on conventional requirements regarding ship equipment and crew training. At the level of the international convention, rules for the law of the sea route (COLREGs) have also been established. Detailed communication procedures are specified in the International Radio Regulations issued by the ITU (International Telecommunication Union). However, the communication process itself requires a broad analysis of the situation based on existing procedures, as well as ship equipment and systems.

The MPDM regulations are designed to safely conduct maneuvers, e.g. passing, overtaking, and especially anti-collision maneuvers - without the use of voice and/or radio-electronic communication. Maneuvering and warning signals (light and sound) are permitted for ships that are mutually visible. However, ambiguities and discrepancies in the interpretation of terms such as "safe distance", "early enough", "change course to the right" or the interpretation of the meeting situation (e.g. overtaking or crossing courses), introduced the practice of correspondence, most often by phone, between navigators meeting ships.

The communication process is divided into individual elements: the sender, i.e. the initiator, the recipient and the message. The person initiating communication performs coding, i.e. transforming thoughts into words that he thinks will be understandable to the recipient, thus creating a message. Then comes the communication channel, i.e. the way in which the content is to be conveyed. The message reaches the said recipient. Decoding, i.e. the interpretation of the received information, depends on its perceptual capabilities.

In such a situation, a problem arises - the communication process understood in the field of maritime transport must be precisely defined. Its main task is to convey the message from the sender in such a way that it is clearly understood by the recipient. A large amount of information and the diversity of its types and scopes result in the need to process, integrate and select it. And most importantly – a clear form and interpretation. An example of ambiguity that a navigator may understand in different ways is: heave up the line/send heaving line.

The article presents sample dialogues in scenarios and the recording of messages in ontological notation - in English. The reason is that English is considered the basic language in maritime navigation.

There are current communication system procedures recommended by the ITU (International Telecommunications Union), but they are complicated and difficult to assimilate by operators.

The current state of knowledge confirms that all previous research in the field of maritime transport does not indicate a clear and working communication system based on ontology.

The use of ontology and the beginning of operating on semantic models contributes to obtaining new possibilities related to the collection and processing of information.

5) Refer to Whole article: Scope of this study seems limited as the proposed model focuses on a single language. What about communication across different languages?

Due to maritime law, which requires communication in English, the authors deal only with this language.

6) Refer to figure 2 & 3: Article is written in English however another language is used in figures.

Thank you for this attention. The drawings have been corrected

7) Refer to sub-section 1.2.3: Authors need to include a flow chart for easier understanding of the process to the novice reader.

Thank you for your attention. We added information about the process in the loop method

8) Refer to all figures: Figures quality is very low.

Thank you for your attention. The drawings have been corrected

9) Refer to table 2: Content is replicated in table 2 and figure 5. Further, most of the references are very old. No reference from 2023. Authors need to include recent references.

Thank you for your attention. The drawings and tabels have been corrected

The authors use the Statistical Yearbook of Maritime Economy 2022.

The provided data was included in the article.

In January 20234 - data from 2022 will be available.

This chapter contains information on accidents at sea to ships of Polish nationality, and marine salvage. 2. Data on accidents at sea cover ships of either Polish or non-Polish nationality when the incident or accident at sea occurred in Polish internal waters or Polish territorial sea. Passenger ro-ro ferries or highspeed passenger ships are also included when the incident or accident occurred outside the internal waters or territorial sea of the EU member state provided that the last port of call of that ship was a Polish Republic's seaport. In addition these statistics cover ships of gross tonnage below 50, i.e. fishing boats, yachts or tugs

Attachment

Submitted filename: Response to Reviewers.docx

pone.0299582.s003.docx (766.1KB, docx)

Decision Letter 1

Sheraz Aslam

5 Jan 2024

PONE-D-23-28679R1Zastosowanie metod eksploracji tekstu w nawigacji i łączności dla poprawy bezpieczeństwa morskiego.PLOS ONE

Dear Dr. Hatłas-Sowińska,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================Please revise empirical study as suggested by Reviewer 2 and resubmit the revised copy.

==============================

Please submit your revised manuscript by Feb 19 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Sheraz Aslam

Academic Editor

PLOS ONE

Additional Editor Comments (if provided):

Since Reviewer 2 is not satisfied related to empirical study,

"I still have reservations regarding the empirical validation of the method of this study".

Please consider empirical validation again.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: I Don't Know

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Dear Author(s),

My comments are satisfactorily addressed in the revised version. I have no more comments.

Good luck.

Reviewer #2: Although the revised manuscript has addressed most of the questions raised in the initial review and made appropriate adjustments to the text. However, I still have reservations regarding the empirical validation of the method of this study. While authors have validated your method in three specific encounter situations, I believe that this may not sufficiently demonstrate its applicability and effectiveness across a broader range of real-world scenarios.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Syed Muhammad Mohsin

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: Review comments - Authors.docx

pone.0299582.s004.docx (18.4KB, docx)
PLoS One. 2024 Mar 22;19(3):e0299582. doi: 10.1371/journal.pone.0299582.r004

Author response to Decision Letter 1


26 Jan 2024

Dear Reviewer,

Thank you very much for your further comments to our article. To ensure the quality of our research work is as good as possible, we provide the answer to the question below.

Data validation is a solution that uses research with the efficiency of empirical data. We try to ensure that the data of the congregations in our research are thorough and detailed. Validation is important because if it is false or incomplete, data may be exposed and incorrect decisions may be made.

Proposed models for controlling commands from the captain's bridge to the autonomous system and automatically launched using empirical data from the presented test scenarios. Due to the operation of the first model - the article as a concept added to semi-autonomous maritime communication.

The target models will be tested based on a significant data set, where:

- for the first model, the input data will be a set of sequences of events in the ontology, while the output will be sentences in a natural language;

- for the second model, the input will be commands in a natural language, and the output will be a sequence of events in the ontology.

The test method will use developed models for each pair of elements from the set, where the input to the model will be input data, and the generated output data from the models will be compared with the collected output data (correct data). In this way, it will be possible to compare the correctness of the results and determine the accuracy coefficients of the method. Work on the proposed models is aimed at achieving the highest possible accuracy based on the collected data set.

Data analysis software is an indispensable tool in empirical research. The authors plan to conduct advanced statistical analyses, which will enable a better understanding of the collected data and drawing accurate conclusions. This is the direction of further research work by the authors.

Attachment

Submitted filename: Response to Reviewers.docx

pone.0299582.s005.docx (14KB, docx)

Decision Letter 2

Sheraz Aslam

13 Feb 2024

Application of a text mining methods in navigation and communication for enhancing maritime safety

PONE-D-23-28679R2

Dear Dr. Hatłas-Sowińska,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Sheraz Aslam

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Sheraz Aslam

11 Mar 2024

PONE-D-23-28679R2

PLOS ONE

Dear Dr. Hatłas-Sowińska,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Sheraz Aslam

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Datasets

    (ZIP)

    pone.0299582.s001.zip (1.7KB, zip)
    Attachment

    Submitted filename: Review comments - Authors.docx

    pone.0299582.s002.docx (19.5KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0299582.s003.docx (766.1KB, docx)
    Attachment

    Submitted filename: Review comments - Authors.docx

    pone.0299582.s004.docx (18.4KB, docx)
    Attachment

    Submitted filename: Response to Reviewers.docx

    pone.0299582.s005.docx (14KB, docx)

    Data Availability Statement

    Hatłas-Sowińska Paulina, Misztal Leszek (2023). Data Set of ontology and communication scenario for method of enhancing maritime safety. SEANOE. https://doi.org/10.17882/97041.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES