How practical is it? Machine Learning for Identifying Conceptual Interoperability Constraints in API Documents

12/05/2018
by   Hadil Abukwaik, et al.
0

Building meaningful interoperation with external software units requires performing the conceptual interoperability analysis that starts with identifying the conceptual interoperability constraints of each software unit, then it compares the systems' constraints to detect their conceptual mismatch. We call the conceptual interoperability constraints (the COINs) that can be of different types including structure, dynamic, and quality. Missing such constraints may lead to unexpected mismatches, expensive resolution, and running-late projects. However, it is a challenging task for software architects and analysts to manually analyze the unstructured text in API documents to identify the COINs. Not only it is a tedious and time-consuming task, but also it needs knowledge about the constraint types. In this article, we present and evaluate our idea of utilizing machine learning techniques in automating the COIN identification, which is the first step of conceptual interoperability analysis, from human text in API documents. Our empirical research started with a multiple-case study to build the ground truth dataset, on which we contributed our machine learning COIN-Classification Model. We show the model's robustness through experiments using different machine learning text-classification algorithms. The experiments' results revealed that our model can achieve up to 87 text. Thus, we implemented a tool that embeds our model to demonstrate its practical value in industrial context. Then, we evaluated the practitioners' acceptance for the tool and found that they significantly agreed on its usefulness and ease of use.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/23/2019

On Using Machine Learning to Identify Knowledge in API Reference Documentation

Using API reference documentation like JavaDoc is an integral part of so...
research
06/27/2021

Pairing Conceptual Modeling with Machine Learning

Both conceptual modeling and machine learning have long been recognized ...
research
11/13/2017

Detecting Near Duplicates in Software Documentation

Contemporary software documentation is as complicated as the software it...
research
01/20/2023

Transforming Unstructured Text into Data with Context Rule Assisted Machine Learning (CRAML)

We describe a method and new no-code software tools enabling domain expe...
research
06/06/2021

Identifying Populist Paragraphs in Text: A machine-learning approach

Abstract: In this paper we present an approach to develop a text-classif...
research
07/10/2023

A Semi-Automated Solution Approach Selection Tool for Any Use Case via Scopus and OpenAI: a Case Study for AI/ML in Oncology

In today's vast literature landscape, a manual review is very time-consu...
research
10/20/2020

Extracting Procedural Knowledge from Technical Documents

Procedures are an important knowledge component of documents that can be...

Please sign up or login with your details

Forgot password? Click here to reset