Automatic String Replace by Examples

Type:

Conf

Authors:

Alberto Bartoli, Andrea De Lorenzo, Eric Medvet

In:

ACM Genetic and Evolutionary Computation Conference (GECCO), held in Amsterdam (Netherlands)

Year:

2013

Links and material:

Abstract #

Search-and-replace is a text processing task which may be largely automated with regular expressions: the user must describe with a specific formal language the regions to be modified (search pattern) and the corresponding desired changes (replacement expression). Writing and tuning the required expressions requires high familiarity with the corresponding formalism and is typically a lengthy, error-prone process.In this paper we propose a tool based on genetic programming (GP) for generating automatically both the search pattern and the replacement expression based only on examples. The user merely provides examples of the input text along with the desired output text and does not need any knowledge about the regular expression formalism nor about GP. We are not aware of any similar proposal. We experimentally evaluated our proposal on 4 different search-and-replace tasks operating on real-world datasets and found good results, which suggests that the approach may indeed be practically viable.