National Repository of Grey Literature
Tokenization-aware Diff and Patch
Antoniewicz, Arkadiusz Martin ; Kratochvíl, Miroslav (advisor) ; Horký, Vojtěch (referee)
File comparison algorithms and the utilities 'diff', 'patch' and 'diff3' are widely used in programming for code comparison and in many version control systems. Despite their usefulness, the differences and patches produced by these tools are strictly line-oriented, which complicates the processing of differently formatted data, such as free-flowing text, markup, and various other formats where line breaks are not crucial. This thesis describes and implements a customizable version of these tools that allows the user to specify an arbitrary tokenization of the input, thus allowing easy diffing, patching and change-merging of content not supported by the traditional diff. Additionally, the thesis describes a newly arising challenge of managing the whitespace in the patches, and demonstrates the functionality on a practical use case that cannot be performed with the current diff utilities.
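To illustrate the general idea of diffing over user-defined tokens rather than lines, here is a minimal Python sketch. It is not the thesis's implementation: the function names `tokenize` and `token_diff`, the default regular expression, and the use of the standard library's `difflib.SequenceMatcher` are all assumptions made for illustration. Whitespace runs are kept as tokens of their own, which is one simple way to make the whitespace handling mentioned in the abstract explicit.

```python
import re
from difflib import SequenceMatcher

# Hypothetical default tokenization: whitespace runs, words, and single
# punctuation characters. Every character belongs to exactly one token,
# so concatenating the tokens reproduces the input text.
DEFAULT_PATTERN = r"\s+|\w+|[^\w\s]"


def tokenize(text, pattern=DEFAULT_PATTERN):
    """Split text into tokens according to a user-supplied regex."""
    return re.findall(pattern, text)


def token_diff(old, new, pattern=DEFAULT_PATTERN):
    """Return a list of (op, text) pairs, where op is '=', '-' or '+'.

    '=' marks text common to both inputs, '-' text only in `old`,
    '+' text only in `new`. The comparison runs over tokens, not lines.
    """
    a, b = tokenize(old, pattern), tokenize(new, pattern)
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    ops = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            ops.append(("=", "".join(a[i1:i2])))
            continue
        if i1 < i2:  # tokens removed from the old text
            ops.append(("-", "".join(a[i1:i2])))
        if j1 < j2:  # tokens added in the new text
            ops.append(("+", "".join(b[j1:j2])))
    return ops


if __name__ == "__main__":
    old = "The quick brown fox\njumps over the dog."
    new = "The quick red fox jumps\nover the dog."
    for op, text in token_diff(old, new):
        print(op, repr(text))
```

With a tokenization like this, re-wrapping a paragraph shows up only as changes to whitespace tokens, while a line-oriented diff would report every re-wrapped line as changed. Passing a different `pattern` (for example, one that treats markup tags as single tokens) changes the granularity of the result.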
