Proposal / Submission Type

Peer Reviewed Paper

Location

St. Paul, Minnesota

Start Date

20-5-2010 11:00 AM

Abstract

There are several different methods of comparing source code from different programs to find copying1 . Perhaps the most common method is comparing source code statements, comments, strings, identifiers, and instruction sequences. However, there are anecdotes about the use of whitespace patterns in code. These virtually invisible patterns of spaces and tabs have been used in litigation to imply copying, but no formal study has been performed that shows that these patterns can actually identify copied code. This paper presents a detailed study of whitespace patterns and the uniqueness of these patterns in different programs.

Keywords: Copyright Infringement, Intellectual Property, Litigation, Open Source, Plagiarism, Source Code, Source Code Similarity, Whitespace.

 
May 20th, 11:00 AM

Measuring Whitespace Patterns as an Indication of Plagiarism

St. Paul, Minnesota

There are several different methods of comparing source code from different programs to find copying1 . Perhaps the most common method is comparing source code statements, comments, strings, identifiers, and instruction sequences. However, there are anecdotes about the use of whitespace patterns in code. These virtually invisible patterns of spaces and tabs have been used in litigation to imply copying, but no formal study has been performed that shows that these patterns can actually identify copied code. This paper presents a detailed study of whitespace patterns and the uniqueness of these patterns in different programs.

Keywords: Copyright Infringement, Intellectual Property, Litigation, Open Source, Plagiarism, Source Code, Source Code Similarity, Whitespace.