post #1 of 1
Thread Starter 
hi, i need some help finding a software that can cross check every line in a text file for similar lines. i don't mean an exact copy of the line,but very similar.

i basically want to check if 2 lines have more then X amount words in common. i want it to be able to ignore simple grammar mistakes like missing commas or apostrophes, capitals, punctuation ect...


here is an example of 2 lines that should be tagged as duplicates:

"Did you know that the Basenji is the only dog in the world which does not bark?"

"The Basenji is the only dog which does not bark."

they software would be similar to "dupli find" but obviously able to do what i said.
i7
(14 items)
 
  
CPUMotherboardGraphicsRAM
I7 3770k Sabertooth Z77 MSI 290X lightning 24gb ddr 1866 8-8-8-24 Crucial Ballistix elite 
Hard DriveHard DriveHard DriveOS
Intel 330 SSD WD black caviar 500gb Seagate barrcuda Windows 7 64bit 
MonitorKeyboardPowerCase
46inch Samsung lcd Razer lycosa Antec Truepower quattro 1000 NZXT Phantom (White) 
MouseAudio
Razer deathadder Samsung 5.1 surround sound. 
  hide details  
Reply
i7
(14 items)
 
  
CPUMotherboardGraphicsRAM
I7 3770k Sabertooth Z77 MSI 290X lightning 24gb ddr 1866 8-8-8-24 Crucial Ballistix elite 
Hard DriveHard DriveHard DriveOS
Intel 330 SSD WD black caviar 500gb Seagate barrcuda Windows 7 64bit 
MonitorKeyboardPowerCase
46inch Samsung lcd Razer lycosa Antec Truepower quattro 1000 NZXT Phantom (White) 
MouseAudio
Razer deathadder Samsung 5.1 surround sound. 
  hide details  
Reply