[NCLUG] looking for patterns in text

Sean Reifschneider jafo at tummy.com
Tue Sep 10 13:26:20 MDT 2013


On 09/09/2013 04:10 PM, Mike Cullerton wrote:
> They receive feedback from the public during engineering
> projects. Some of this feedback is original. Some is copy/pasted from
> form letter boilerplate. They'd like to parse the feedback text and
> sort it based on how similar it is to the boilerplate.

Would "dwdiff" from the command line do it?  It compares word by word
rather than line by line, so re-wrapped copy/paste shouldn't throw it
off.  I know you want to teach Python and all, but it might be worth
checking...
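
If it does end up being Python, here is a minimal sketch of the same
word-level comparison, assuming the standard library's difflib is good
enough for the job.  The function names and the sample strings are made
up for illustration; they are not from the actual project.

```python
from difflib import SequenceMatcher


def similarity(text, boilerplate):
    """Return a 0.0-1.0 word-level similarity ratio between two strings."""
    # Compare lists of words rather than characters, so re-wrapped lines
    # and extra whitespace do not change the score.
    # autojunk=False turns off difflib's heuristic that ignores very
    # common items in long sequences.
    matcher = SequenceMatcher(None, text.split(), boilerplate.split(),
                              autojunk=False)
    return matcher.ratio()


def sort_by_similarity(comments, boilerplate):
    """Return (score, comment) pairs, most boilerplate-like first."""
    scored = [(similarity(c, boilerplate), c) for c in comments]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    boilerplate = "I oppose the project because it will harm the river."
    comments = [
        "My family has fished that stretch for forty years.",
        "I oppose the project because it will harm the river. Please stop.",
    ]
    for score, comment in sort_by_similarity(comments, boilerplate):
        print(f"{score:.2f}  {comment}")
```

Scores near 1.0 mean mostly copy/paste, scores near 0.0 mean mostly
original.  Where to draw the line between the two is a judgment call
for whoever reads the feedback.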

Sean

