This is a great suggestion. My Father who was a teacher for 35 years suggested proofreading by starting with the last sentence and working your way backwards. This works because your brain can't ...
https://blog.afterthedeadline.com/2013/08/16/read-aloud/#comment-4068
A good way to hear it read is to load the document on a Kindle and select the "text to speech" from the menu. This will help much better because reading it yourself allows you to read in words th...
https://blog.afterthedeadline.com/2013/08/16/read-aloud/#comment-4053
In reply to rsmudge. Hmm, it worked fine on a different version of wikipedia...
In reply to Aly. File not found eh... make sure you have enough disk space a...
I failed with your script on step 3. Any ideas what's wrong? Thanks! Traceback (most recent call last): File "./xmldump2files.py", line 93, in xml.sax.parse(sys.argv, WikiPageSplitter(sys.argv)) ...
In reply to yhj. And so I did. Thanks for pointing this out. I've corrected ...
thanks for your post. it is very useful for me. there is just a small issue on the last step. i think you have missed a sleep.jar after "-jar" : )
In reply to rsmudge. I agree more isn't better, so currently I am doing r...
In reply to tszming. I maintain the set by hand. There are other sets (fo...
How do you generate the "confusion set" and keep it updated? To me, this seems to be not a big set, e.g. bar, bra not appear in the file.