Lots of data will resist the charms of IndoEuropeanSentenceModel, so this recipe provides a starting point for modifying sentence detection to handle new kinds of sentences. Unfortunately, this is a very open-ended area of system building, so we will focus on techniques rather than likely formats for sentences.
This recipe will follow a well-worn pattern: create evaluation data, set up evaluation, and start hacking. Here we go:
Mark up some data using the [ and ] markup approach. The following is an example that runs afoul of our standard IndoEuropeanSentenceModel:
[All decent people live beyond their incomes nowadays, and those who aren't respectable live beyond ...
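Before any evaluation can run, the bracketed data has to be turned into raw text plus the expected sentence offsets. The following is a minimal sketch of that step in plain Java. The BracketMarkup class and its parse method are hypothetical helpers made up for illustration, not part of LingPipe, and the sample input in main is invented. The sketch assumes that [ and ] never occur as literal characters in the text and that sentences are not nested.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of LingPipe: reads "[sentence] [sentence]" markup.
class BracketMarkup {
    final String text;        // the input with all brackets removed
    final List<int[]> spans;  // {start, end} offsets into text, end exclusive

    BracketMarkup(String text, List<int[]> spans) {
        this.text = text;
        this.spans = spans;
    }

    // Strips the brackets and records where each marked sentence starts and ends.
    static BracketMarkup parse(String marked) {
        StringBuilder raw = new StringBuilder();
        List<int[]> spans = new ArrayList<>();
        int start = -1; // offset of the currently open sentence, or -1 if none
        for (int i = 0; i < marked.length(); i++) {
            char c = marked.charAt(i);
            if (c == '[') {
                if (start >= 0) {
                    throw new IllegalArgumentException("nested [ at " + i);
                }
                start = raw.length();
            } else if (c == ']') {
                if (start < 0) {
                    throw new IllegalArgumentException("unmatched ] at " + i);
                }
                spans.add(new int[] {start, raw.length()});
                start = -1;
            } else {
                raw.append(c); // ordinary character, kept in the raw text
            }
        }
        if (start >= 0) {
            throw new IllegalArgumentException("unclosed [");
        }
        return new BracketMarkup(raw.toString(), spans);
    }

    public static void main(String[] args) {
        // Invented sample input, only to show the shape of the result.
        BracketMarkup gold = parse("[First sentence.] [Second one.]");
        for (int[] s : gold.spans) {
            System.out.println(s[0] + "-" + s[1] + ": "
                + gold.text.substring(s[0], s[1]));
        }
    }
}
```

The recovered text is what would be handed to the sentence detector, and the recorded offsets are the reference boundaries its output gets compared against. Whitespace between sentences stays in the raw text but belongs to no span.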