Spellcheck should be HTML/XML-aware
Short, concise description of the idea
Spellcheck should not complain about HTML or XML tags, and should complain about invalid markup.
Full description of the idea
I'm tired of entering HTML tags in the Post and Comment pages, selecting spellcheck, and seeing it claim that "href" is spelled wrong. Spellcheck should skip over valid HTML/XML tag that are accepted by the HTML cleaner.
Conversely, "spellcheck" should be enhanced to recognize invalid HTML/XML, e.g., missing/mismatched closing tags, missing angle-brackets, etc., and warn about it.
- Fewer bogus warnings from the spellchecker.
- More helpful warning from the spellchecker.
- Less need to use both spellcheck and preview separately.
An ordered list of problems/issues involved
- Sometimes people enter invalid HTML on purpose, e.g., inside PRE tags, comments, etc.
- Need to rationalize with the rich text editor.
- If invalid HTML is caught, should "spellcheck" be renamed?
- Text inside tags might be missed, e.g., the ALT text on IMG tags.
An organized list, or a few short paragraphs detailing suggestions for implementation
- Need to have the HTML cleaner indicate what parts of the input are valid tags, what parts are invalid tags, and what parts are "other".