unicoderegexp
Various regular expressions for unicode character classes (letter,
punctuation, number, etc.) and helper functions for composing them.
Used by the purify library.
The module exports a bunch of useful RegExps each with a single character class in them:
letter
mark
number
punctuation
symbol
separator
other
visible
printable
unicodeRegExp.visible.test("a");
unicodeRegExp.visible.test(" ");
unicodeRegExp.visible.test("\u00a0");
To validate an entire string you need to build a new RegExp:
var visibleStringRegExp = new RegExp('^' + unicodeRegExp.visible.source + '*$');
visibleStringRegExp.test("foobar");
visibleStringRegExp.test("foo bar");
unicodeRegExp.removeCharacterFromCharacterClassRegExp(/[æøå]/, 'æ');
unicodeRegExp.spliceCharacterClassRegExps(/[a-b]/, /[c-d]/);
The info about which characters belong to which classes was taken from the
XRegExp library and its Unicode plugin.