
Security News
Risky Biz Podcast: Making Reachability Analysis Work in Real-World Codebases
This episode explores the hard problem of reachability analysis, from static analysis limits to handling dynamic languages and massive dependency trees.
nlcst-emoji-modifier
Advanced tools
Classify unicode emoji and Github emoji (gemoji) as EmoticonNode
s.
Implemented by retext-emoji, but separated for use by standalone (non-retext) parsers.
Note: this project is useful in combination with natural language parsers like parse-latin, parse-dutch, and parse-english.
npm:
$ npm install nlcst-emoji-modifier
Component.js:
$ component install wooorm/nlcst-emoji-modifier
Bower:
$ bower install nlcst-emoji-modifier
var modifier = require('nlcst-emoji-modifier');
var ParseEnglish = require('parse-english');
var english = new ParseEnglish();
/* Attach the modifier. */
modifier(english);
english.parse('Who doesn’t like Gemoji? :+1: You? 💩').children[0].children;
Yields:
[
{
"type": "SentenceNode",
"children": [
{
"type": "WordNode",
"children": [
{
"type": "TextNode",
"value": "Who"
}
]
},
{
"type": "WhiteSpaceNode",
"value": " "
},
{
"type": "WordNode",
"children": [
{
"type": "TextNode",
"value": "doesn"
},
{
"type": "PunctuationNode",
"value": "’"
},
{
"type": "TextNode",
"value": "t"
}
]
},
{
"type": "WhiteSpaceNode",
"value": " "
},
{
"type": "WordNode",
"children": [
{
"type": "TextNode",
"value": "like"
}
]
},
{
"type": "WhiteSpaceNode",
"value": " "
},
{
"type": "WordNode",
"children": [
{
"type": "TextNode",
"value": "Gemoji"
}
]
},
{
"type": "PunctuationNode",
"value": "?"
},
{
"type": "WhiteSpaceNode",
"value": " "
},
{
"type": "EmoticonNode",
"value": ":+1:"
}
]
},
{
"type": "WhiteSpaceNode",
"value": " "
},
{
"type": "SentenceNode",
"children": [
{
"type": "WordNode",
"children": [
{
"type": "TextNode",
"value": "You"
}
]
},
{
"type": "PunctuationNode",
"value": "?"
},
{
"type": "WhiteSpaceNode",
"value": " "
},
{
"type": "EmoticonNode",
"value": "💩"
}
]
}
]
On a MacBook Air, parse-english performs about 27% slower on content filled with (g)emoji, and a 18% slower on content without (g)emoji, when using this modifier.
parse w/ modifier
1,303 op/s » A paragraph (5 sentences, 100 words, 5 emoji, 5 gemoji)
1,653 op/s » A paragraph (5 sentences, 100 words, no emoji, no gemoji)
parse w/o modifier
1,784 op/s » A paragraph (5 sentences, 100 words, 5 emoji, 5 gemoji)
2,038 op/s » A paragraph (5 sentences, 100 words, no emoji, no gemoji)
MIT © Titus Wormer
FAQs
nlcst utility to support emoji
The npm package nlcst-emoji-modifier receives a total of 10,886 weekly downloads. As such, nlcst-emoji-modifier popularity was classified as popular.
We found that nlcst-emoji-modifier demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 2 open source maintainers collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
This episode explores the hard problem of reachability analysis, from static analysis limits to handling dynamic languages and massive dependency trees.
Security News
/Research
Malicious Nx npm versions stole secrets and wallet info using AI CLI tools; Socket’s AI scanner detected the supply chain attack and flagged the malware.
Security News
CISA’s 2025 draft SBOM guidance adds new fields like hashes, licenses, and tool metadata to make software inventories more actionable.