
Security News
Attackers Are Hunting High-Impact Node.js Maintainers in a Coordinated Social Engineering Campaign
Multiple high-impact npm maintainers confirm they have been targeted in the same social engineering campaign that compromised Axios.
node-pptx-parser
Advanced tools
A PowerPoint (PPTX) parser that extracts text content with preserved formatting
A Node.js library for parsing PowerPoint (PPTX) files and extracting text content. This library maintains text formatting, line breaks, and paragraph structures from the original presentation.
Extract text content from PPTX files with preserved formatting
Parse PPTX structure into manageable JavaScript objects
Access raw XML content of presentation components
Written in TypeScript for type safety
Promise-based API
Preserves line breaks and paragraph formatting
Minimal dependencies
npm install node-pptx-parser
Once the package is installed you can you it with import or require statements like this:
// ESM import:
import PptxParser from "node-pptx-parser";
// CommonJs require:
const PptxParser = require("node-pptx-parser").default;
import PptxParser from "node-pptx-parser";
async function main() {
const parser = new PptxParser("presentation.pptx");
try {
// Extract text from all slides
const textContent = await parser.extractText();
// Print text from each slide
textContent.forEach((slide) => {
console.log(`\nSlide ${slide.id}:`);
console.log(slide.text.join("\n"));
});
} catch (error) {
console.error("Error:", error.message);
}
}
main();
import PptxParser from "node-pptx-parser";
async function main() {
const parser = new PptxParser("presentation.pptx");
try {
// Get complete parsed presentation content
const parsedContent = await parser.parse();
// Access presentation structure
console.log(parsedContent.presentation.parsed);
// Access individual slides
parsedContent.slides.forEach((slide) => {
console.log(`Slide ${slide.id}:`, slide.parsed);
});
// Access raw XML if needed
console.log(parsedContent.presentation.xml);
} catch (error) {
console.error("Error:", error.message);
}
}
main();
PptxParserThe main class for parsing PPTX files.
constructor(filePath: string)
Creates a new instance of PptxParser.
filePath: Path to the PPTX file to be parsedparse()
async parse(): Promise<ParsedPresentation>
Parses the entire PPTX file and returns its content.
ParsedPresentation object containing the complete presentation structureextractText()
async extractText(): Promise<SlideTextContent[]>
Extracts formatted text content from all slides.
SlideTextContent objectsParsedPresentationinterface ParsedPresentation {
presentation: {
path: string;
xml: string;
parsed: any;
};
relationships: {
path: string;
xml: string;
parsed: any;
};
slides: ParsedSlide[];
}
ParsedSlideinterface ParsedSlide {
id: string;
path: string;
xml: string;
parsed: any;
}
SlideTextContentinterface SlideTextContent extends ParsedSlide {
text: string[];
}
The library throws errors in the following cases:
Invalid PPTX file structure
File reading errors
XML parsing errors
Example error handling:
try {
const parser = new PptxParser("presentation.ppt");
const content = await parser.extractText();
} catch (error) {
if (error.message.includes("Invalid PPTX file structure")) {
console.error("The PPTX file is corrupted or invalid");
} else {
console.error("An error occurred:", error.message);
}
}
MIT
Contributions are welcome! Please feel free to submit a Pull Request.
FAQs
A PowerPoint (PPTX) parser that extracts text content with preserved formatting
The npm package node-pptx-parser receives a total of 6,470 weekly downloads. As such, node-pptx-parser popularity was classified as popular.
We found that node-pptx-parser demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
Multiple high-impact npm maintainers confirm they have been targeted in the same social engineering campaign that compromised Axios.

Security News
Axios compromise traced to social engineering, showing how attacks on maintainers can bypass controls and expose the broader software supply chain.

Security News
Node.js has paused its bug bounty program after funding ended, removing payouts for vulnerability reports but keeping its security process unchanged.