datascav-switch

Modules to convert different types of files using AI based validations and conversions.

Version: 1.0.3
Ecosystem: pip (PyPI)
Maintainers: 1

datascav-switch

Python · LangChain · OpenAI · License: MIT

datascav-switch is a Python package for intelligent document format conversion, leveraging generative AI (OpenAI) and a scalable architecture. This project is part of a suite of tools for automation, data extraction, and transformation.

Main Features

  • PDF to Markdown conversion with layout preservation
  • Support for multiple input formats (file, URL, base64, bytes)
  • Parallel processing and dynamic logging
  • Detailed token tracking
  • Native integration with LangChain and tracing via LangSmith
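
The four input forms listed above (file, URL, base64, bytes) can all be reduced to raw bytes before conversion. The sketch below uses a hypothetical helper, `load_pdf_bytes`, written only to illustrate that idea. It is not part of datascav-switch, and how the package itself detects input types is an open question this README does not answer.

```python
import base64
from pathlib import Path
from urllib.request import urlopen


def load_pdf_bytes(source):
    """Hypothetical helper: reduce a path, URL, base64 string, or bytes to raw bytes."""
    # Raw bytes pass straight through.
    if isinstance(source, (bytes, bytearray)):
        return bytes(source)
    if isinstance(source, Path):
        return source.read_bytes()
    if not isinstance(source, str):
        raise TypeError(f"unsupported input type: {type(source).__name__}")
    # Strings that look like web addresses are fetched over HTTP(S).
    if source.startswith(("http://", "https://")):
        with urlopen(source, timeout=30) as response:
            return response.read()
    # Next, try the string as a filesystem path.
    try:
        if Path(source).is_file():
            return Path(source).read_bytes()
    except (OSError, ValueError):
        pass  # for example, a string too long to be a valid path
    # Finally, treat the string as base64 and reject anything else.
    try:
        return base64.b64decode(source, validate=True)
    except ValueError as exc:
        raise ValueError("input is not a file, URL, or valid base64") from exc
```

The order of the checks is a design choice: a short base64 string could in principle match a file name, so a real implementation might ask the caller to state the input type explicitly instead of guessing.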

Installation

pip install datascav-switch

Requirements:

  • Python 3.10+
  • OpenAI API key (OPENAI_API_KEY)
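
Both requirements can be checked before running a conversion. The following is a minimal sketch, assuming the key is supplied through an environment variable named `OPENAI_API_KEY`; the README names the key but does not say how the package reads it. The function `check_requirements` is a hypothetical helper, not part of the package.

```python
import os
import sys


def check_requirements(environ=None, version_info=None):
    """Return a list of unmet requirements; an empty list means all are met."""
    environ = os.environ if environ is None else environ
    version_info = sys.version_info if version_info is None else version_info
    problems = []
    # The README asks for Python 3.10 or newer.
    if tuple(version_info[:2]) < (3, 10):
        problems.append(
            f"Python 3.10+ required, found {version_info[0]}.{version_info[1]}"
        )
    # Assumption: the API key is read from the environment.
    if not environ.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is not set")
    return problems


if __name__ == "__main__":
    for problem in check_requirements():
        print(f"requirement not met: {problem}")
```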

Quick Start

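from scav_switch.converters.pdf import ScavToMarkdown

# Needs an OpenAI API key (OPENAI_API_KEY), as listed under Requirements.
scav = ScavToMarkdown(model='gpt-4.1', verbose=True)

# Convert a PDF to Markdown; replace the placeholder path with a real file.
markdown = scav.dig('/path/to/file.pdf')
print(markdown)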
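
To convert several PDFs at once, the Quick Start call can be wrapped in a thread pool. This is a rough sketch under stated assumptions, not the package's own parallel processing: `convert_all` is a hypothetical helper, the `convert` argument is any callable that takes a path string and returns Markdown text (for example `scav.dig`, if it returns a string as the Quick Start suggests), and it is assumed but not confirmed that such a callable is safe to use from multiple threads.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def convert_all(convert, pdf_paths, out_dir, max_workers=4):
    """Run convert(path) for each PDF and write each result to out_dir/<name>.md."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = [Path(p) for p in pdf_paths]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the inputs.
        results = list(pool.map(lambda p: convert(str(p)), paths))
    written = []
    for path, markdown in zip(paths, results):
        target = out_dir / f"{path.stem}.md"
        target.write_text(markdown, encoding="utf-8")
        written.append(target)
    return written
```

With the `scav` object from the Quick Start, a call might look like `convert_all(scav.dig, ["a.pdf", "b.pdf"], "out")`. Each conversion presumably makes API requests, so a small `max_workers` value is a cautious default.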

For complete examples and detailed documentation, see the docs/ folder and the notebooks for each module.

Documentation

License

MIT
