![Create React App Officially Deprecated Amid React 19 Compatibility Issues](https://cdn.sanity.io/images/cgdhsj6q/production/04fa08cf844d798abc0e1a6391c129363cc7e2ab-1024x1024.webp?w=400&fit=max&auto=format)
Security News
Create React App Officially Deprecated Amid React 19 Compatibility Issues
Create React App is officially deprecated due to React 19 issues and lack of maintenance—developers should switch to Vite or other modern alternatives.
A class to handle and process multiple files with identical structures within a directory.
Reading Various File Formats: Suporta leitura de arquivos CSV, JSON, Parquet e Excel.
Directory and ZIP File Handling: Capacidade de ler dados de diretórios e arquivos ZIP, além de bytes e objetos zipfile.ZipFile
.
Data Joining: União de DataFrames que possuem colunas semelhantes.
Deduplication: Remoção de duplicatas com base em colunas específicas.
Custom Filters: Aplicação de filtros personalizados aos DataFrames.
Logging: Registro detalhado das operações de leitura e manipulação de dados.
pip install polars loguru
from data_harvest_reader import DataReader
data_reader = DataReader(log_to_file=True, log_file="data_reader.log")
data = data_reader.read_data('path/to/directory', join_similar=True)
data = data_reader.read_data('path/to/zipfile.zip', join_similar=False)
with open('path/to/zipfile.zip', 'rb') as f:
zip_bytes = f.read()
data = data_reader.read_data(zip_bytes, join_similar=False)
with zipfile.ZipFile('path/to/zipfile.zip', 'r') as zip_file:
data = data_reader.read_data(zip_file, join_similar=False)
duplicated_subset_dict = {'file1': ['column1', 'column2']}
data = data_reader.read_data('path/to/source', duplicated_subset_dict=duplicated_subset_dict)
filter_subset = {
'file1': [{'column': 'Col1', 'operation': '>', 'values': 100},
{'column': 'Col2', 'operation': '==', 'values': 'Value'}]
}
data = data_reader.read_data('path/to/source', filter_subset=filter_subset)
try:
data = data_reader.read_data('path/to/source')
except UnsupportedFormatError:
print("Unsupported file format provided")
except FilterConfigurationError:
print("Error in filter configuration")
data_reader = DataReader()
data = data_reader.read_data(r'C:\path o\data', join_similar=True,
filter_subset={'example_file': [{'column': 'Age', 'operation': '>', 'values': 30}]})
Fork the Repository: Start by forking the main repository. This creates your own copy of the project where you can make changes.
Clone the Forked Repository: Clone your fork to your local machine. This step allows you to work on the codebase directly.
Set Up the Development Environment: Ensure you have all necessary dependencies installed. It's recommended to use a virtual environment.
Create a New Branch: Always create a new branch for your changes. This keeps the main branch stable and makes reviewing changes easier.
Make Your Changes: Implement your feature, fix a bug, or make your proposed changes. Ensure your code adheres to the project's coding standards and guidelines.
Test Your Changes: Before submitting, test your changes thoroughly. Write unit tests if applicable, and ensure all existing tests pass.
Document Your Changes: Update the documentation to reflect your changes. If you're adding a new feature, include usage examples.
Commit Your Changes: Make concise and clear commit messages, describing what each commit does.
Push to Your Fork: Push your changes to your fork on GitHub.
Create a Pull Request (PR): Go to the original `DataReader` repository and create a pull request from your fork. Ensure you describe your changes in detail and link any relevant issues.
After submitting your PR, the maintainers will review your changes. Be responsive to feedback:
Respond to Comments: If the reviewers ask for changes, make them promptly. Discuss any suggestions or concerns.
Update Your PR: If needed, update your PR based on feedback. This may involve adding more tests or tweaking your approach.
Once your PR is approved:
Merge: The maintainers will merge your changes into the main codebase.
Stay Engaged: Continue to stay involved in the project. Look out for feedback from users on your new feature or fix.
Contributing to `DataReader` is a rewarding experience that benefits the entire user community. Your contributions help make `DataReader` a more robust and versatile tool. We welcome developers of all skill levels and appreciate every form of contribution, from code to documentation. Thank you for considering contributing to `DataReader`!
FAQs
A class to handle and process multiple files with identical structures within a directory.
We found that data-harvest-reader demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Create React App is officially deprecated due to React 19 issues and lack of maintenance—developers should switch to Vite or other modern alternatives.
Security News
Oracle seeks to dismiss fraud claims in the JavaScript trademark dispute, delaying the case and avoiding questions about its right to the name.
Security News
The Linux Foundation is warning open source developers that compliance with global sanctions is mandatory, highlighting legal risks and restrictions on contributions.