aws-analytics-reference-architecture
The AWS Analytics Reference Architecture is a set of analytics solutions put together as end-to-end examples. It gathers AWS best practices for designing, implementing, and operating analytics platforms into purpose-built patterns that handle common requirements and solve customer challenges.
This project is composed of:
This documentation explains how to get started with the core components of the AWS Analytics Reference Architecture.
The core components can be deployed in any AWS Region.
Install the following components with the specified version on the machine from which the deployment will be executed:
Bootstrap AWS CDK in your region (here eu-west-1). This provisions the resources required to deploy AWS CDK applications.
export ACCOUNT_ID=$(aws sts get-caller-identity --query Account --output text)
export AWS_REGION=eu-west-1
cdk bootstrap aws://$ACCOUNT_ID/$AWS_REGION
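The bootstrap command above takes its target in the form aws://ACCOUNT_ID/REGION. As a minimal sketch of how that string is assembled, the helper below builds it from an account ID and a region and rejects account IDs that are not 12 digits. The function name bootstrap_target is hypothetical; it is not part of the AWS CDK or of this library.

```python
def bootstrap_target(account_id: str, region: str) -> str:
    """Build the aws://ACCOUNT_ID/REGION string passed to `cdk bootstrap`.

    Hypothetical helper for illustration only; not part of the AWS CDK.
    """
    # AWS account IDs are 12-digit numbers.
    if len(account_id) != 12 or not (account_id.isascii() and account_id.isdigit()):
        raise ValueError(f"expected a 12-digit AWS account ID, got {account_id!r}")
    if not region:
        raise ValueError("region must not be empty")
    return f"aws://{account_id}/{region}"
```

For example, bootstrap_target("123456789012", "eu-west-1") returns "aws://123456789012/eu-west-1", where 123456789012 is a placeholder account ID.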
Initialize a new AWS CDK application in Python and activate a virtual environment
mkdir my_demo
cd my_demo
cdk init app --language python
python3 -m venv .env
source .env/bin/activate
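Add the AWS CDK and AWS Analytics Reference Architecture dependencies to requirements.txt
aws-cdk-lib==2.51.0
constructs>=10.0.0,<11.0.0
aws_analytics_reference_architecture>=2.0.0
Install the dependencies
python -m pip install -r requirements.txt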
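After installing, it can help to confirm which versions actually landed in the virtual environment. The following is a minimal sketch using only the standard library; installed_versions is a hypothetical helper name, and the distribution names passed to it are assumed to match the entries in requirements.txt.

```python
from importlib import metadata


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if it is missing."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    # Distribution names assumed from requirements.txt.
    wanted = ["aws-cdk-lib", "constructs", "aws-analytics-reference-architecture"]
    for name, version in installed_versions(wanted).items():
        print(f"{name}: {version or 'NOT INSTALLED'}")
```

Run it inside the activated virtual environment; a package reported as NOT INSTALLED usually means the pip install step ran against a different interpreter.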
Import the library and declare the constructs in the stack generated by cdk init (my_demo/my_demo_stack.py)
from aws_cdk import Stack
from constructs import Construct
import aws_analytics_reference_architecture as ara


# MyDemoStack is the stack class that cdk init generates for the my_demo directory
class MyDemoStack(Stack):
    def __init__(self, scope: Construct, construct_id: str, **kwargs) -> None:
        super().__init__(scope, construct_id, **kwargs)

        # Create a new DataLakeStorage with Raw, Clean and Transform buckets configured with data lake best practices
        storage = ara.DataLakeStorage(self, "storage")

        # Create a new DataLakeCatalog with Raw, Clean and Transform databases
        catalog = ara.DataLakeCatalog(self, "catalog")

        # Generate the Sales Data
        sales_data = ara.BatchReplayer(
            scope=self,
            id="sale-data",
            dataset=ara.PreparedDataset.RETAIL_1_GB_STORE_SALE,
            sink_object_key="sale",
            sink_bucket=storage.raw_bucket,
        )

        # Generate the Customer Data
        customer_data = ara.BatchReplayer(
            scope=self,
            id="customer-data",
            dataset=ara.PreparedDataset.RETAIL_1_GB_CUSTOMER,
            sink_object_key="customer",
            sink_bucket=storage.raw_bucket,
        )

        # Configure defaults for Athena console
        athena_defaults = ara.AthenaDemoSetup(scope=self, id="demo_setup")

        # Configure a default role for AWS Glue jobs
        ara.GlueDemoRole.get_or_create(self)
Deploy the AWS CDK application
cdk deploy
The deployment time depends on the constructs you use.
Delete the AWS CDK application
cdk destroy
More constructs, helpers, and datasets are available in the AWS Analytics Reference Architecture. See the full API specification.
Please refer to the contributing guidelines and contributing FAQ for details.
The documentation is made available under the Creative Commons Attribution-ShareAlike 4.0 International License. See the LICENSE file.
The sample code within this documentation is made available under the MIT-0 license. See the LICENSE-SAMPLECODE file.