Changelog
[v1.13.1] - 2019-07-22
[v1.13.0] - 2019-05-20
CUMULUS-802 added some additional IAM permissions to support ECS autoscaling, so you will have to redeploy your IAM stack. As a result of the changes for CUMULUS-1193, CUMULUS-1264, and CUMULUS-1310, you must delete your existing stacks (except IAM) before deploying this version of Cumulus. If running Cumulus within a VPC and extended downtime is acceptable, we recommend doing this at the end of the day to allow AWS backend resources and network interfaces to be cleaned up overnight.
CUMULUS-1228
ecs.amiid property in your config. Instructions for finding the most recent NGAP AMI can be found using these instructions.
CUMULUS-1310
db stack.
Migrations for this version will need to be user-managed (e.g. Elasticsearch and DynamoDB).
Order of stack deployment is iam -> db -> app.
config.yml file, i.e.: kes cf deploy --kes-folder app --template node_modules/@cumulus/deployment/[iam|db|app] [...]
Backwards-compatible. For development, please re-run npm run bootstrap to build new kes overrides.
Deployment docs have been updated to show how to deploy a single-config Cumulus instance.
params have been moved: Nest params fields under app, db or iam to override all Parameters for a particular stack's cloudformation template. Backwards-compatible with multi-config setups.
stackName and stackNameNoDash have been retired. Use prefix and prefixNoDash instead.
iams section in app/config.yml IAM roles has been deprecated as a user-facing parameter, unless your IAM role ARNs do not match the convention shown in @cumulus/deployment/app/config.yml
vpc.securityGroup will need to be set with a pre-existing security group ID to use Cumulus in a VPC. Must allow inbound HTTP(S) (Port 443).
CUMULUS-1212
@cumulus/post-to-cmr will now fail if any granules being processed are missing a metadata file. You can set the new config option skipMetaCheck to true to pass post-to-cmr without a metadata file.
CUMULUS-1232
@cumulus/sync-granule will no longer silently pass if no checksum data is provided. It will use input from the granule object to:
Verify the checksum if checksumType and checksumValue are in the file record OR a checksum file is provided (throws InvalidChecksum on fail), else log a warning that no checksum is available.
Verify the file size if file.size is in the file record (throws UnexpectedFileSize on fail), else log a warning that no file size is available.
CUMULUS-1264
CumulusApiDefault nested stack resource has been renamed to CumulusApiDistribution
CumulusApiV1 nested stack resource has been renamed to CumulusApiBackend
urs: true config option for defining your lambdas (e.g. in lambdas.yml) has been deprecated. There are two new options to replace it:
urs_redirect: 'token': This will expose a TOKEN_REDIRECT_ENDPOINT environment variable to your lambda that references the /token endpoint on the Cumulus backend API
urs_redirect: 'distribution': This will expose a DISTRIBUTION_REDIRECT_ENDPOINT environment variable to your lambda that references the /redirect endpoint on the Cumulus distribution API
CUMULUS-1193
aws iam create-service-linked-role --aws-service-name es.amazonaws.com
CUMULUS-802
maxInstances must be greater than minInstances. If you use defaults, no change is required.
CUMULUS-1269
fileType field to type
fileSize field to size
checksumValue field to checksum where not already done.
ancillary and linkage type support to file objects.
CUMULUS-799
CUMULUS-1242 - Added sqs2sfThrottle lambda. The lambda reads SQS messages for queued executions and uses semaphores to only start new executions if the maximum number of executions defined for the priority key (cumulus_meta.priorityKey) has not been reached. Any SQS messages that are read but not used to start executions remain in the queue.
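The throttling rule described for CUMULUS-1242 can be sketched roughly as below. This is an in-memory illustration only, not the lambda's code: startQueuedExecutions, the semaphores map and the maxExecutions lookup are hypothetical names, and the real lambda works against SQS and persisted semaphores.

```javascript
// Hypothetical, in-memory sketch of the throttling rule; not the real lambda.
// semaphores: Map of priority key -> number of running executions.
// maxExecutions: object of priority key -> configured maximum for that key.
function startQueuedExecutions(messages, semaphores, maxExecutions) {
  const started = [];
  const leftInQueue = [];
  for (const message of messages) {
    const key = message.cumulus_meta.priorityKey;
    const running = semaphores.get(key) || 0;
    // A key with no configured maximum never passes this check in this sketch.
    if (running < maxExecutions[key]) {
      semaphores.set(key, running + 1); // claim a slot before starting
      started.push(message);
    } else {
      leftInQueue.push(message); // read but unused: stays in the queue
    }
  }
  return { started, leftInQueue };
}

// One execution is already running for the "low" key and the maximum is 2,
// so only one of the three queued messages may start.
const semaphores = new Map([['low', 1]]);
const messages = [1, 2, 3].map((id) => ({ id, cumulus_meta: { priorityKey: 'low' } }));
const result = startQueuedExecutions(messages, semaphores, { low: 2 });
console.log(result.started.length, result.leftInQueue.length);
```

The count is incremented before the message is treated as started so that later messages in the same batch see the claimed slot.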
CUMULUS-1240
sfSemaphoreDown lambda. This lambda receives SNS messages and for each message it decrements the semaphore used to track the number of running executions if:
cumulus_meta.priorityKey)
sfSemaphoreDown lambda as a subscriber to the sfTracker SNS topic
CUMULUS-1265
apiConfigs configuration option to configure API Gateway to be private
NoVpc lambdas from documentation and example folder.
CUMULUS-802
Updated @cumulus/ingest/http/httpMixin.list() to trim trailing spaces on discovered filenames
CUMULUS-1310
db stack. This will enable future updates to avoid affecting database resources or requiring migrations. Migrations for this version will need to be user-managed (e.g. Elasticsearch and DynamoDB). Order of stack deployment is iam -> db -> app.
config.yml file, i.e.: kes cf deploy --kes-folder app --template node_modules/@cumulus/deployment/[iam|db|app] [...] Backwards-compatible. Please re-run npm run bootstrap to build new kes overrides. Deployment docs have been updated to show how to deploy a single-config Cumulus instance.
params fields should now be nested under the stack key (i.e. app, db or iam) to provide Parameters for a particular stack's cloudformation template, for use with single-config instances. Keys must match the name of the deployment package folder (app, db, or iam). Backwards-compatible with multi-config setups.
stackName and stackNameNoDash have been retired as user-facing config parameters. Use prefix and prefixNoDash instead. This will be used to create stack names for all stacks in a single-config use case. stackName may still be used as an override in multi-config usage, although this is discouraged. Warning: overriding the db stack's stackName will require you to set dbStackName in your app/config.yml. This parameter is required to fetch outputs from the db stack to reference in the app stack.
iams section in app/config.yml IAM roles has been retired as a user-facing parameter, unless your IAM role ARNs do not match the convention shown in @cumulus/deployment/app/config.yml. In that case, overriding iams in your own config is recommended.
iam and db cloudformation.yml file names will have respective prefixes (e.g. iam.cloudformation.yml).
private, public and protected types.
vpc.securityGroup in config.yml. This security group must allow inbound HTTP(S) traffic (Port 443). SSH traffic (Port 22) must be permitted for SSH access to ECS instances.
CUMULUS-1236
CUMULUS-1223
CUMULUS-1232
ingest/granule.handleDuplicateFile for maintainability.
ingest/granule.ingestFile and move-granules/index.moveFileRequest to use new function.
ingest/granule.moveGranuleFileWithVersioning
ingest/granule.verifyFile now also tests file.size for verification if it is in the file record and throws UnexpectedFileSize error for file size not matching input.
ingest/granule.verifyFile logs warnings if checksum and/or file size are not available.
CUMULUS-1193
CUMULUS-1207
Updated Lerna publish script so that published Cumulus packages will pin their dependencies on other Cumulus packages to exact versions (e.g. 1.12.1 instead of ^1.12.1)
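To check whether a package follows the exact-version pinning described for CUMULUS-1207, something like the sketch below could be used. findUnpinnedCumulusDeps and the example package object are hypothetical and not part of the Cumulus publish script. The check only recognizes plain x.y.z versions.

```javascript
// Hypothetical helper: list @cumulus/* dependencies that are not pinned
// to an exact version. Not part of the Cumulus publish tooling.
function findUnpinnedCumulusDeps(pkg) {
  // Plain x.y.z only; prerelease or build tags count as unpinned in this sketch.
  const exactVersion = /^\d+\.\d+\.\d+$/;
  return Object.entries(pkg.dependencies || {})
    .filter(([name, spec]) => name.startsWith('@cumulus/') && !exactVersion.test(spec))
    .map(([name]) => name);
}

// Example package.json contents (made up for illustration).
const examplePackage = {
  dependencies: {
    '@cumulus/common': '1.12.1', // pinned
    '@cumulus/ingest': '^1.12.1', // range, reported below
    lodash: '^4.17.11', // not a Cumulus package, ignored
  },
};
const unpinned = findUnpinnedCumulusDeps(examplePackage);
console.log(unpinned); // only '@cumulus/ingest' is reported
```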
CUMULUS-1203
CUMULUS-1268
[v1.12.0] - 2019-04-04
Note: There was an issue publishing 1.12.0. Upgrade to 1.12.1.
CUMULUS-1139
granule.applyWorkflow uses the new-style granule record as input to workflows.
CUMULUS-1171
http or https providers so that the host field only contains a valid hostname or IP address, and the port field contains the provider port.
CUMULUS-1176
@cumulus/move-granules input expectations have changed. @cumulus/files-to-granules is a new intermediate task to perform input translation in the old style. See the Added and Changed sections of this release changelog for more information.
CUMULUS-670
CUMULUS-1208
@cumulus/queue-granules will now be added to ingest workflow messages as is. In practice, this means that if you are using @cumulus/queue-granules to trigger ingest workflows and your input granule objects have invalid properties, then your ingest workflows will fail due to schema validation errors.
CUMULUS-777
CUMULUS-1183
CUMULUS-1223
@cumulus/common/bucketsConfigJsonObject for fetching stack's bucket configuration as an object.
CUMULUS-853
CUMULUS-805
AWS_REGION in .env to be used by deployment script
CUMULUS-803
CUMULUS-670
CUMULUS-1184 - Added kes logging output to ensure we always see the state machine reference before failures due to configuration
CUMULUS-1105 - Added a dashboard endpoint to serve the dashboard from an S3 bucket
CUMULUS-1199 - Moves s3credentials endpoint from the backend to the distribution API.
CUMULUS-666
@api/endpoints/s3credentials to allow EarthData Login authorized users to retrieve temporary security credentials for same-region direct S3 access.
CUMULUS-671
@packages/integration-tests/api/distribution/getDistributionApiS3SignedUrl() to return the S3 signed URL for a file protected by the distribution API
CUMULUS-672
cmrMetadataFormat and cmrConceptId to output for individual granules from @cumulus/post-to-cmr. cmrMetadataFormat will be read from the cmrMetadataFormat generated for each granule in @cumulus/cmrjs/publish2CMR()
@packages/integration-tests/api/distribution:
getDistributionApiFileStream() returns a stream to download files protected by the distribution API
getDistributionFileUrl() constructs URLs for requesting files from the distribution API
CUMULUS-1185 @cumulus/api/models/Granule.removeGranuleFromCmrByGranule to replace @cumulus/api/models/Granule.removeGranuleFromCmr and use the Granule UR from the CMR metadata to remove the granule from CMR
CUMULUS-1101
@cumulus/checksum package. This package provides functions to calculate and validate checksums.
@cumulus/common/aws: calculateS3ObjectChecksum and validateS3ObjectChecksum, which depend on the checksum package.
CUMULUS-1171
@cumulus/common API documentation to packages/common/docs/API.md
npm run build-docs task to @cumulus/common
@cumulus/common/string#isValidHostname()
@cumulus/common/string#match()
@cumulus/common/string#matches()
@cumulus/common/string#toLower()
@cumulus/common/string#toUpper()
@cumulus/common/URLUtils#buildURL()
@cumulus/common/util#isNil()
@cumulus/common/util#isNull()
@cumulus/common/util#isUndefined()
@cumulus/common/util#negate()
CUMULUS-1176
@cumulus/files-to-granules task to handle converting file array output from cumulus-process tasks into granule objects. Allows simplification of @cumulus/move-granules and @cumulus/post-to-cmr, see Changed section for more details.
CUMULUS-1151 Compare the granule holdings in CMR with Cumulus' internal data store
CUMULUS-1152 Compare the granule file holdings in CMR with Cumulus' internal data store
CUMULUS-1216 - Updated @cumulus/ingest/granule/ingestFile to download files to expected staging location.
CUMULUS-1208 - Updated @cumulus/ingest/queue/enqueueGranuleIngestMessage() to not transform granule object passed to it when building an ingest message
CUMULUS-1198 - @cumulus/ingest no longer enforces any expectations about whether provider_path contains a leading slash or not.
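For code outside @cumulus/ingest that wants to be equally tolerant of provider_path with or without a leading slash (CUMULUS-1198), one possible approach is sketched below. joinProviderPath is a hypothetical helper, not a Cumulus function, and producing a relative path is an assumption made for this example.

```javascript
// Hypothetical helper, not part of @cumulus/ingest: join a provider_path and
// a file name so that leading and trailing slashes make no difference.
function joinProviderPath(providerPath, fileName) {
  // Strip any run of slashes at the start or end of the path.
  const trimmed = providerPath.replace(/^\/+|\/+$/g, '');
  return trimmed === '' ? fileName : `${trimmed}/${fileName}`;
}

// Both spellings of the same path give the same result.
const withSlash = joinProviderPath('/data/granules', 'file.hdf');
const withoutSlash = joinProviderPath('data/granules/', 'file.hdf');
console.log(withSlash === withoutSlash); // true
```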
CUMULUS-1170
npm instead of yarn
package-lock.json files to ensure matching versions of npm packages
npm ci instead of npm install
CUMULUS-670
CUMULUS-1139 - Granules stored in the API contain a files property. That schema has been greatly simplified and now better matches the CNM format.
name property has been renamed to fileName.
filepath property has been renamed to key.
checksumValue property has been renamed to checksum.
path property has been removed.
url_path property has been removed.
filename property (which contained an s3:// URL) has been removed, and the bucket and key properties should be used instead. Any requests sent to the API containing a granule.files[].filename property will be rejected, and any responses coming back from the API will not contain that filename property.
source property has been added, which is a URL indicating the original source of the file.
@cumulus/ingest/granule.moveGranuleFiles() no longer includes a filename field in its output. The bucket and key fields should be used instead.
CUMULUS-672
@cumulus/integration-tests/api/EarthdataLogin.getEarthdataLoginRedirectResponse to @cumulus/integration-tests/api/EarthdataLogin.getEarthdataAccessToken. The new function returns an access response from Earthdata login, if successful.
@cumulus/integration-tests/cmr/getOnlineResources now accepts an object of options, including cmrMetadataFormat. Based on the cmrMetadataFormat, the function will correctly retrieve the online resources for each metadata format (ECHO10, UMM-G)
CUMULUS-1101
@cumulus/common/file/getFileChecksumFromStream into @cumulus/checksum, and renamed it to generateChecksumFromStream. This is a breaking change for users relying on @cumulus/common/file/getFileChecksumFromStream.
@cumulus/ingest/Granule to depend on new common/aws checksum functions and remove a significant amount of existing checksumming code.
@cumulus/ingest/granule.validateChecksum. Replaced with @cumulus/ingest/granule.verifyFile.
granule.getChecksumFromFile to granule.retrieveSuppliedFileChecksumInformation to be more accurate.
@cumulus/common/aws.checksumS3Objects. Use @cumulus/common/aws.calculateS3ObjectChecksum instead.
CUMULUS-1171
host and port properties. HTTP providers ignored port and protocol, and stored an entire URL in the host property. Updated the API to only accept valid hostnames or IP addresses in the provider.host field. Updated ingest code to properly build HTTP and HTTPS URLs from provider.protocol, provider.host, and provider.port.
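As an illustration of building a URL from separate provider.protocol, provider.host and provider.port values as described for CUMULUS-1171, here is a minimal sketch using only the URL class built into Node.js. buildProviderUrl and the example provider are hypothetical. This is not the ingest code itself.

```javascript
// Hypothetical helper; uses only the WHATWG URL class built into Node.js.
// Assumes provider.host holds a bare hostname or IP address, not a full URL.
function buildProviderUrl(provider, path = '/') {
  const url = new URL(`${provider.protocol}://${provider.host}`);
  // The port is optional; a scheme's default port is dropped by URL itself.
  if (provider.port) url.port = String(provider.port);
  url.pathname = path;
  return url.toString();
}

// Example provider record (made up for illustration).
const provider = { protocol: 'https', host: 'data.example.com', port: 8443 };
const fileUrl = buildProviderUrl(provider, '/granules/file.hdf');
console.log(fileUrl); // https://data.example.com:8443/granules/file.hdf
```

Keeping host free of scheme and port is what makes this composition unambiguous, which matches the stricter provider.host validation noted above.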
CUMULUS-1176
@cumulus/move-granules breaking change: Input to move-granules is now expected to be in the form of a granules object (i.e. { granules: [ { ... }, { ... } ] }). For backwards compatibility with array-of-files outputs from processing steps, use the new @cumulus/files-to-granules task as an intermediate step. This task will perform the input translation. This change allows move-granules to be simpler and behave more predictably. config.granuleIdExtraction and config.input_granules are no longer needed/used by move-granules.
@cumulus/post-to-cmr: config.granuleIdExtraction is no longer needed/used by post-to-cmr.
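The translation from an array-of-files output to the granules object that move-granules now expects might look roughly like the sketch below. It is illustrative only and is not the @cumulus/files-to-granules implementation. groupFilesIntoGranules, the granuleId and files[].bucket/key field names, and the sample URLs are assumptions made for this example.

```javascript
// Hypothetical sketch; the real @cumulus/files-to-granules task may differ.
// granuleId and files[].bucket/key are assumed field names for illustration.
// granuleIdExtraction must contain one capture group that yields the granule ID.
function groupFilesIntoGranules(s3Urls, granuleIdExtraction) {
  const idPattern = new RegExp(granuleIdExtraction);
  const granulesById = new Map();
  for (const s3Url of s3Urls) {
    const parts = /^s3:\/\/([^/]+)\/(.+)$/.exec(s3Url);
    if (!parts) throw new Error(`Not an s3:// URL: ${s3Url}`);
    const [, bucket, key] = parts;
    const fileName = key.split('/').pop();
    const idMatch = idPattern.exec(fileName);
    if (!idMatch) throw new Error(`No granule ID found in ${fileName}`);
    const granuleId = idMatch[1];
    if (!granulesById.has(granuleId)) {
      granulesById.set(granuleId, { granuleId, files: [] });
    }
    granulesById.get(granuleId).files.push({ bucket, key });
  }
  // Shape matches the { granules: [ ... ] } form described above.
  return { granules: [...granulesById.values()] };
}

const input = groupFilesIntoGranules(
  ['s3://staging/a/G1.hdf', 's3://staging/a/G1.hdf.met', 's3://staging/a/G2.hdf'],
  '^(G\\d+)\\.'
);
console.log(JSON.stringify(input, null, 2));
```

Files that do not match the extraction pattern raise an error here rather than being dropped silently. That is a design choice for the sketch, not a statement about the real task.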
CUMULUS-1174
@cumulus/api: files and granules were not getting indexed correctly because files indexing was failing in db-indexer
@cumulus/deployment: A bug in the CloudFormation template prevented the API from being launched in a VPC; the IAM template was updated to grant the permissions needed to run the API in a VPC
@cumulus/api/models/Granule.removeGranuleFromCmr, instead use @cumulus/api/models/Granule.removeGranuleFromCmrByGranule
@cumulus/ingest/granule.validateChecksum, instead use @cumulus/ingest/granule.verifyFile
@cumulus/common/aws.checksumS3Objects, instead use @cumulus/common/aws.calculateS3ObjectChecksum
@cumulus/cmrjs: getGranuleId and getCmrFiles are deprecated due to changes in input handling.