Changelog
[v10.0.0] 2022-02-01
task-config for all workflows that use the sync-granule task to include workflowStartTime set to {$.cumulus_meta.workflow_start_time}. See here for an example.
isCMRFilename and isCMRFile
isECHO10File -> isECHO10Filename
isUMMGFile -> isUMMGFilename
isISOFile -> isCMRISOFilename
files object now matches the file schema in the Cumulus database and thus also matches the files object produced by the API with use cases like applyWorkflow. This includes removal of name and filename in favor of bucket and key, removal of certain properties such as etag and duplicate_found and outputting them as separate objects stored in meta.
@cumulus/checksum are now converted to string to standardize checksum formatting across the Cumulus library.
sync-granule task has been updated to support an optional configuration parameter workflowStartTime. The output payload of sync-granule now includes a createdAt time for each granule which is set to the provided workflowStartTime or falls back to Date.now() if not provided. Workflows using sync-granule may be updated to include this parameter with the value of {$.cumulus_meta.workflow_start_time} in the task_config.
@cumulus/cumulus-message-adapter-js from 2.0.3 to 2.0.4 for all Cumulus workflow tasks
POST /deadLetterArchive/recoverCumulusMessages
POST /elasticsearch/index-from-database
POST /granules/bulk
POST /granules/bulkDelete
POST /granules/bulkReingest
POST /migrationCounts
POST /reconciliationReports
POST /replays
POST /replays/sqs
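The createdAt fallback described in the sync-granule entry above can be sketched roughly as follows. This is a minimal illustration, not the task's actual code: resolveCreatedAt is a hypothetical helper, and it assumes workflowStartTime arrives as an epoch-milliseconds number.

```typescript
// Hypothetical helper (not part of Cumulus): picks the createdAt value for a granule.
// Assumption: workflowStartTime is an epoch-milliseconds number when present.
function resolveCreatedAt(
  workflowStartTime?: number,
  now: () => number = Date.now
): number {
  // Prefer the workflow start time from the task config; otherwise use the clock.
  return workflowStartTime !== undefined ? workflowStartTime : now();
}

// With the parameter configured, every granule shares the workflow's start time.
const createdAtFromConfig = resolveCreatedAt(1643673600000);

// Without it, the value comes from the clock (stubbed here to keep the example deterministic).
const createdAtFromClock = resolveCreatedAt(undefined, () => 42);
```

Passing the clock in as a parameter is only a convenience for testing the fallback path.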
knex package from 0.95.11 to 0.95.15
example/cumulus-tf module to retrieve default VPC and subnets in NGAP accounts
vpc_tag_name variable which defines the tags used to look up a VPC. Defaults to VPC tag name used in NGAP accounts
subnets_tag_name variable which defines the tags used to look up VPC subnets. Defaults to a subnet tag name used in NGAP accounts
example/data-persistence-tf module to retrieve default VPC and subnets in NGAP accounts
vpc_tag_name variable which defines the tags used to look up a VPC. Defaults to VPC tag name used in NGAP accounts
subnets_tag_name variable which defines the tags used to look up VPC subnets. Defaults to a subnet tag name used in NGAP accounts
example/rds-cluster-tf module to retrieve default VPC and subnets in NGAP accounts
vpc_tag_name variable which defines the tags used to look up a VPC. Defaults to VPC tag name used in NGAP accounts
subnets_tag_name variable which defines the tags used to look up VPC subnets. Defaults to tag names used for subnets in NGAP accounts
SHA-256 vs SHA256) to tasks that calculate checksums.
cmr_search_client_config tfvars to the archive and cumulus terraform modules.
s3MultipartChunksizeMb to collection's meta field
move-granules task to take in an optional config parameter s3MultipartChunksizeMb
vpc_id variable optional for example/cumulus-tf module
vpc_id and subnet_ids variables optional for example/data-persistence-tf module
vpc_id and subnets variables optional for example/rds-cluster-tf module
USE_CACHED_BOOTSTRAP is disabled.
BucketType.
workflowStartTime
createdAt on output based on workflowStartTime if provided, falling back to Date.now() if not provided.
task_config of SyncGranule in example workflows
@cumulus/cumulus-message-adapter-js version 2.0.3, which includes an update to cma-js to better expose CMA stderr stream output on lambda timeouts as well as minor logging enhancements.
finalPayload.* will not dynamically add new fields to mapping
originalPayload.* will not dynamically add new fields to mapping
tasks will not dynamically add new fields to mapping
TypeError thrown by @cumulus/cmrjs/cmr-utils.getGranuleTemporalInfo when a granule's associated UMM-G JSON metadata file does not contain a ProviderDates element that has a Type of either "Update" or "Insert". If neither is present, the granule's last update date falls back to the "Create" type provider date, or undefined if none is present.
@cumulus/api-client/invokeApi() to accept a single accepted status code or an array of accepted status codes via expectedStatusCodes
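Accepting either a single status code or an array, as the invokeApi() entry describes for expectedStatusCodes, usually comes down to normalizing the value before comparing. The sketch below is an assumption about how such a check could look. isExpectedStatus is a hypothetical helper, and the default of 200 is illustrative rather than taken from the library.

```typescript
// Type for a parameter that accepts one status code or several.
type ExpectedStatusCodes = number | number[];

// Hypothetical helper (not the @cumulus/api-client implementation).
// Assumption: 200 is used as the default accepted status code.
function isExpectedStatus(
  statusCode: number,
  expectedStatusCodes: ExpectedStatusCodes = 200
): boolean {
  // Normalize the single-value form to an array so one comparison handles both.
  const accepted = Array.isArray(expectedStatusCodes)
    ? expectedStatusCodes
    : [expectedStatusCodes];
  return accepted.indexOf(statusCode) !== -1;
}

const createdIsAccepted = isExpectedStatus(201, [200, 201]);
const createdIsRejected = !isExpectedStatus(201, 200);
```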
@cumulus/launchpad-auth/LaunchpadToken.requestToken and validateToken to use the HTTPS request option https.pfx instead of the deprecated pfx option for providing the certificate.
cmr-utils/getGranuleTemporalInfo to search for a SingleDateTime element, when beginningDateTime value is not found in the metadata file. The granule's temporal information is returned so that both beginningDateTime and endingDateTime are set to the discovered singleDateTimeValue.
_writeGranule() in write-granules.js to catch failed granule writes due to schema validation, log the failure and then attempt to set the status of the granule to failed if it already exists to prevent a failure from allowing the granule to get "stuck" in a non-failed status.
@cumulus/api-client to not log an error for 201 response from updateGranule
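The SingleDateTime fallback noted above for getGranuleTemporalInfo can be illustrated with a simplified sketch. The flattened TemporalMetadata shape and the temporalInfoWithFallback helper are hypothetical. Real ECHO10 and UMM-G metadata is nested differently, and this is not the cmr-utils implementation.

```typescript
// Hypothetical, flattened view of the temporal fields found in a metadata file.
interface TemporalMetadata {
  beginningDateTime?: string;
  endingDateTime?: string;
  singleDateTime?: string;
}

interface TemporalInfo {
  beginningDateTime?: string;
  endingDateTime?: string;
}

// Hypothetical helper showing the fallback order described in the entry.
function temporalInfoWithFallback(metadata: TemporalMetadata): TemporalInfo {
  if (metadata.beginningDateTime !== undefined) {
    // A range is present: report it as-is.
    return {
      beginningDateTime: metadata.beginningDateTime,
      endingDateTime: metadata.endingDateTime,
    };
  }
  if (metadata.singleDateTime !== undefined) {
    // No range: use the single date-time for both ends.
    return {
      beginningDateTime: metadata.singleDateTime,
      endingDateTime: metadata.singleDateTime,
    };
  }
  // Nothing usable was found.
  return {};
}

const fromSingle = temporalInfoWithFallback({ singleDateTime: '2021-01-01T00:00:00Z' });
```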
hyrax-metadata-updates task to support reading the DatasetId from ECHO10 XML, and the EntryTitle from UMM-G JSON; these are both valid alternatives to the shortname and version ID.
[v9.7.1] 2021-12-08 [Backport]
Please note changes in 9.7.0 may not yet be released in future versions, as this is a backport and patch release on the 9.7.x series of releases. Updates that are included in the future will have a corresponding CHANGELOG entry in future releases.
Fixed
[v9.9.0] 2021-11-03
MoveGranules step
isISOFile to check if a given file object is an ISO file
granuleToCmrFileObject and granulesToCmrFileObjects now take a filterFunc argument
filterFunc's default value is isCMRFile, so the previous behavior is maintained if no value is given for this argument
MoveGranules passes a custom filter function to granulesToCmrFileObjects to check for isISOFile in addition to isCMRFile, so that metadata from .iso.xml files can be used in the urlPathTemplate
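The filterFunc pattern described above, a default filter that preserves old behavior plus a caller-supplied filter that widens it, can be sketched as follows. The GranuleFile shape, the selectMetadataFiles helper, and the filename patterns are assumptions for illustration, not the @cumulus/cmrjs code.

```typescript
// Hypothetical minimal file shape; real granule file objects carry more fields.
interface GranuleFile {
  key: string;
}

type FileFilter = (file: GranuleFile) => boolean;

// Stand-in predicates. Assumption: the filename suffixes are illustrative only.
const isCMRFile: FileFilter = (file) => /\.cmr\.(xml|json)$/.test(file.key);
const isISOFile: FileFilter = (file) => /\.iso\.xml$/.test(file.key);

// Hypothetical helper: the default argument keeps the previous behavior
// when the caller does not pass a filter.
function selectMetadataFiles(
  files: GranuleFile[],
  filterFunc: FileFilter = isCMRFile
): GranuleFile[] {
  return files.filter(filterFunc);
}

const granuleFiles: GranuleFile[] = [
  { key: 'path/granule1.cmr.xml' },
  { key: 'path/granule1.iso.xml' },
  { key: 'path/granule1.hdf' },
];

// Default: only CMR metadata files are selected.
const defaultSelection = selectMetadataFiles(granuleFiles);

// Custom filter: ISO files are accepted as well.
const withIsoSelection = selectMetadataFiles(
  granuleFiles,
  (file) => isCMRFile(file) || isISOFile(file)
);
```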
elasticsearch_client_config tfvars to the archive and cumulus terraform modules.
default_s3_multipart_chunksize_mb setting to the move-granules lambda function.
default_s3_multipart_chunksize_mb tfvars to the cumulus and ingest terraform modules.
chunkSize to @cumulus/aws-client/S3.moveObject and @cumulus/aws-client/S3.multipartCopyObject to set the chunk size of the S3 multipart uploads.
maxChunkSize to chunkSize in @cumulus/aws-client/lib/S3MultipartUploads.createMultipartChunks.
@cumulus/cumulus-message-adapter-js version 2.0.1
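To show what the chunkSize setting for S3 multipart copies controls, here is a small sketch that splits an object into byte ranges of at most chunkSize bytes. createChunks is a hypothetical stand-in and may differ from how createMultipartChunks in @cumulus/aws-client is actually written.

```typescript
// Inclusive byte offsets for one part of a multipart copy.
interface ChunkRange {
  start: number;
  end: number;
}

// Hypothetical helper: splits objectSize bytes into ranges of at most chunkSize bytes.
function createChunks(objectSize: number, chunkSize: number): ChunkRange[] {
  if (chunkSize <= 0) {
    throw new RangeError('chunkSize must be a positive number of bytes');
  }
  const ranges: ChunkRange[] = [];
  for (let start = 0; start < objectSize; start += chunkSize) {
    // The last range is shortened so it never runs past the end of the object.
    ranges.push({ start, end: Math.min(start + chunkSize, objectSize) - 1 });
  }
  return ranges;
}

// A 10-byte object with 4-byte chunks yields ranges 0-3, 4-7 and 8-9.
const exampleRanges = createChunks(10, 4);
```

A smaller chunk size means more parts per copy, and a larger one means fewer but bigger parts.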
@cumulus/api/launchpadSaml.launchpadPublicCertificate to correctly retrieve certificate from launchpad IdP metadata with and without namespace prefix.
[v9.8.0] 2021-10-19
36 of cumuluss/async-operation to Docker Hub for compatibility with upgrades to knex package and to address security vulnerabilities.
Added @cumulus/db/createRejectableTransaction() to handle creating a Knex transaction that will throw an error if the transaction rolls back. As of Knex 0.95+, promise rejection on transaction rollback is no longer the default behavior.
CUMULUS-2639
CUMULUS-2670
lambda_timeouts string map variable for cumulus module to accept an update_granules_cmr_metadata_file_links_task_timeout property
CUMULUS-2598
knex version from 0.23.11 to 0.95.11 to address security vulnerabilities
cumuluss/async-operation:36
queued when scheduling the granule.
buckets.json out of the s3://internal-bucket/workflows directory into s3://internal-bucket/buckets.
[v9.7.0] 2021-10-01
queue-granules task now updates granule status to queued when a granule is queued. In order to prevent issues with the private API endpoint and Lambda API request and concurrency limits, this functionality runs with limited concurrency, which may increase the task's overall runtime when large numbers of granules are being queued. If you are facing Lambda timeout errors with this task, we recommend converting your queue-granules task to an ECS activity. This concurrency is configurable via the task config's concurrency value.
discover-granules task has been updated to limit concurrency on checks to identify and skip already ingested granules in order to prevent issues with the private API endpoint and Lambda API request and concurrency limits. This may increase the task's overall runtime when large numbers of granules are discovered. If you are facing Lambda timeout errors with this task, we recommend converting your discover-granules task to an ECS activity. This concurrency is configurable via the task config's concurrency value.
<prefix>-sfEventSqsToDbRecords Lambda to 1024MB
@cumulus/queue-granules to respect a new config parameter: preferredQueueBatchSize. Queue-granules will respect this batch size as best as it can to batch granules into workflow payloads. As workflows generally rely on information such as collection and provider expected to be shared across all granules in a workflow, queue-granules will break batches up by collection, as well as provider if there is a provider field on the granule. This may result in batches that are smaller than the preferred size, but never larger ones. The default value is 1, which preserves current behavior of queueing 1 granule per workflow.
DiscoverGranulesToThrottledQueue that discovers and writes granules to a throttled background queue. This allows discovery and ingest of larger numbers of granules without running into limits with lambda concurrency.
archive_api_reserved_concurrency from 8 to 5 to use fewer reserved lambda functions. If you see throttling errors on the <stack>-apiEndpoints you should increase this value.
archive_api_reserved_concurrency from 8 to 15 to prevent throttling on the dashboard for default deployments.
api/endpoints/execution-status.js get method to include associated granules, as an array, for the provided execution.
getExecutionArnsByGranuleCumulusId returning a list of executionArns sorted by most recent first, for an input Granule Cumulus ID in support of the move of translatePostgresGranuleToApiGranule from RDS-Phase2 feature branch
getApiExecutionCumulusIds returning cumulus IDs for a given list of executions
eraseDynamoTables(). Changed the call Promise.all() to Promise.allSettled() to ensure all dynamo records (provider records in particular) are deleted prior to reseeding.
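The preferredQueueBatchSize behavior described in the queue-granules entry for this release (group by collection and provider, then cut each group into batches no larger than the preferred size) can be sketched as below. The Granule shape and batchGranules helper are hypothetical. This illustrates the described rule and is not the task's implementation.

```typescript
// Hypothetical minimal granule shape for illustration.
interface Granule {
  granuleId: string;
  collectionId: string;
  provider?: string;
}

// Hypothetical helper: batches never mix collections or providers,
// and never exceed preferredQueueBatchSize (default 1, one granule per workflow).
function batchGranules(
  granules: Granule[],
  preferredQueueBatchSize: number = 1
): Granule[][] {
  const batchSize = Math.max(1, Math.floor(preferredQueueBatchSize));
  const groups: { [key: string]: Granule[] | undefined } = {};
  const groupOrder: string[] = [];

  for (const granule of granules) {
    // Granules without a provider field are grouped by collection alone.
    const providerPart = granule.provider === undefined ? null : granule.provider;
    const key = JSON.stringify([granule.collectionId, providerPart]);
    let group = groups[key];
    if (group === undefined) {
      group = [];
      groups[key] = group;
      groupOrder.push(key);
    }
    group.push(granule);
  }

  const batches: Granule[][] = [];
  for (const key of groupOrder) {
    const group = groups[key] || [];
    // Cut each group into slices; the final slice may be smaller than batchSize.
    for (let i = 0; i < group.length; i += batchSize) {
      batches.push(group.slice(i, i + batchSize));
    }
  }
  return batches;
}

const sampleGranules: Granule[] = [
  { granuleId: 'g1', collectionId: 'A' },
  { granuleId: 'g2', collectionId: 'A' },
  { granuleId: 'g3', collectionId: 'A' },
  { granuleId: 'g4', collectionId: 'B' },
];

// Three granules of collection A split into a batch of 2 and a batch of 1;
// collection B gets its own batch.
const sampleBatches = batchGranules(sampleGranules, 2);
```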
[v9.6.0] 2021-09-20
PUT /granules API endpoint to update a granule
updateGranule to @cumulus/api-client/granules
POST /granules/{granuleId}/executions API endpoint to associate an execution with a granule
associateExecutionWithGranule to @cumulus/api-client/granules
queued as option for granule's status field
Moved ssh2 package from @cumulus/common to @cumulus/sftp-client and upgraded package from ^0.8.7 to ^1.0.0 to address security vulnerability issue in previous version.
CUMULUS-2583
QueueGranules task now updates granule status to queued once it is added to the queue.
CUMULUS-2617
Authorization header for CMR Launchpad authentication instead of the deprecated Echo-Token header.
Added missing permission for <prefix>_ecs_cluster_instance_role IAM role (used when running ECS services/tasks) to allow kms:Decrypt on the KMS key used to encrypt provider credentials. Adding this permission fixes the sync-granule task when run as an ECS activity in a Step Function, which previously failed trying to decrypt credentials for providers.
CUMULUS-2576
[v9.5.0] 2021-09-07
logs record type from mappings from Elasticsearch. This change should not have any adverse impact on existing deployments, even those which still contain logs records, but technically it is a breaking change to the Elasticsearch mappings.
@cumulus/api-client/asyncOperations.getAsyncOperation to return parsed JSON body of response and not the raw API endpoint response
cumulus module to take lambda_timeouts string map variable that allows timeouts of ingest tasks to be configurable. Allowed properties for the mapping include:
POST /granules API endpoint to create a granule
createGranule to @cumulus/api-client
POST /executions endpoint to create an execution
PUT /executions endpoint to update an execution
delete method for granules-executions.ts implemented as part of CUMULUS-2306 from the RDS-Phase-2 feature branch in support of CUMULUS-2644.
erasePostgresTables method in serve.js implemented as part of CUMULUS-2644, and CUMULUS-2306 from the RDS-Phase-2 feature branch in support of CUMULUS-2644
resetPostgresDb method to support resetting between integration test suite runs
Updated processDeadLetterArchive Lambda to return an object where processingSucceededKeys is an array of the S3 keys for successfully processed objects and processingFailedKeys is an array of S3 keys for objects that could not be processed
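The return shape described for processDeadLetterArchive can be pictured with the following sketch. Only the two property names come from the entry above. processKeys and its synchronous callback are hypothetical simplifications, since the real Lambda works asynchronously against S3.

```typescript
// Shape described in the entry: two arrays of S3 keys.
interface DeadLetterArchiveResult {
  processingSucceededKeys: string[];
  processingFailedKeys: string[];
}

// Hypothetical, synchronous stand-in for the Lambda's per-object loop.
function processKeys(
  keys: string[],
  processOne: (key: string) => void
): DeadLetterArchiveResult {
  const result: DeadLetterArchiveResult = {
    processingSucceededKeys: [],
    processingFailedKeys: [],
  };
  for (const key of keys) {
    try {
      processOne(key);
      result.processingSucceededKeys.push(key);
    } catch (error) {
      // A failure on one object is recorded and does not stop the others.
      result.processingFailedKeys.push(key);
    }
  }
  return result;
}

const archiveResult = processKeys(['dead-letter/a.json', 'dead-letter/b.json'], (key) => {
  if (key === 'dead-letter/b.json') {
    throw new Error('could not process object');
  }
});
```

Callers can then retry or inspect only the keys listed in processingFailedKeys.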
Updated async operations to handle writing records to the databases when output of the operation is undefined
CUMULUS-2644
migration directory from the db-migration-lambda to the db package and updated unit test references to migrationDir to be pulled from @cumulus/db
@cumulus/api/bin/serveUtils to write records to PostgreSQL tables
CUMULUS-2575
@cumulus/db/src/model/granules functions get and exists to enforce parameter checking so that requests include either (granule_id and collection_cumulus_id) or (cumulus_id) to prevent incorrect results.
@cumulus/message/src/Collections.deconstructCollectionId has been modified to throw a descriptive error if the input collectionId is undefined rather than TypeError: Cannot read property 'split' of undefined. This function has also been updated to throw descriptive errors if an incorrectly formatted collectionId is input.
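The difference between a descriptive error and the bare TypeError mentioned for deconstructCollectionId can be sketched like this. parseCollectionId is a hypothetical stand-in, and both the name___version format (three underscores) and the error messages are assumptions made for the example, not text taken from @cumulus/message.

```typescript
// Hypothetical stand-in for a collection ID parser.
// Assumption: collection IDs look like "<name>___<version>" (three underscores).
function parseCollectionId(collectionId?: string): { name: string; version: string } {
  if (collectionId === undefined) {
    // Check explicitly instead of letting .split() fail with an opaque TypeError.
    throw new Error('Collection ID is undefined and cannot be parsed');
  }
  const [shortName, version, ...rest] = collectionId.split('___');
  if (!shortName || !version || rest.length > 0) {
    throw new Error(
      `Collection ID "${collectionId}" is not in the expected "<name>___<version>" format`
    );
  }
  return { name: shortName, version };
}

const parsedCollection = parseCollectionId('MOD09GQ___006');
```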