@cumulus/discover-granules
Advanced tools
Changelog
[v13.3.2] 2022-10-10 [BACKPORT]
Please note changes in 13.3.2 may not yet be released in future versions, as this is a backport and patch release on the 13.3.x series of releases. Updates that are included in the future will have a corresponding CHANGELOG entry in future releases.
@cumulus/aws-client/S3/moveObject
to handle zero byte files (0 byte files).@cumulus/aws-client/S3ObjectStore
class to take string query parameters and
its methods signGetObject
and signHeadObject
to take parameter presignOptions@cumulus/api-client/collections
and @cumulus/integration-tests/api
to encode
collection version in the URI pathnull
files key is provided in an update payload (e.g. files: null
),
an error will be thrown. null
files were not previously supported and would throw potentially unclear errors. This makes the error clearer and more explicit.files
key, all files will be removed in all datastoresChangelog
[v12.0.3] 2022-10-03 [BACKPORT]
Please note changes in 12.0.3 may not yet be released in future versions, as this is a backport and patch release on the 12.0.x series of releases. Updates that are included in the future will have a corresponding CHANGELOG entry in future releases.
null
files key is provided in an update payload (e.g. files: null
),
an error will be thrown. null
files were not previously supported and would throw potentially unclear errors. This makes the error clearer and more explicit.files
key, all files will be removed in all datastores@cumulus/aws-client/S3ObjectStore
class to take string query parameters and
its methods signGetObject
and signHeadObject
to take parameter presignOptions@cumulus/aws-client/S3/moveObject
to handle zero byte files (0 byte files).@cumulus/api-client/collections
and @cumulus/integration-tests/api
to encode
collection version in the URI pathChangelog
[v11.1.7] 2022-10-05 [BACKPORT]
Please note changes in 11.1.7 may not yet be released in future versions, as this is a backport and patch release on the 11.1.x series of releases. Updates that are included in the future will have a corresponding CHANGELOG entry in future releases.
null
files key is provided in an update payload (e.g. files: null
),
an error will be thrown. null
files were not previously supported and would throw potentially unclear errors. This makes the error clearer and more explicit.files
key, all files will be removed in all datastores@cumulus/aws-client/S3ObjectStore
class to take string query parameters and
its methods signGetObject
and signHeadObject
to take parameter presignOptions@cumulus/aws-client/S3/moveObject
to handle zero byte files (0 byte files).@cumulus/api-client/collections
and @cumulus/integration-tests/api
to encode
collection version in the URI pathChangelog
[v13.3.0] 2022-8-19
GET /granules
endpoint has a new optional query parameter:
searchContext
, which is used to resume listing within the same search
context. It is provided in every response from the endpoint as
meta.searchContext
. The searchContext value must be submitted with every
consequent API call, and must be fetched from each new response to maintain
the context.searchContext
query string parameter allows listing past 10,000 results.from
query param in a request will cause the searchContext
to
be ignored and also make the query subject to the 10,000 results cap again.GET /granules
endpoint to leverage ElasticSearch search-after API.
The endpoint will only use search-after when the searchContext
parameter
is provided in a request.Changelog
[v12.0.2] 2022-08-10 [BACKPORT]
Please note changes in 12.0.2 may not yet be released in future versions, as this is a backport and patch release on the 12.0.x series of releases. Updates that are included in the future will have a corresponding CHANGELOG entry in future releases.
granule_cumulus_id
instead of
cumulus_id
. Previous logic removed files by matching file.cumulus_id
to granule.cumulus_id
.Changelog
[v13.2.1] 2022-8-10 [BACKPORT]
granule_cumulus_id
instead of
cumulus_id
. Previous logic removed files by matching file.cumulus_id
to granule.cumulus_id
.Changelog
[v11.1.5] 2022-08-10 [BACKPORT]
Please note changes in 11.1.5 may not yet be released in future versions, as this is a backport and patch release on the 11.1.x series of releases. Updates that are included in the future will have a corresponding CHANGELOG entry in future releases.
granule_cumulus_id
instead of
cumulus_id
. Previous logic removed files by matching file.cumulus_id
to granule.cumulus_id
.Changelog
[v13.2.0] 2022-8-04
cumulus
moduleingestPdrWithNodeNameSpec.js
to use deleteProvidersAndAllDependenciesByHost
function.deleteProvidersByHost
function.collectionId
in workflow input and
updated task to use said collectionId
to look up the corresponding collection record in RDS.Changelog
[v13.1.0] 2022-7-22
The changes introduced in CUMULUS-2962 will re-introduce a
files_granules_cumulus_id_index
on the files
table in the RDS database.
This index will be automatically created as part of the bootstrap lambda
function on deployment of the data-persistence
module.
In cases where the index is already applied, this update will have no effect.
Please Note: In some cases where ingest is occurring at high volume levels and/or the files table has > 150M file records, the migration may fail on deployment due to timing required to both acquire the table state needed for the migration and time to create the index given the resources available.
For reference a rx.5 large Aurora/RDS database with no activity took roughly 6 minutes to create the index for a file table with 300M records and no active ingest, however timed out when the same migration was attempted in production with possible activity on the table.
If you believe you are subject to the above consideration, you may opt to
manually create the files
table index prior to deploying this version of
Core with the following procedure:
select * from pg_indexes where tablename = 'files';
schemaname | tablename | indexname | tablespace | indexdef
------------+-----------+-------------------------+------------+---------------------------------------------------------------------------------------
public | files | files_pkey | | CREATE UNIQUE INDEX files_pkey ON public.files USING btree (cumulus_id)
public | files | files_bucket_key_unique | | CREATE UNIQUE INDEX files_bucket_key_unique ON public.files USING btree (bucket, key)
In this instance you should not see an indexname
row with
files_granules_cumulus_id_index
as the value. If you do, you should be
clear to proceed with the installation.
Stop all ingest operations in Cumulus Core according to your operational procedures. You should validate that it appears there are no active queries that appear to be inserting granules/files into the database as a secondary method of evaluating the database system state:
select pid, query, state, wait_event_type, wait_event from pg_stat_activity where state = 'active';
If query rows are returned with a query
value that involves the files table,
make sure ingest is halted and no other granule-update activity is running on
the system.
Note: In rare instances if there are hung queries that are unable to resolve, it may be necessary to
manually use psql Server Signaling
Functions
pg_cancel_backend
and/or
pg_terminate_backend
if the migration will not complete in the next step.
Run the following query to create the index. Depending on the situation this may take many minutes to complete, and you will note your CPU load and disk I/O rates increase on your cluster:
CREATE INDEX files_granule_cumulus_id_index ON files (granule_cumulus_id);
You should see a response like:
CREATE INDEX
and can verify the index files_granule_cumulus_id_index
was created:
=> select * from pg_indexes where tablename = 'files';
schemaname | tablename | indexname | tablespace | indexdef
------------+-----------+--------------------------------+------------+----------------------------------------------------------------------------------------------
public | files | files_pkey | | CREATE UNIQUE INDEX files_pkey ON public.files USING btree (cumulus_id)
public | files | files_bucket_key_unique | | CREATE UNIQUE INDEX files_bucket_key_unique ON public.files USING btree (bucket, key)
public | files | files_granule_cumulus_id_index | | CREATE INDEX files_granule_cumulus_id_index ON public.files USING btree (granule_cumulus_id)
(3 rows)
CONCURRENTLY
option for CREATE INDEX
.
This can have significant impacts on CPU/write IO, particularly if you are
already using a significant amount of your cluster resources, and may result
in failed writes or an unexpected index/database state.PostgreSQL's documentation provides more information on this option. Please be aware it is unsupported by Cumulus at this time, so community members that opt to go this route should proceed with caution.
files
table to add an index on granule_cumulus_id
move-granule
task to check the optional collection configuration parameter
meta.granuleMetadataFileExtension
to determine the granule metadata file.
If none is specified, the granule CMR metadata or ISO metadata file is used.CUMULUS-2995
CUMULUS-2863
@cumulus/api
validateAndUpdateSqsRule
method to allow 0 retries and 0 visibilityTimeout
in rule's meta.CUMULUS-2959
@cumulus/api
granules
module to convert numeric productVolume to string
when an old granule record is retrieved from DynamoDBFixed the following links on Cumulus docs' Getting Started page:
Also corrected the How to Deploy Cumulus link in the Glossary