Lumigo OpenTelemetry Distro for Node.js
This is the source repository of @lumigo/opentelemetry
, Lumigo OpenTelemetry Distribution for Node.js, intended for use with containerized applications.
The Lumigo OpenTelemetry Distribution for Node.js is made of several upstream OpenTelemetry packaged, additional automated quality-assurance and customizations that optimize for no-code injection, meaning that you should need to update exactly zero lines of code in your application in order to make use of the Lumigo OpenTelemetry Distribution.
(See the No-code activation section for auto-instrumentation instructions)
Note: If you are looking for the Lumigo Node.js tracer for AWS Lambda functions, @lumigo/tracer
is the package you should use instead.
Logging support
The Lumigo OpenTelemetry Distribution also allows logging span-correlated records. See the configuration section for details on how to enable this feature.
When using the logging feature, the same set of rules for secret masking applies on the content of the log message, with only LUMIGO_SECRET_MASKING_REGEX
being considered.
Setup
Add @lumigo/opentelemetry as dependency
Add @lumigo/opentelemetry
as a dependency using your preferred package manager:
npm install @lumigo/opentelemetry
or:
yarn add @lumigo/opentelemetry
Environment-based configuration
For both manual and no-code instrumentation, you will need to configure the LUMIGO_TRACER_TOKEN
environment variable with the token value generated for you by the Lumigo platform, under Settings --> Tracing --> Manual tracing
, and the OTEL_SERVICE_NAME
environment variable with the service name you've chosen:
export LUMIGO_TRACER_TOKEN=<token>
export OTEL_SERVICE_NAME=<service name>
Tracer activation
There are two ways to activate the @lumigo/opentelemetry
package: one based on importing the package in code (manual activation), and the other via the environment (no-code activation).
The no-code activation approach is the preferred one.
No-code activation
Note: The instructions in this section are mutually exclusive with those provided in the Manual instrumentation section.
Set the following environment variable for your Node.js process:
export NODE_OPTIONS="${NODE_OPTIONS} -r '@lumigo/opentelemetry'"
The line above avoids overriding any other settings you may have passed via the NODE_OPTIONS
environment variable.
Manual activation
Note: The instructions in this section are mutually exclusive with those provided in the No-code activation section.
Import @lumigo/opentelemetry
at the beginning of your main file:
const lumigo = require("@lumigo/opentelemetry");
import * as lumigo from "@lumigo/opentelemetry";
See Waiting for the initialization of the Lumigo OpenTelemetry Distro regarding initialization behavior.
Setup for npm package.json start script
{
"scripts": {
"start": "LUMIGO_TRACER_TOKEN=<token> OTEL_SERVICE_NAME=<service name> node -r @lumigo/opentelemetry <main_file>.js"
}
}
Configuration
OpenTelemetry configurations
The Lumigo OpenTelemetry Distro for Node.js is made of several upstream OpenTelemetry packages as well as some additional logic and, as such, the environment variables that work with "vanilla" OpenTelemetry work also with the Lumigo OpenTelemetry Distro for Node.js.
Specifically supported are:
Lumigo-specific configurations
@lumigo/opentelemetry
additionally supports the following configuration options as environment variables:
-
LUMIGO_ENABLE_TRACES
- Default: true
. When set to false
, turns off all of the tracing instrumentations. Note that this does not turn off the logging instrumentations, which are controlled by the LUMIGO_ENABLE_LOGS
environment variable.
-
LUMIGO_TRACER_TOKEN=<token>
: Configure the Lumigo token to enable to upload of telemetry to Lumigo; without this environment variable, your Node.js process will not send telemetry to Lumigo.
-
LUMIGO_DEBUG=TRUE
: Enables debug logging
-
LUMIGO_DEBUG_SPANDUMP=<path|console:log|console:error>
: Log all spans collected to the <path>
file or, the value is console:log
or console:error
, to console.log
or console.error
; this is an option intended only for debugging purposes and should not be used in production.
This setting is independent from LUMIGO_DEBUG
, that is, LUMIGO_DEBUG
does not need to additionally be set for LUMIGO_DEBUG_SPANDUMP
to work.
-
LUMIGO_REPORT_DEPENDENCIES=false
: This option disables the built-in dependency reporting to Lumigo SaaS. For more information, refer to the Automated dependency reporting section.
-
LUMIGO_SWITCH_OFF=TRUE
: This option disables the Lumigo OpenTelemetry Distro entirely; no instrumentation will be injected, no tracing data will be collected.
-
LUMIGO_AUTO_FILTER_EMPTY_SQS
: This option enables the automatic filtering of empty SQS messages from being sent to Lumigo SaaS. For more information, refer to the Filtering out empty SQS messages section.
-
LUMIGO_DISABLE_PG_INSTRUMENTATION=true
: This option disables the automatic instrumentation of postgres.
-
LUMIGO_DISABLE_NEST_INSTRUMENTATION=true
: This option disables the automatic instrumentation of @nestjs/core.
-
LUMIGO_SECRET_MASKING_REGEX='["regex1", "regex2"]'
: Prevents Lumigo from sending keys that match the supplied regular expressions in process environment data, HTTP headers, payloads and queries. All regular expressions are case-insensitive. The "magic" value all
will redact everything. By default, Lumigo applies the following regular expressions: [".*pass.*", ".*key.*", ".*secret.*", ".*credential.*", ".*passphrase.*"]
. More fine-grained settings can be applied via the following environment variables, which will override LUMIGO_SECRET_MASKING_REGEX
for a specific type of data:
LUMIGO_SECRET_MASKING_REGEX_HTTP_REQUEST_BODIES
applies secret redaction to HTTP request bodiesLUMIGO_SECRET_MASKING_REGEX_HTTP_REQUEST_HEADERS
applies secret redaction to HTTP request headersLUMIGO_SECRET_MASKING_REGEX_HTTP_QUERY_PARAMS
applies secret redaction to HTTP query parametersLUMIGO_SECRET_MASKING_REGEX_HTTP_RESPONSE_BODIES
applies secret redaction to HTTP response bodiesLUMIGO_SECRET_MASKING_REGEX_HTTP_RESPONSE_HEADERS
applies secret redaction to HTTP response bodiesLUMIGO_SECRET_MASKING_REGEX_ENVIRONMENT
applies secret redaction to process environment variables (that is, the content of process.env
)
-
LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX='["regex1", "regex2"]'
: This option enables the filtering of client and server endpoints through regular expression searches. Fine-tune your settings via the following environment variables, which work in conjunction with LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX
for a specific span type:
LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX_SERVER
applies the regular expression search exclusively to server spans. Searching is performed against the following attributes on a span: url.path
and http.target
.LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX_CLIENT
applies the regular expression search exclusively to client spans. Searching is performed against the following attributes on a span: url.full
and http.url
.
For more information check out Filtering http endpoints.
Logging instrumentation
LUMIGO_ENABLE_LOGS
- Default: false
. When set to true
, turns on the logging instrumentations (currently for the Winston and Bunyan loggers) to capture log-records and send them to Lumigo. Emitted logs will also get injected with the active span context, e.g.:
"body": "Hello Winston!",
"attributes": {
"trace_id": "1fce43bfd3fdde3f1a9ea1adc78b521d",
"span_id": "13c05292d3b5f5e8",
"trace_flags": "01"
}
"severityText": "info",
Note that logging support is applicable only when using versions of the logging libraries listed here.
LUMIGO_DEBUG_LOGDUMP
- similar to LUMIGO_DEBUG_SPANDUMP
, only for logs instead of spans. Effective only when LUMIGO_ENABLE_LOGS
is set to true
.
Execution Tags
Execution Tags allow you to dynamically add dimensions to your invocations so that they can be identified, searched for, and filtered in Lumigo.
For example: in multi-tenanted systems, execution tags are often used to mark with the identifiers of the end-users that trigger them for analysis (e.g., Explore view) and alerting purposes.
Creating Execution Tags
In the Lumigo OpenTelemetry Distro for JS, execution tags are represented as span attributes and, specifically, as span attributes with the lumigo.execution_tags.
prefix.
For example, you could add an execution tag as follows:
import { trace } from '@opentelemetry/api';
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo','bar');
const { trace } = require('@opentelemetry/api');
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo','bar');
Notice that, using OpenTelemetry's trace.getActiveSpan()
API, you do not need to keep track of the current span, you can get it at any point of your program execution.
In OpenTelemetry, span attributes can be strings
, numbers
(double precision floating point or signed 64 bit integer), booleans
(a.k.a. "primitive types"), and arrays of one primitive type (e.g., an array of string, and array of numbers or an array of booleans).
In Lumigo, booleans and numbers are transformed to strings.
IMPORTANT: If you use the Span.setAttribute
API multiple times on the same span to set values for the same key multiple values, you may override previous values rather than adding to them:
import { trace } from '@opentelemetry/api';
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo', 'bar');
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo', 'baz');
const { trace } = require('@opentelemetry/api');
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo', 'bar');
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo', 'baz');
In the snippets above, the foo
execution tag will have in Lumigo only the baz
value!
Multiple values for an execution tag are supported as follows:
import { trace } from '@opentelemetry/api';
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo', ['bar', 'baz']);
const { trace } = require('@opentelemetry/api');
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo', ['bar', 'baz']);
The snippets above will produce in Lumigo the foo
tag having both bar
and baz
values.
Another option to set multiple values is setting execution Tags in different spans of an invocation.
Execution Tags in different spans of an invocation
In Lumigo, multiple spans may be merged together into one invocation, which is the entry that you see, for example, in the Explore view.
The invocation will include all execution tags on all its spans, and merge their values:
const { trace } = require('@opentelemetry/api');
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo','bar');
const tracer = tracerProvider.getTracer(__filename)
const nestedSpan = tracer.startSpan('child_span');
nestedSpan.setAttribute('lumigo.execution_tags.foo','baz');
nestedSpan.end();
const tracer = tracerProvider.getTracer(__filename)
trace.getActiveSpan()?.setAttribute('lumigo.execution_tags.foo','bar');
const tracer = tracerProvider.getTracer(__filename)
const nestedSpan = tracer.startSpan('child_span');
nestedSpan.setAttribute('lumigo.execution_tags.foo','baz');
nestedSpan.end();
In the examples above, the invocation in Lumigo resulting from executing the code will have both bar
and baz
values associated with the foo
execution tag.
Which spans are merged in the same invocation depends on the parent-child relations among those spans.
Explaining this topic is outside the scope of this documentation; a good first read to get deeper into the topic is the Traces documentation of OpenTelemetry.
In case your execution tags on different spans appear on different invocations than what you would expect, get in touch with Lumigo support.
Execution Tag Limitations
- Up to 50 execution tag keys per invocation in Lumigo, irrespective of how many spans are part of the invocation or how many values each execution tag has.
- The
key
of an execution tag cannot contain the .
character; for example: lumigo.execution_tags.my.tag
is not a valid tag. The OpenTelemetry Span.setAttribute()
API will not fail or log warnings, but that will be displayed as my
in Lumigo. - Each execution tag key can be at most 50 characters long; the
lumigo.execution_tags.
prefix does not count against the 50 characters limit. - Each execution tag value can be at most 70 characters long.
Programmatic Errors
Programmatic Errors allow you to customize errors, monitor and troubleshoot issues that should not necessarily interfere with the service.
For example, an application tries to remove a user who doesn't exist. These custom errors can be captured by adding just a few lines of additional code to your application.
Programmatic Errors indicating that a non-fatal error occurred, such as an application error. You can log programmatic errors, track custom error issues, and trigger Alerts.
Creating a Programmatic Error
Programmatic errors are created by adding span events with a custom attribute being set with the key name lumigo.type
.
For example, you could add a programmatic error as follows:
import { trace } from '@opentelemetry/api';
trace.getActiveSpan()?.addEvent('<error-message>', {'lumigo.type': '<error-type>'});
const { trace } = require('@opentelemetry/api');
trace.getActiveSpan()?.addEvent('<error-message>', {'lumigo.type': '<error-type>'});
Supported runtimes
- Node.js: 14.x, 16.x, 18.x, 20.x
Supported packages
Instrumentation | Package | Supported Versions | | | |
---|
| | 14 | 16 | 18 | 20 |
client-sqs | @aws-sdk/client-sqs | 3.525.0 | 3.525.0 | 3.525.0 | 3.525.0 |
grpc-js | @grpc/grpc-js | 1.8.0~1.8.20 | 1.8.0~1.8.20 | 1.8.0~1.8.20 | 1.8.0~1.8.20 |
core | @nestjs/core | | 10.3.2 | 10.3.2 | 10.3.2 |
amqplib | amqplib | 0.9.0~0.10.4 | 0.9.0~0.10.4 | 0.9.0~0.10.4 | 0.9.0~0.10.4 |
aws-sdk | aws-sdk | 2.1533.0~2.1651.0 | 2.1533.0~2.1651.0 | 2.1533.0~2.1651.0 | 2.1533.0~2.1651.0 |
bunyan | bunyan | 1.8.15 | 1.8.15 | 1.8.15 | 1.8.15 |
| | 2.0.5 | 2.0.5 | 2.0.5 | 2.0.5 |
express | express | 4.9.0~4.19.2 | 4.9.0~4.19.2 | 4.9.0~4.19.2 | 4.9.0~4.19.2 |
fastify | fastify | 3.3.0~3.29.5 | 3.3.0~3.29.5 | 3.3.0~3.29.5 | 3.3.0~3.29.5 |
| | 4.0.0 | 4.0.0 | 4.0.0 | 4.0.0 |
| | 4.0.1~4.28.1 | 4.0.1~4.28.1 | 4.0.1~4.28.1 | 4.0.1~4.28.1 |
ioredis | ioredis | 4.0.0~4.28.5 | 4.0.0~4.28.5 | 4.0.0~4.28.5 | 4.0.0~4.28.5 |
| | 5.0.0~5.4.1 | 5.0.0~5.4.1 | 5.0.0~5.4.1 | 5.0.0~5.4.1 |
kafkajs | kafkajs | 2.0.0~2.2.4 | 2.0.0~2.2.4 | 2.0.0~2.2.4 | 2.0.0~2.2.4 |
mongodb | mongodb | 4.17.0~4.17.2 | 3.6.6~3.7.3 | 3.6.6~3.7.3 | 3.6.6~3.7.3 |
| | 5.0.0~5.9.2 | 4.0.0~4.17.2 | 4.0.0~4.17.2 | 4.0.0~4.17.2 |
| | | 5.0.0~5.9.2 | 5.0.0~5.9.2 | 5.0.0~5.9.2 |
| | | 6.0.0~6.3.0 | 6.0.0~6.3.0 | 6.0.0~6.3.0 |
pg | pg | 8.11.3~8.12.0 | 8.11.3~8.12.0 | 8.11.3~8.12.0 | 8.11.3~8.12.0 |
prisma | prisma | 4.2.0~4.16.2 | 4.2.0~4.16.2 | 4.2.0~4.16.2 | 4.2.0~4.16.2 |
| | 5.0.0~5.16.1 | 5.0.0~5.16.1 | 5.0.0~5.16.1 | 5.0.0~5.16.1 |
redis | redis | 4.0.0~4.6.8 | 4.0.0~4.6.14 | 4.0.0~4.6.14 | 4.0.0~4.6.14 |
| | 4.6.10~4.6.14 | | | |
winston | winston | 3.13.0 | 3.13.0 | 3.13.0 | 3.13.0 |
Activating your Prisma client's instrumentation
If you're using Prisma and you would like it instrumented, the only thing you will need to do (aside from activating the tracer, of course) is ensure that your schema file's generator client
has the tracing
preview feature enabled prior to generating the client itself.
generator client {
provider = "prisma-client-js"
previewFeatures = ["tracing"]
}
NOTE: There have been reports of a possible bug that interferes with tracing when multiple Prisma clients have been instantiated, see Prisma issue #20779.
Automated dependency reporting
To provide better support and better data-driven product decisions with respect to which packages to support next, the Lumigo OpenTelemetry Distro for JS will report to Lumigo SaaS on startup the packages and their versions used in this application, together with the OpenTelemetry resource data to enable analytics in terms of which platforms use which dependencies.
The data uploaded to Lumigo is a set of key-value pairs with package name and version.
Similar is available through the tracing data sent to Lumigo, except that this aims at covering dependencies for which the Lumigo OpenTelemetry Distro for JS does not have instrumentation (yet?).
Lumigo's only goal for these analytics data is to be able to give you the instrumentations you need without you needing to tell us!
The dependencies data is sent only when a LUMIGO_TRACER_TOKEN
is present in the process environment, and it can be opted out via the LUMIGO_REPORT_DEPENDENCIES=false
environment variable.
Baseline setup
The Lumigo OpenTelemetry Distro will automatically create the following OpenTelemetry constructs provided to a NodeTraceProvider
.
Resources
A Resource
built from the default OpenTelemetry resource with the sdk...
attributes, plus:
- The
lumigo.distro.version
documenting the version of this package
Additional resource attributes depending on the compute platform.
Amazon Elastic Container Service
cloud.provider
with value aws
cloud.platform
with value aws_ecs
container.name
with, as value, the container name as defined in the task definitioncontainer.id
with, as value, the container id as defined by the underpinning Docker runtime
If the Task Metadata endpoint v4 is available (ECS_CONTAINER_METADATA_URI_V4
env var is set), the following resource attributes as specified in the AWS ECS Resource Semantic conventions are also set:
aws.ecs.container.arn
aws.ecs.cluster.arn
aws.ecs.launchtype
aws.ecs.task.arn
aws.ecs.task.family
aws.ecs.task.revision
Kubernetes resource attributes
k8s.pod.uid
with the Pod identifier, supported for both cgroups v1 and v2
Exporters
Process resource attributes
SDK configuration
-
The following SDK environment variables are supported:
OTEL_SPAN_ATTRIBUTE_VALUE_LENGTH_LIMIT
OTEL_ATTRIBUTE_VALUE_LENGTH_LIMIT
** If the OTEL_SPAN_ATTRIBUTE_VALUE_LENGTH_LIMIT
environment variable is not set, the span attribute size limit will be taken from OTEL_ATTRIBUTE_VALUE_LENGTH_LIMIT
environment variable. The default size limit when both are not set is 2048.
Advanced use cases
Waiting for the initialization of the Lumigo OpenTelemetry Distro
The initialization of the Lumigo OpenTelemetry Distro is performed asynchronously to avoid potentially blocking behavior.
Due to the asynchronous nature of this initialization logic, some CLI or batch-like applications that perform their logic on startup without needing to wait on external request responses may find that they are missing some of the trace data, for example the first span that represents the startup of the application.
For scenarios in which each and every span is required, the Lumigo OpenTelemetry Distro provides a Promise
called init
that you can wait on as follows:
Node.js prior to v18
import * as lumigo from '@lumigo/opentelemetry';
lumigo.init
.then(()=>{
})
.catch(err => {
});
Node.js v18+
import * as lumigo from '@lumigo/opentelemetry';
try {
await lumigo.init;
} catch (err) {
}
Access to the TracerProvider
The Lumigo OpenTelemetry Distro provides access to the TracerProvider
it configures (see the Baseline setup section for more information) through the resolution of the init
promise:
import * as lumigo from '@lumigo/opentelemetry';
import { Resource } from '@opentelemetry/resources';
import { BasicTracerProvider } from '@opentelemetry/sdk-trace-base';
const tracerProvider: BasicTracerProvider = await lumigo.init.tracerProvider;
const resource: Resource = tracerProvider.resource;
Ensure spans are flushed to Lumigo before shutdown
For short-running processes, the BatchProcessor
configured by the Lumigo OpenTelemetry Distro may not ensure that the tracing data are sent to Lumigo (see the baseline setup section for more information).
Through the access to the tracerProvider
, however, it is possible to ensure that all spans are flushed to Lumigo as follows:
import * as lumigo from '@lumigo/opentelemetry';
import { Resource } from '@opentelemetry/resources';
import { BasicTracerProvider } from '@opentelemetry/sdk-trace-base';
const tracerProvider: BasicTracerProvider = (await lumigo.init).tracerProvider;
try {
await tracerProvider.forceFlush();
} catch (err) {
console.error(err);
}
Filtering out empty SQS messages
A common pattern in SQS-based applications is to continuously poll an SQS queue for messages,
and to process them as they arrive.
In order not to clutter the Lumigo platform with empty SQS polling messages, the default behavior is to filter them
out from being sent to Lumigo.
You can change this behavior by setting the boolean environment variable LUMIGO_AUTO_FILTER_EMPTY_SQS
to FALSE
.
The possible variations are (case-insensitive):
LUMIGO_AUTO_FILTER_EMPTY_SQS=TRUE
filter out empty SQS polling messagesLUMIGO_AUTO_FILTER_EMPTY_SQS=FALSE
do not filter out empty SQS polling messages- No environment variable set (default): filter out empty SQS polling messages
Filtering http endpoints
You can selectively filter spans based on HTTP server/client endpoints for various components, not limited to web frameworks.
Global filtering
Set the LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX
environment variable to a list of regex strings. Spans with matching server/client endpoints will not be traced.
Specific Filtering
For exclusive server (inbound) or client (outbound) span filtering, use the environment variables:
LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX_SERVER
LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX_CLIENT
Notes:
- the environment variable must be a valid JSON array of strings, so if you want to match endpoint with the hostname
google.com
the environment variable value should be ["google\\.com"]
. - If we are filtering out an HTTP call to an opentelemetry traced component, every subsequent invocation made by that
component won't be traced either.
Examples:
- Filtering out every incoming HTTP request to the
/login
endpoint (will also match requests such as /login?user=foo
, /login/bar
))):
LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX_SERVER=["\\/login"]
- Filtering out every outgoing HTTP request to the
google.com
domain (will also match requests such as google.com/foo
, bar.google.com
):
LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX_CLIENT=["google\\.com"]
'
- Filtering out every outgoing HTTP request to
https://www.google.com
(will also match requests such as https://www.google.com/
, https://www.google.com/foo
)
LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX_CLIENT=["https:\\/\\/www\\.google\\.com"]
- Filtering out every HTTP request (incoming or outgoing) with the word
login
:
LUMIGO_FILTER_HTTP_ENDPOINTS_REGEX=["login"]
Contributing
For guidelines on contributing, please see CONTRIBUTING.md.