For GitHub repositories, you can run Datadog Static Code Analysis scans directly on Datadog’s infrastructure. To get started, navigate to the Code Security page.
Scan in CI pipelines
Datadog Static Code Analysis runs in your CI pipelines using the datadog-ci CLI.
First, configure your Datadog API and application keys. Add DD_APP_KEY and DD_API_KEY as secrets. Please ensure your Datadog application key has the code_analysis_read scope.
Next, run Static Code Analysis by following instructions for your chosen CI provider below.
When installing a GitHub App, the following permissions are required to enable certain features:
Content: Read, which allows you to see code snippets displayed in Datadog
Pull Request: Read & Write, which allows Datadog to add feedback for violations directly in your pull requests using pull request comments, as well as open pull requests to fix vulnerabilities
Other source code management providers
If you are using another source code management provider, configure Static Code Analysis to run in your CI pipelines using the datadog-ci CLI tool and upload the results to Datadog.
You must run an analysis of your repository on the default branch before results can begin appearing on the Code Security page.
Customize your configuration
By default, Datadog Static Code Analysis scans your repositories with Datadog’s default rulesets for your programming language(s). You can customize which rulesets or rules to run or ignore, in addition to other parameters. You can customize these settings locally in your repository or within the Datadog App.
Configuration locations
Datadog Static Code Analysis can be configured within Datadog and/or by using a file within your repository’s root directory.
There are three levels of configuration:
Org Level Configuration (Datadog)
Repo Level Configuration (Datadog)
Repo Level Configuration (Repo File)
All three locations use the same YAML format for configuration. These configurations are merged in order using an overlay/patch merge method. For example, lets look at these two sample YAML files:
rulesets:- Arules:foo:ignore:["my_ignored_file.file"]bar:only:["the_only_file.file"]- B
If these YAML files were merged in order, first file with the second, the merge of these YAML files with a overlay/patch method would be the following:
rulesets:- Arules:foo:ignore:["my_ignored_file.file"]args:["my_arg1","my_arg2"]bar:only:["the_only_file.file"]- B
As you can see, the ignore: ["**"] from the first file was overlayed with the ignore: ["my_ignored_file.file"]. This happened because there was a conflict and the second file’s value took precedence due to merge order. The args field from the first file is retained because there is no conflicting value in the second file.
Org level configuration
Configurations at the org level apply to all repositories that are being analyzed and is a good place to define rules that must run or global paths/files to be ignored.
Repository level configuration
Configurations at the repository level apply only to the repository selected. These configurations are merged with the org configuration, with the repository configuration taking precedence. Repository level configurations are a good place to define overrides for repository specific details, or add rules that are specific to only that repo for example.
Repository level configuration (file)
In addition to the configurations provided for the Org and Repository level, you can also define a configuration at the root of your repo in the form of static-analysis.datadog.yml. This file takes precedence over the Repository level configuration defined in Datadog. Repository level file configurations are a useful method to change rule configs and iterate on setup and testing.
Configuration format
The following configuration format applies to all configuration locations: Org level, Repository level, and Repository level (file).
The full structure of a configuration is as follows:
rulesets:- ruleset-name# A ruleset we want to run with default configurations- ruleset-name:# Only apply this ruleset to the following paths/filesonly:- "path/example"- "**/*.file"# Do not apply this ruleset in the following paths/filesignore:- "path/example"- "**/*.file"- ruleset-name:rules:rule-name:# Only apply this rule to the following paths/filesonly:- "path/example"- "**/*.file"# Do not apply this rule to the following paths/filesignore:- "path/example"- "**/*.file"arguments:# Set the rule's argument to value.argument-name:valuerule-name:arguments:# Set different argument values in different subtreesargument-name:# Set the rule's argument to value_1 by default (root path of the repo)/:value_1# Set the rule's argument to value_2 for specific pathspath/example:value_2# Only analyze any ruleset in the following paths/filesonly:- "path/example"- "**/*.file"# Do not analyze any ruleset in the following paths/filesignore:- "path/example"- "**/*.file"
The YAML configuration file supports the following top-level keys:
Property
Type
Description
Default
rulesets
Array
A list of rulesets to analyze. Each element can be either a ruleset name (string) or an object with detailed configuration.
Required
only
Array
A list of file paths or glob patterns. If provided, only matching files are analyzed across all rulesets.
None
ignore
Array
A list of file paths or glob patterns to exclude from analysis across all rulesets.
None
Note: The only and ignore keys here act as file filters that apply to the entire configuration file.
Ruleset configuration
Each entry in the rulesets array can be defined in one of two ways:
Simple Ruleset Declaration: A plain string (for example, ruleset-name) indicates that the ruleset should run with its default settings.
Detailed Ruleset Object: An object where the key is the ruleset name and the value is an object containing additional configuration. The available properties for a detailed ruleset are:
Property
Type
Description
Default
only
Array
File paths or glob patterns. Only files matching these patterns will be processed for this ruleset.
None
ignore
Array
File paths or glob patterns to exclude from analysis for this ruleset.
None
rules
Object
A mapping of individual rule names to their configuration objects.
None
Rule configuration
Within a ruleset’s rules property, each rule is defined by its name and configuration. The properties available for each rule are:
Property
Type
Description
Default
only
Array
File paths or glob patterns. The rule will only be applied to files matching these patterns.
None
ignore
Array
File paths or glob patterns to exclude from the rule’s application.
None
arguments
Object
Parameters and values for the rule. Values can be scalars or specified on a per-path basis.
None
Argument configuration
Rule arguments can be defined in one of two formats:
Static Value: Directly assign a value to an argument.
arguments:argument-name:value
Path-Specific Mapping:
Define different values based on file paths. Use the special key / to denote the default value (applicable at the repository root).
The default argument value when no specific path is matched.
None
specific path
Any
The argument value for files matching the specified path or glob pattern.
None
Example configuration:
rulesets:- python-best-practices- python-security- python-code-style:rules:max-function-lines:# Do not apply the rule max-function-lines to the following filesignore:- "src/main/util/process.py"- "src/main/util/datetime.py"arguments:# Set the max-function-lines rule's threshold to 150 linesmax-lines:150max-class-lines:arguments:# Set different thresholds for the max-class-lines rule in different subtreesmax-lines:# Set the rule's threshold to 200 lines by default (root path of the repo)/:200# Set the rule's threshold to 100 lines in src/main/backendsrc/main/backend:100- python-inclusive- python-django:# Only apply the python-django ruleset to the following pathsonly:- "src/main/backend"- "src/main/django"# Do not apply the python-django ruleset in files matching the following patternignore:- "src/main/backend/util/*.py"# Only analyze source filesonly:- "src/main"- "src/tests"- "**/*.py"# Do not analyze third-party or generated filesignore:- "lib/third_party"- "**/*.generated.py"- "**/*.pb.py"
A list of path prefixes and glob patterns to ignore. Matching files will not be analyzed.
false
only
A list of path prefixes and glob patterns to analyze. Only matching files will be analyzed.
false
ignore-gitignore
Do not use paths listed in the .gitignore file to skip analysis on certain files.
false
false
max-file-size-kb
Ignore files larger than the specified size (in kB units).
false
200
You can include the following ruleset options in the static-analysis.datadog.yml file:
Name
Description
Required
rules
A list of rule configurations for rules belonging to ruleset.
false
ignore
A list of path prefixes and glob patterns to ignore for this specific ruleset. Matching files will not be analyzed.
false
only
A list of path prefixes and glob patterns to analyze for this specific ruleset. Only matching files will be analyzed.
false
You can include the following rule options in the static-analysis.datadog.yml file:
Name
Description
Required
ignore
A list of path prefixes and glob patterns to ignore for this specific rule. Matching files will not be analyzed.
false
only
A list of path prefixes and glob patterns to analyze for this specific rule. Only matching files will be analyzed.
false
arguments
A map of values for rules that support customizable arguments.
false
The map in the arguments field uses an argument’s name as its key, and the values are either strings or maps:
To set a value for the whole repository, you can specify it as a string.
To set different values for different subtrees in the repository, you can specify them as a map from a subtree prefix to the value that the argument will have within that subtree.
Ignoring violations
Ignore for a repository
Add an ignore rule in your static-analysis.datadog.yml file. The example below ignores the rule javascript-express/reduce-server-fingerprinting for all directories.
Add an ignore rule in your static-analysis.datadog.yml file. The example below ignores the rule javascript-express/reduce-server-fingerprinting for this file. For more information on how to ignore by path, see the Customize your configuration section.
To ignore a specific instance of a violation, comment no-dd-sa above the line of code to ignore. This prevents that line from ever producing a violation. For example, in the following Python code snippet, the line foo = 1 would be ignored by Static Code Analysis scans.
#no-dd-safoo=1bar=2
You can also use no-dd-sa to only ignore a particular rule rather than ignoring all rules. To do so, specify the name of the rule you wish to ignore in place of <rule-name> using this template:
no-dd-sa:<rule-name>
For example, in the following JavaScript code snippet, the line my_foo = 1 is analyzed by all rules except for the javascript-code-style/assignment-name rule, which tells the developer to use camelCase instead of snake_case.
If one method succeeds, no further mapping attempts are made. Each mapping method is detailed below.
Identifying the code location in the Software Catalog
The schema version v3 and later of the Software Catalog allows you to add the mapping of your code location for your service. The codeLocations section specifies the location of the repository containing the code and its associated paths.
The paths attribute is a list of globs that should match paths in the repository.
Datadog detects file usage in additional products such as Error Tracking and associate
files with the runtime service. For example, if a service called foo has
a log entry or a stack trace containing a file with a path /modules/foo/bar.py,
it associates files /modules/foo/bar.py to service foo.
Detecting service name in paths and repository names
Datadog detects service names in paths and repository names, and associates the file with the service if a match is found.
For a repository match, if there is a service called myservice and
the repository URL is https://github.com/myorganization/myservice.git, then,
it associates myservice to all files in the repository.
If no repository match is found, Datadog attempts to find a match in the
path of the file. If there is a service named myservice, and the path is /path/to/myservice/foo.py, the file is associated with myservice because the service name is part of the path. If two services are present
in the path, the service name closest to the filename is selected.
Link results to teams
Datadog automatically associates the team attached to a service when a violation or vulnerability is detected. For example, if the file domains/ecommerce/apps/myservice/foo.py
is associated with myservice, then the team myservice will be associated to any violation
detected in this file.
If no services or teams are found, Datadog uses the CODEOWNERS file in your repository. The CODEOWNERS file determines which team owns a file in your Git provider.
Note: You must accurately map your Git provider teams to your Datadog teams for this feature to function properly.
Diff-aware scanning
Diff-aware scanning enables Datadog’s static analyzer to only scan the files modified by a commit in a feature branch. It accelerates scan time significantly by not having the analysis run on every file in the repository for every scan. To enable diff-aware scanning in your CI pipeline, follow these steps:
Make sure your DD_APP_KEY, DD_SITE and DD_API_KEY variables are set in your CI pipeline.
Add a call to datadog-ci git-metadata upload before invoking the static analyzer. This command ensures that Git metadata is available to the Datadog backend. Git metadata is required to calculate the number of files to analyze.
Ensure that the datadog-static-analyzer is invoked with the flag --diff-aware.
Example of commands sequence (these commands must be invoked in your Git repository):
Note: When a diff-aware scan cannot be completed, the entire directory is scanned.
Upload third-party static analysis results to Datadog
SARIF importing has been tested for Snyk, CodeQL, Semgrep, Checkov, Gitleaks, and Sysdig. Reach out to Datadog Support if you experience any issues with other SARIF-compliant tools.