https://github.com/fluent-plugins-nursery/fluent-plugin-kubernetes_metadata_filter
Enrich your fluentd events with Kubernetes metadata
https://github.com/fluent-plugins-nursery/fluent-plugin-kubernetes_metadata_filter
Keywords from Contributors
cncf data-collector fluentd rubygems crash-reporting marshalling code-formatter rubocop static-code-analysis feature-flag
Last synced: about 9 hours ago
JSON representation
Repository metadata
Enrich your fluentd events with Kubernetes metadata
- Host: GitHub
- URL: https://github.com/fluent-plugins-nursery/fluent-plugin-kubernetes_metadata_filter
- Owner: fluent-plugins-nursery
- License: apache-2.0
- Created: 2015-05-08T11:56:38.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2025-09-17T19:30:44.000Z (4 months ago)
- Last Synced: 2025-12-20T22:06:59.146Z (20 days ago)
- Language: Ruby
- Homepage:
- Size: 575 KB
- Stars: 357
- Watchers: 21
- Forks: 167
- Open Issues: 3
- Releases: 6
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
README.md
fluent-plugin-kubernetes_metadata_filter, a plugin for Fluentd
The Kubernetes metadata plugin filter enriches container log records with pod and namespace metadata.
This plugin derives basic metadata about the container that emitted a given log record using the source of the log record. Records from kubernetes containers encode metadata about the container in the file name. The initial metadata derived from the source is used
to lookup additional metadata about the container's associated pod and namespace (e.g. UUIDs, labels, annotations) when the kubernetes_url is configured. If the plugin cannot
authoritatively determine the namespace of the container emitting a log record, it will use an 'orphan' namespace ID in the metadata. This behaviors supports multi-tenant systems
that rely on the authenticity of the namespace for proper log isolation.
Requirements
| fluent-plugin-kubernetes_metadata_filter | fluentd | ruby |
|---|---|---|
| >= 2.10.0 | >= v1.10.0 | >= 2.6 |
| >= 2.5.0 | >= v1.10.0 | >= 2.5 |
| >= 2.0.0 | >= v0.14.20 | >= 2.1 |
| < 2.0.0 | >= v0.12.0 | >= 1.9 |
NOTE: For v0.12 version, you should use 1.x.y version. Please send patch into v0.12 branch if you encountered 1.x version's bug.
NOTE: This documentation is for fluent-plugin-kubernetes_metadata_filter-plugin-elasticsearch 2.x or later. For 1.x documentation, please see v0.12 branch.
Installation
gem install fluent-plugin-kubernetes_metadata_filter
Configuration
Configuration options for fluent.conf are:
kubernetes_url- URL to the API server. Set this to retrieve further kubernetes metadata for logs from kubernetes API server. If not specified, environment variablesKUBERNETES_SERVICE_HOSTandKUBERNETES_SERVICE_PORTwill be used if both are present which is typically true when running fluentd in a pod.apiVersion- API version to use (default:v1)ca_file- path to CA file for Kubernetes server certificate validationverify_ssl- validate SSL certificates (default:true)client_cert- path to a client cert file to authenticate to the API serverclient_key- path to a client key file to authenticate to the API serverbearer_token_file- path to a file containing the bearer token to use for authenticationtag_to_kubernetes_name_regexp- the regular expression used to extract kubernetes metadata (pod name, container name, namespace) from the current fluentd tag.
This must use named capture groups forcontainer_name,pod_name,namespace, and eitherpod_uuid (/var/log/pods)ordocker_id (/var/log/containers)cache_size- size of the cache of Kubernetes metadata to reduce requests to the API server (default:1000)cache_ttl- TTL in seconds of each cached element. Set to negative value to disable TTL eviction (default:3600- 1 hour)ignore_nil- ignore caching if the value is null (default:true)watch- set up a watch on pods on the API server for updates to metadata (default:true)annotation_match- Array of regular expressions matching annotation field names. Matched annotations are added to a log record.allow_orphans- Modify the namespace and namespace id to the values oforphaned_namespace_nameandorphaned_namespace_id
when true (default:true)orphaned_namespace_name- The namespace to associate with records where the namespace can not be determined (default:.orphaned)orphaned_namespace_id- The namespace id to associate with records where the namespace can not be determined (default:orphaned)lookup_from_k8s_field- If the fieldkubernetesis present, lookup the metadata from the given subfields such askubernetes.namespace_name,kubernetes.pod_name, etc. This allows you to avoid having to pass in metadata to lookup in an explicitly formatted tag name or in an explicitly formattedCONTAINER_NAMEvalue. For example, setkubernetes.namespace_name,kubernetes.pod_name,kubernetes.container_name, anddocker.container_idin the record, and the filter will fill in the rest. (default:true)ssl_partial_chain- ifca_fileis for an intermediate CA, or otherwise we do not have the root CA and want
to trust the intermediate CA certs we do have, set this totrue- this corresponds to
theopenssl s_client -partial_chainflag andX509_V_FLAG_PARTIAL_CHAIN(default:false)skip_labels- Skip all label fields from the metadata.skip_pod_labels- Skip only pod label fields from the metadata.skip_namespace_labels- Skip only namespace label fields from the metadata.skip_container_metadata- Skip some of the container data of the metadata. The metadata will not contain the container_image and container_image_id fields.skip_master_url- Skip the master_url field from the metadata.skip_namespace_metadata- Skip the namespace_id field from the metadata. The fetch_namespace_metadata function will be skipped. The plugin will be faster and cpu consumption will be less.stats_interval- The interval to display cache stats (default: 30s). Set to 0 to disable stats collection and loggingwatch_retry_interval- The time interval in seconds for retry backoffs when watch connections fail. (default:10)open_timeout- The time in seconds to wait for a connection to kubernetes service. (default:3)read_timeout- The time in seconds to wait for a read from kubernetes service. (default:10)include_ownerrefs_metadata- If set to true, it will include metadata (kind&name) inkubernetes.ownerrefsabout the controller that owns the pod. (default:false)
Reading from a JSON formatted log files with in_tail and wildcard filenames while respecting the CRI-o log format with the same config you need the fluent-plugin "multi-format-parser":
fluent-gem install fluent-plugin-multi-format-parser
The config block could look like this:
<source>
@type tail
path /var/log/containers/*.log
pos_file fluentd-docker.pos
read_from_head true
tag kubernetes.*
<parse>
@type multi_format
<pattern>
format json
time_key time
time_type string
time_format "%Y-%m-%dT%H:%M:%S.%NZ"
keep_time_key false
</pattern>
<pattern>
format regexp
expression /^(?<time>.+) (?<stream>stdout|stderr)( (?<logtag>.))? (?<log>.*)$/
time_format '%Y-%m-%dT%H:%M:%S.%N%:z'
keep_time_key false
</pattern>
</parse>
</source>
<filter kubernetes.var.log.containers.**.log>
@type kubernetes_metadata
</filter>
<match **>
@type stdout
</match>
Environment variables for Kubernetes
If the name of the Kubernetes node the plugin is running on is set as
an environment variable with the name K8S_NODE_NAME, it will reduce cache
misses and needless calls to the Kubernetes API.
In the Kubernetes container definition, this is easily accomplished by:
env:
- name: K8S_NODE_NAME
valueFrom:
fieldRef:
fieldPath: spec.nodeName
Example input/output
Kubernetes creates symlinks to Docker log files in /var/log/containers/*.log. Docker logs in JSON format.
Assuming following inputs are coming from a log file named /var/log/containers/fabric8-console-controller-98rqc_default_fabric8-console-container-df14e0d5ae4c07284fa636d739c8fc2e6b52bc344658de7d3f08c36a2e804115.log:
{
"log": "2015/05/05 19:54:41 \n",
"stream": "stderr",
"time": "2015-05-05T19:54:41.240447294Z"
}
Then output becomes as belows
{
"log": "2015/05/05 19:54:41 \n",
"stream": "stderr",
"docker": {
"container_id": "df14e0d5ae4c07284fa636d739c8fc2e6b52bc344658de7d3f08c36a2e804115",
}
"kubernetes": {
"host": "jimmi-redhat.localnet",
"pod_name":"fabric8-console-controller-98rqc",
"pod_id": "c76927af-f563-11e4-b32d-54ee7527188d",
"pod_ip": "172.17.0.8",
"container_name": "fabric8-console-container",
"namespace_name": "default",
"namespace_id": "23437884-8e08-4d95-850b-e94378c9b2fd",
"namespace_annotations": {
"fabric8.io/git-commit": "5e1116f63df0bac2a80bdae2ebdc563577bbdf3c"
},
"namespace_labels": {
"product_version": "v1.0.0"
},
"labels": {
"component": "fabric8Console"
}
}
}
Contributing
- Fork it
- Create your feature branch (
git checkout -b my-new-feature) - Commit your changes (
git commit -am 'Add some feature') - Test it (
GEM_HOME=vendor bundle install; GEM_HOME=vendor bundle exec rake test) - Push to the branch (
git push origin my-new-feature) - Create new Pull Request
Copyright
Copyright (c) 2015 jimmidyson
Owner metadata
- Name: fluent-plugins-nursery
- Login: fluent-plugins-nursery
- Email:
- Kind: organization
- Description: Collaborate to maintain Fluentd plugins.
- Website:
- Location:
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/21994554?v=4
- Repositories: 42
- Last ynced at: 2024-03-26T09:15:26.710Z
- Profile URL: https://github.com/fluent-plugins-nursery
GitHub Events
Total
- Issues event: 1
- Delete event: 1
- Issue comment event: 7
- Push event: 3
- Pull request event: 4
- Pull request review event: 2
- Fork event: 2
- Create event: 3
Last Year
- Issues event: 1
- Delete event: 1
- Issue comment event: 7
- Push event: 3
- Pull request event: 4
- Pull request review event: 2
- Fork event: 2
- Create event: 3
Committers metadata
Last synced: about 9 hours ago
Total Commits: 281
Total Committers: 61
Avg Commits per committer: 4.607
Development Distribution Score (DDS): 0.701
Commits in past year: 23
Committers in past year: 3
Avg Commits per committer in past year: 7.667
Development Distribution Score (DDS) in past year: 0.217
| Name | Commits | |
|---|---|---|
| Jeff Cantrill | j****l | 84 |
| Jimmi Dyson | j****n@g****m | 73 |
| Masahiro | w****e@c****m | 18 |
| Rich Megginson | r****s@r****m | 14 |
| Michael Grosser | m****l@g****t | 8 |
| Mike Bryant | m@o****m | 7 |
| Ling Huang | q****8 | 6 |
| simicza | s****a@n****m | 5 |
| Kentaro Hayashi | h****i@c****m | 4 |
| dependabot[bot] | 4****] | 4 |
| Alex Robinson | a****b@g****m | 3 |
| ewolinetz | e****t@r****m | 2 |
| Neal Turett | t****n@g****m | 2 |
| Jared Burns | G****l | 2 |
| Rudi Chiarito | r****i@c****m | 2 |
| Bart Van Bos | b****s@k****e | 2 |
| Gabi Davar | g****o@g****m | 1 |
| Francisco Orselli | 4****o | 1 |
| Chris Knowles | c****s | 1 |
| Arcadiy Ivanov | a****y@i****z | 1 |
| André Bauer | m****k | 1 |
| Andrzej Stencel | a****l@s****m | 1 |
| Alvaro [Andor] | a****r@p****m | 1 |
| Alexej Tessaro | r****r | 1 |
| Aaron U'Ren | a****n | 1 |
| AMO ❤️ ☕ | C****a | 1 |
| Frank Reno | f****o@s****m | 1 |
| Jamie Lennox | j****e@v****u | 1 |
| Jan Wozniak | j****k@r****m | 1 |
| Jesse Olsen | j****n@h****m | 1 |
| and 31 more... | ||
Committer domains:
- redhat.com: 4
- sumologic.com: 2
- cadenza-tech.com: 1
- grosser.it: 1
- ocado.com: 1
- nokia.com: 1
- clear-code.com: 1
- google.com: 1
- clarifai.com: 1
- kbc.be: 1
- ivanov.biz: 1
- pierdelacabeza.com: 1
- vibrato.com.au: 1
- hpe.com: 1
- derjohn.de: 1
- amazon.com: 1
- philo.com: 1
- sl.id.au: 1
- csgo.com: 1
- asx.hu: 1
- nervd.com: 1
- shagabutdinov.com: 1
- daocloud.io: 1
- triplehelix.org: 1
- mynameiswhm.ru: 1
- deis.com: 1
Issue and Pull Request metadata
Last synced: 26 days ago
Total issues: 1
Total pull requests: 4
Average time to close issues: N/A
Average time to close pull requests: 7 days
Total issue authors: 1
Total pull request authors: 2
Average comments per issue: 0.0
Average comments per pull request: 3.25
Merged pull request: 2
Bot issues: 0
Bot pull requests: 2
Past year issues: 1
Past year pull requests: 4
Past year average time to close issues: N/A
Past year average time to close pull requests: 7 days
Past year issue authors: 1
Past year pull request authors: 2
Past year average comments per issue: 0.0
Past year average comments per pull request: 3.25
Past year merged pull request: 2
Past year bot issues: 0
Past year bot pull requests: 2
Top Issue Authors
- kenhys (1)
Top Pull Request Authors
- kenhys (2)
- dependabot[bot] (2)
Top Issue Labels
Top Pull Request Labels
- dependencies (2)
- ruby (2)
Package metadata
- Total packages: 2
-
Total downloads:
- rubygems: 287,170,084 total
- Total docker downloads: 1,989,108,066
- Total dependent packages: 3 (may contain duplicates)
- Total dependent repositories: 515 (may contain duplicates)
- Total versions: 200
- Total maintainers: 3
gem.coop: fluent-plugin-kubernetes_metadata_filter
Filter plugin to add Kubernetes metadata
- Homepage: https://github.com/fluent-plugins-nursery/fluent-plugin-kubernetes_metadata_filter
- Documentation: http://www.rubydoc.info/gems/fluent-plugin-kubernetes_metadata_filter/
- Licenses: Apache-2.0
- Latest release: 3.8.0 (published 5 months ago)
- Last Synced: 2026-01-09T08:10:06.863Z (about 24 hours ago)
- Versions: 100
- Dependent Packages: 0
- Dependent Repositories: 0
- Downloads: 143,597,932 Total
- Docker Downloads: 994,554,033
-
Rankings:
- Dependent repos count: 0.0%
- Dependent packages count: 0.0%
- Average: 0.065%
- Docker downloads count: 0.088%
- Downloads: 0.171%
- Maintainers (3)
rubygems.org: fluent-plugin-kubernetes_metadata_filter
Filter plugin to add Kubernetes metadata
- Homepage: https://github.com/fluent-plugins-nursery/fluent-plugin-kubernetes_metadata_filter
- Documentation: http://www.rubydoc.info/gems/fluent-plugin-kubernetes_metadata_filter/
- Licenses: Apache-2.0
- Latest release: 3.8.0 (published 5 months ago)
- Last Synced: 2026-01-08T17:29:19.931Z (1 day ago)
- Versions: 100
- Dependent Packages: 3
- Dependent Repositories: 515
- Downloads: 143,572,152 Total
- Docker Downloads: 994,554,033
-
Rankings:
- Docker downloads count: 0.12%
- Downloads: 0.183%
- Dependent repos count: 1.521%
- Average: 1.82%
- Forks count: 2.034%
- Stargazers count: 3.04%
- Dependent packages count: 4.021%
- Maintainers (3)
Dependencies
- codeclimate-test-reporter < 1.0.0 development
- rubocop >= 0
- addressable 2.8.0
- ast 2.4.2
- bump 0.10.0
- charlock_holmes 0.7.7
- codeclimate-test-reporter 0.6.0
- concurrent-ruby 1.1.10
- cool.io 1.7.1
- copyright-header 1.0.22
- crack 0.4.5
- docile 1.4.0
- domain_name 0.5.20190701
- escape_utils 1.2.2
- ffi 1.15.5
- ffi-compiler 1.0.1
- fluent-plugin-kubernetes_metadata_filter 3.1.3
- fluentd 1.15.3
- github-linguist 7.21.0
- hashdiff 1.0.1
- http 4.4.1
- http-accept 1.7.0
- http-cookie 1.0.5
- http-form_data 2.3.0
- http-parser 1.2.3
- http_parser.rb 0.8.0
- jsonpath 1.1.2
- kubeclient 4.10.1
- lru_redux 1.1.0
- mime-types 3.4.1
- mime-types-data 3.2022.0105
- mini_mime 1.1.2
- minitest 4.7.5
- msgpack 1.6.0
- multi_json 1.15.0
- netrc 0.11.0
- parallel 1.22.1
- parser 3.1.2.0
- power_assert 2.0.2
- public_suffix 4.0.7
- rainbow 3.1.1
- rake 13.0.6
- recursive-open-struct 1.1.3
- regexp_parser 2.5.0
- rest-client 2.1.0
- rexml 3.2.5
- rr 3.0.9
- rubocop 1.28.2
- rubocop-ast 1.17.0
- ruby-progressbar 1.11.0
- rugged 1.4.3
- serverengine 2.3.0
- sigdump 0.2.4
- simplecov 0.21.2
- simplecov-html 0.12.3
- simplecov_json_formatter 0.1.4
- strptime 0.2.5
- test-unit 3.5.5
- test-unit-rr 1.0.5
- tzinfo 2.0.5
- tzinfo-data 1.2022.6
- unf 0.1.4
- unf_ext 0.0.8.2
- unicode-display_width 2.2.0
- vcr 6.0.0
- webmock 3.14.0
- webrick 1.7.0
- yajl-ruby 1.4.3
- bump >= 0 development
- bundler ~> 2.0 development
- copyright-header >= 0 development
- minitest ~> 4.0 development
- rake >= 0 development
- test-unit ~> 3.5.5 development
- test-unit-rr ~> 1.0.3 development
- vcr >= 0 development
- webmock >= 0 development
- yajl-ruby >= 0 development
- fluentd >= 0.14.0, < 1.16
- kubeclient >= 4.0.0, < 5.0.0
- lru_redux >= 0
Score: 31.542785841055608