$shardedDataDistribution (aggregation)
On this page
Definition
$shardedDataDistribution
New in version 6.0.3.
Returns information on the distribution of data in sharded collections.
Note
This aggregation stage is only available on
mongos
.This aggregation stage must be run on the
admin
database. The user must have theshardedDataDistribution
privilege action.
Syntax
The shardedDataDistribution
stage has the following syntax:
db.aggregate( [ { $shardedDataDistribution: { } } ] )
Output Fields
The $shardedDataDistribution
stage outputs an array of documents
for each sharded collection in the database. These documents contain the
following fields:
Field Name | Data Type | Description |
---|---|---|
| string | Namespace of the sharded collection. |
| array | Shards in the collection with the data distribution information for each shard. |
| integer | Number of orphaned documents in the shard. |
| integer | Number of documents owned by the shard. |
| integer | Size in bytes of documents owned by the shard when uncompressed. |
| integer | Size in bytes of orphaned documents in the shard when uncompressed. |
Starting in MongoDB 8.0, $shardedDataDistribution
only returns
output for a collection's primary shard if the primary shard
has chunks or orphaned documents.
Behavior
After an unclean shutdown of a mongod
using the Wired Tiger storage engine, size and count statistics reported by
$shardedDataDistribution
may be inaccurate.
The amount of drift depends on the number of insert, update, or delete
operations performed between the last checkpoint and the unclean shutdown. Checkpoints
usually occur every 60 seconds. However, mongod
instances running
with non-default --syncdelay
settings may have more or less frequent
checkpoints.
Run validate
on each collection on the mongod
to restore statistics after an unclean shutdown.
After an unclean shutdown:
Examples
Return All Sharded Data Distibution Metrics
To return all sharded data distribution metrics, run the following:
db.aggregate([ { $shardedDataDistribution: { } } ])
Example output:
[ { "ns": "test.names", "shards": [ { "shardName": "shard-1", "numOrphanedDocs": 0, "numOwnedDocuments": 6, "ownedSizeBytes": 366, "orphanedSizeBytes": 0 }, { "shardName": "shard-2", "numOrphanedDocs": 0, "numOwnedDocuments": 6, "ownedSizeBytes": 366, "orphanedSizeBytes": 0 } ] } ]
Return Metrics for a Specific Shard
To return sharded data distribution metrics for a specific shard, run the following:
db.aggregate([ { $shardedDataDistribution: { } }, { $match: { "shards.shardName": "<name of the shard>" } } ])
Return Metrics for a Namespace
To return sharded data distribution data for a namespace, run the following:
db.aggregate([ { $shardedDataDistribution: { } }, { $match: { "ns": "<database>.<collection>" } } ])
Confirm No Orphaned Documents Remain
Starting in MongoDB 6.0.3, you can run an aggregation using the
$shardedDataDistribution
stage to confirm no orphaned
documents remain:
db.aggregate([ { $shardedDataDistribution: { } }, { $match: { "ns": "<database>.<collection>" } } ])
$shardedDataDistribution
has output similar to the following:
[ { "ns": "test.names", "shards": [ { "shardName": "shard-1", "numOrphanedDocs": 0, "numOwnedDocuments": 6, "ownedSizeBytes": 366, "orphanedSizeBytes": 0 }, { "shardName": "shard-2", "numOrphanedDocs": 0, "numOwnedDocuments": 6, "ownedSizeBytes": 366, "orphanedSizeBytes": 0 } ] } ]
Ensure that "numOrphanedDocs"
is 0
for each shard in the
cluster.