db.createCollection()
On this page
Definition
db.createCollection(name, options)
Creates a new collection. For views, see
db.createView()
.Because MongoDB creates a collection implicitly when the collection is first referenced in a command, this method is used primarily for creating new collections that use specific options. For example, you use
db.createCollection()
to create a:New collection that uses document validation.
db.createCollection()
is a wrapper around the database commandcreate
.The
db.createCollection()
method has the following prototype form:db.createCollection( <name>, { capped: <boolean>, timeseries: { // Added in MongoDB 5.0 timeField: <string>, // required for time series collections metaField: <string>, granularity: <string>, bucketMaxSpanSeconds: <number>, // Added in MongoDB 6.3 bucketRoundingSeconds: <number> // Added in MongoDB 6.3 }, expireAfterSeconds: <number>, clusteredIndex: <document>, // Added in MongoDB 5.3 changeStreamPreAndPostImages: <document>, // Added in MongoDB 6.0 size: <number>, max: <number>, storageEngine: <document>, validator: <document>, validationLevel: <string>, validationAction: <string>, indexOptionDefaults: <document>, viewOn: <string>, pipeline: <pipeline>, collation: <document>, writeConcern: <document> } ) The
db.createCollection()
method has the following parameters:ParameterTypeDescriptionname
stringThe name of the collection to create. See Naming Restrictions.options
documentOptional. Configuration options for creating a:
Capped collection
Clustered collection
View
The
options
document contains the following fields:FieldTypeDescriptioncapped
booleanOptional. To create a capped collection, specifytrue
. If you specifytrue
, you must also set a maximum size in thesize
field.timeseries.timeField
stringRequired when creating a time series collection. The name of the field which contains the date in each time series document. Documents in a time series collection must have a valid BSON date as the value for thetimeField
.timeseries.metaField
stringOptional. The name of the field which contains metadata in each time series document. The metadata in the specified field should be data that is used to label a unique series of documents. The metadata should rarely, if ever, change.
The name of the specified field may not be
_id
or the same as thetimeseries.timeField
. The field can be of any type except array.timeseries.granularity
integerOptional. Do not use if setting
bucketRoundingSeconds
andbucketMaxSpanSeconds
. Possible values areseconds
(default),minutes
, andhours
.Set
granularity
to the value that most closely matches the time between consecutive incoming timestamps. This improves performance by optimizing how MongoDB internally stores data in the collection.For more information on granularity and bucket intervals, see Set Granularity for Time Series Data.
timeseries.bucketMaxSpanSeconds
integerOptional. Use with
bucketRoundingSeconds
as an alternative togranularity
. Sets the maximum time between timestamps in the same bucket. Possible values are 1-31536000.New in version 6.3.
timeseries.bucketRoundingSeconds
integerOptional. Use with
bucketMaxSpanSeconds
as an alternative togranularity
. Must be equal tobucketMaxSpanSeconds
.When a document requires a new bucket, MongoDB rounds down the document's timestamp value by this interval to set the minimum time for the bucket.
For example, setting both parameters to
1800
rounds new buckets down to the nearest 30 minutes. If a document with a time of2023-03-27T18:24:35Z
does not fit an existing bucket, MongoDB creates a new bucket with a minimum time of2023-03-27T18:00:00Z
and a maximum time of2023-03-27T18:29:59Z
.New in version 6.3.
expireAfterSeconds
numberOptional. Specifies the seconds after which documents in a time series collection or clustered collection expire. MongoDB deletes expired documents automatically.
For clustered collections, the documents are deleted automatically based on the clustered index key
_id
and the values must be date types. See TTL Indexes.clusteredIndex
documentStarting in MongoDB 5.3, you can create a collection with a clustered index. Collections created with a clustered index are called clustered collections.
clusteredIndex
has the following syntax:clusteredIndex: { key: { <string> }, unique: <boolean>, name: <string> } FieldDescriptionkey
Required. The clustered index key field. Must be set to{ _id: 1 }
. The default value for the_id
field is an automatically generated unique object identifier, but you can set your own clustered index key values.unique
Required. Must be set totrue
. A unique index indicates the collection will not accept inserted or updated documents where the clustered index key value matches an existing value in the index.name
Optional. A name that uniquely identifies the clustered index.New in version 5.3.
changeStreamPreAndPostImages
documentOptional.
Starting in MongoDB 6.0, you can use change stream events to output the version of a document before and after changes (the document pre- and post-images):
The pre-image is the document before it was replaced, updated, or deleted. There is no pre-image for an inserted document.
The post-image is the document after it was inserted, replaced, or updated. There is no post-image for a deleted document.
Enable
changeStreamPreAndPostImages
for a collection usingdb.createCollection()
,create
, orcollMod
.
changeStreamPreAndPostImages
has the following syntax:changeStreamPreAndPostImages: { enabled: <boolean> } enabled
Descriptiontrue
Enables change stream pre- and post-images for a collection.false
Disables change stream pre- and post-images for a collection.For complete examples with the change stream output, see Change Streams with Document Pre- and Post-Images.
For a
db.createCollection()
example on this page, see Create a Collection with Change Stream Pre- and Post-Images for Documents.New in version 6.0.
size
numberOptional. Specify a maximum size in bytes for a capped collection. Once a capped collection reaches its maximum size, MongoDB removes the older documents to make space for the new documents. Thesize
field is required for capped collections and ignored for other collections.max
numberOptional. The maximum number of documents allowed in the capped collection. Thesize
limit takes precedence over this limit. If a capped collection reaches thesize
limit before it reaches the maximum number of documents, MongoDB removes old documents. If you prefer to use themax
limit, ensure that thesize
limit, which is required for a capped collection, is sufficient to contain the maximum number of documents.storageEngine
documentOptional. Available for the WiredTiger storage engine only.
Allows users to specify configuration to the storage engine on a per-collection basis when creating a collection. The value of the
storageEngine
option should take the following form:{ <storage-engine-name>: <options> } Storage engine configuration specified when creating collections are validated and logged to the oplog during replication to support replica sets with members that use different storage engines.
validator
documentOptional. Allows users to specify validation rules or expressions for the collection.
The
validator
option takes a document that specifies the validation rules or expressions. You can specify the expressions using the same operators as the query operators with the exception of$near
,$nearSphere
,$text
, and$where
.To learn how to create a collection with schema validation, see Specify JSON Schema Validation.
validationLevel
stringOptional. Determines how strictly MongoDB applies the validation rules to existing documents during an update.
validationLevel
Description"off"
No validation for inserts or updates."strict"
Default Apply validation rules to all inserts and all updates."moderate"
Apply validation rules to inserts and to updates on existing valid documents. Do not apply rules to updates on existing invalid documents.To see an example that uses
validationLevel
, see Specify Validation Level for Existing Documents.validationAction
stringOptional. Determines whether to
error
on invalid documents or justwarn
about the violations but allow invalid documents to be inserted.Important
Validation of documents only applies to those documents as determined by the
validationLevel
.To see an example that uses
validationAction
, see Choose How to Handle Invalid Documents.indexOptionDefaults
documentOptional. Allows users to specify a default configuration for indexes when creating a collection.
The
indexOptionDefaults
option accepts astorageEngine
document, which should take the following form:{ <storage-engine-name>: <options> } Storage engine configuration specified when creating indexes are validated and logged to the oplog during replication to support replica sets with members that use different storage engines.
viewOn
stringThe name of the source collection or view from which to create a view. For details, seedb.createView()
.pipeline
arrayAn array that consists of the aggregation pipeline stage(s).db.createView()
creates a view by applying the specifiedpipeline
to theviewOn
collection or view. For details, seedb.createView()
.collation
documentSpecifies the default collation for the collection.
Collation allows users to specify language-specific rules for string comparison, such as rules for lettercase and accent marks.
The collation option has the following syntax:
collation: { locale: <string>, caseLevel: <boolean>, caseFirst: <string>, strength: <int>, numericOrdering: <boolean>, alternate: <string>, maxVariable: <string>, backwards: <boolean> } When specifying collation, the
locale
field is mandatory; all other collation fields are optional. For descriptions of the fields, see Collation Document.If you specify a collation at the collection level:
Indexes on that collection will be created with that collation unless the index creation operation explicitly specify a different collation.
Operations on that collection use the collection's default collation unless they explicitly specify a different collation.
You cannot specify multiple collations for an operation. For example, you cannot specify different collations per field, or if performing a find with a sort, you cannot use one collation for the find and another for the sort.
If no collation is specified for the collection or for the operations, MongoDB uses the simple binary comparison used in prior versions for string comparisons.
For a collection, you can only specify the collation during the collection creation. Once set, you cannot modify the collection's default collation.
For an example, see Specify Collation.
writeConcern
documentOptional. A document that expresses the write concern for the operation. Omit to use the default write concern.
When issued on a sharded cluster,
mongos
converts the write concern of thecreate
command and its helperdb.createCollection()
to"majority"
.
Access Control
If the deployment enforces
authentication/authorization,
db.createCollection()
requires the following privileges:
Task | Required Privileges |
---|---|
Create a non-capped collection |
|
Create a capped collection |
|
Create a view |
However, if the user has the |
A user with the readWrite
built in role on the database
has the required privileges to run the listed operations. Either
create a user with the required role
or grant the role to an existing user.
Behavior
Resource Locking
db.createCollection()
obtains an exclusive lock on the
specified collection or view for the duration of the operation. All
subsequent operations on the collection must wait until
db.createCollection()
releases the lock. db.createCollection()
typically holds
this lock for a short time.
Creating a view requires obtaining an additional exclusive lock
on the system.views
collection in the database. This lock blocks
creation or modification of views in the database until the command
completes.
Transactions
Starting in MongoDB 4.4, you can create collections and indexes inside a multi-document transaction if the transaction is not a cross-shard write transaction.
To use db.createCollection()
in a transaction, the transaction must use read
concern "local"
. If you specify a read concern level
other than "local"
, the transaction fails.
Examples
Create a Capped Collection
Capped collections have maximum size or document counts that prevent them from growing beyond maximum thresholds. All capped collections must specify a maximum size and may also specify a maximum document count. MongoDB removes older documents if a collection reaches the maximum size limit before it reaches the maximum document count. Consider the following example:
db.createCollection("log", { capped : true, size : 5242880, max : 5000 } )
This command creates a collection named log
with a maximum size of 5
megabytes and a maximum of 5000 documents.
See Capped Collections for more information about capped collections.
Create a Time Series Collection
To create a time series collection that captures weather data for the past 24 hours, issue this command:
db.createCollection( "weather24h", { timeseries: { timeField: "timestamp", metaField: "data", granularity: "hours" }, expireAfterSeconds: 86400 } )
Alternately, to keep the 24 hour limit but set each bucket to timestamps within the same hour, issue this command:
db.createCollection( "weather24h", { timeseries: { timeField: "timestamp", metaField: "data", bucketMaxSpanSeconds: "3600", bucketRoundingSeconds: "3600" }, expireAfterSeconds: 86400 } )
Create a Clustered Collection
The following db.createCollection()
example adds a
clustered collection named stocks
:
db.createCollection( "stocks", { clusteredIndex: { "key": { _id: 1 }, "unique": true, "name": "stocks clustered key" } } )
In the example, clusteredIndex specifies:
"key": { _id: 1 }
, which sets the clustered index key to the_id
field."unique": true
, which indicates the clustered index key value must be unique."name": "stocks clustered key"
, which sets the clustered index name.
Create a Collection with Change Stream Pre- and Post-Images for Documents
Starting in MongoDB 6.0, you can use change stream events to output the version of a document before and after changes (the document pre- and post-images):
The pre-image is the document before it was replaced, updated, or deleted. There is no pre-image for an inserted document.
The post-image is the document after it was inserted, replaced, or updated. There is no post-image for a deleted document.
Enable
changeStreamPreAndPostImages
for a collection usingdb.createCollection()
,create
, orcollMod
.
The following example creates a collection that has changeStreamPreAndPostImages enabled:
db.createCollection( "temperatureSensor", { changeStreamPreAndPostImages: { enabled: true } } );
Pre- and post-images are not available for a change stream event if the images were:
Not enabled on the collection at the time of a document update or delete operation.
Removed after the pre- and post-image retention time set in
expireAfterSeconds
.The following example sets
expireAfterSeconds
to100
seconds:use admin db.runCommand( { setClusterParameter: { changeStreamOptions: { preAndPostImages: { expireAfterSeconds: 100 } } } } ) The following example returns the current
changeStreamOptions
settings, includingexpireAfterSeconds
:db.adminCommand( { getClusterParameter: "changeStreamOptions" } ) Setting
expireAfterSeconds
tooff
uses the default retention policy: pre- and post-images are retained until the corresponding change stream events are removed from the oplog.If a change stream event is removed from the oplog, then the corresponding pre- and post-images are also deleted regardless of the
expireAfterSeconds
pre- and post-image retention time.
Additional considerations:
Enabling pre- and post-images consumes storage space and adds processing time. Only enable pre- and post-images if you need them.
Limit the change stream event size to less than 16 megabytes. To limit the event size, you can:
Limit the document size to 8 megabytes. You can request pre- and post-images simultaneously in the change stream output if other change stream event fields like
updateDescription
are not large.Request only post-images in the change stream output for documents up to 16 megabytes if other change stream event fields like
updateDescription
are not large.Request only pre-images in the change stream output for documents up to 16 megabytes if:
document updates affect only a small fraction of the document structure or content, and
do not cause a
replace
change event. Areplace
event always includes the post-image.
To request a pre-image, you set
fullDocumentBeforeChange
torequired
orwhenAvailable
indb.collection.watch()
. To request a post-image, you setfullDocument
using the same method.Pre-images are written to the
config.system.preimages
collection.The
config.system.preimages
collection may become large. To limit the collection size, you can setexpireAfterSeconds
time for the pre-images as shown earlier.Pre-images are removed asynchronously by a background process.
Important
Backward-Incompatible Feature
Starting in MongoDB 6.0, if you are using document pre- and post-images
for change streams, you must disable
changeStreamPreAndPostImages for each collection using
the collMod
command before you can downgrade to an earlier
MongoDB version.
Tip
See also:
For change stream events and output, see Change Events.
To watch a collection for changes, see
db.collection.watch()
.For complete examples with the change stream output, see Change Streams with Document Pre- and Post-Images.
Specify Collation
Collation allows users to specify language-specific rules for string comparison, such as rules for lettercase and accent marks.
You can specify collation at the collection or view level. For example, the following operation creates a collection, specifying a collation for the collection (See Collation Document for descriptions of the collation fields):
db.createCollection( "myColl", { collation: { locale: "fr" } } );
This collation will be used by indexes and operations that support
collation unless they explicitly specify a different collation. For
example, insert the following documents into myColl
:
{ _id: 1, category: "café" } { _id: 2, category: "cafe" } { _id: 3, category: "cafE" }
The following operation uses the collection's collation:
db.myColl.find().sort( { category: 1 } )
The operation returns documents in the following order:
{ "_id" : 2, "category" : "cafe" } { "_id" : 3, "category" : "cafE" } { "_id" : 1, "category" : "café" }
The same operation on a collection that uses simple binary collation (i.e. no specific collation set) returns documents in the following order:
{ "_id" : 3, "category" : "cafE" } { "_id" : 2, "category" : "cafe" } { "_id" : 1, "category" : "café" }
Specify Storage Engine Options
You can specify collection-specific storage engine configuration
options when you create a collection with
db.createCollection()
. Consider the following operation:
db.createCollection( "users", { storageEngine: { wiredTiger: { configString: "<option>=<setting>" } } } )
This operation creates a new collection named users
with a
specific configuration string that MongoDB will pass to the
wiredTiger
storage engine. See the WiredTiger documentation of
collection level options
for specific wiredTiger
options.