$topN（集計アキュムレータ）

項目一覧

定義
構文
動作
制限事項
例

定義

$topN: バージョン 5.2 で追加。
グループ内の上位n要素の集計を、指定のソート順序に従って返します。グループに含まれる要素がn未満の場合、 $topNはグループ内のすべての要素を返します。

構文

{
   $topN:
      {
         n: <expression>,
         sortBy: { <field1>: <sort order>, <field2>: <sort order> ... },
         output: <expression>
      }
}

nはグループごとの結果数を制限し、定数であるか、$group の_id値に依存する正の整数式である必要があります
sortBy は、 $sortに似た構文で結果の順序を指定します。
outputはグループ内の各要素の出力を表し、任意の式にすることができます。

動作

NULL および欠損値

$topNは null 値を除外しません。
$topNは、出力に保持される欠落値を null に変換します。

db.aggregate( [
   {
      $documents: [
         { playerId: "PlayerA", gameId: "G1", score: 1 },
         { playerId: "PlayerB", gameId: "G1", score: 2 },
         { playerId: "PlayerC", gameId: "G1", score: 3 },
         { playerId: "PlayerD", gameId: "G1"},
         { playerId: "PlayerE", gameId: "G1", score: null }
      ]
   },
   {
      $group:
      {
         _id: "$gameId",
         playerId:
            {
               $topN:
                  {
                     output: [ "$playerId", "$score" ],
                     sortBy: { "score": 1 },
                     n: 3
                  }
            }
      }
   }
] )

この例では、次のことが行われます。

$documentsは、プレイヤーのスコアを含むリテラルドキュメントを作成します。
$groupはドキュメントをgameIdでグループ化します。この例ではgameId 、 G1は 1 つのみです。
PlayerDにはスコアがあり、 PlayerEには null scoreがあります。これらの値は両方とも null と見なされます。
playerIdフィールドとscoreフィールドはoutput : ["$playerId"," $score"]として指定され、配列値として返されます。
sortBy: { "score" : 1 }により、null 値は返されたplayerId配列の前にソートされます。

[
   {
      _id: 'G1',
      playerId: [ [ 'PlayerD', null ], [ 'PlayerE', null ], [ 'PlayerA', 1 ] ]
   }
]

BSON データ型ソート順

異なるタイプをソートする場合、順序を決定するためにBSON データ型の順序が使用されます。例として、値が string と数値で構成されるコレクションを考えてみましょう。

昇順ソートでは、string 値は数値の後にソートされます。
降順ソートでは、string 値が数値の前にソートされます。

db.aggregate( [
   {
      $documents: [
         { playerId: "PlayerA", gameId: "G1", score: 1 },
         { playerId: "PlayerB", gameId: "G1", score: "2" },
         { playerId: "PlayerC", gameId: "G1", score: "" }
      ]
   },
   {
      $group:
         {
            _id: "$gameId",
            playerId: {
               $topN:
               {
                  output: ["$playerId","$score"],
                  sortBy: {"score": -1},
                  n: 3
               }
            }
         }
   }
] )

この例では、次のことが行われます。

PlayerAのスコアは整数です。
PlayerBには string の"2"スコアがあります。
PlayerCには空の string スコアがあります。

並べ替えは{ "score" : -1 }の降順であるため、string リテラル値はPlayerAの数値スコアより前にソートされます。

[
   {
      _id: "G1",
      playerId: [ [ "PlayerB", "2" ], [ "PlayerC", "" ], [ "PlayerA", 1 ] ]
   }
]

制限事項

ウィンドウ関数と集計式のサポート

$topNは集計式としてサポートされていません。

$topNはwindow operatorとしてサポートされています。

メモリ制限に関する考慮事項

$topN集計パイプライン内のグループには100 MBのパイプライン制限が適用されます。個々のグループでこの制限を超えると、集計はエラーで失敗します。

例

以下のドキュメントを持つgamescoresコレクションを考えてみましょう。

db.gamescores.insertMany([
   { playerId: "PlayerA", gameId: "G1", score: 31 },
   { playerId: "PlayerB", gameId: "G1", score: 33 },
   { playerId: "PlayerC", gameId: "G1", score: 99 },
   { playerId: "PlayerD", gameId: "G1", score: 1 },
   { playerId: "PlayerA", gameId: "G2", score: 10 },
   { playerId: "PlayerB", gameId: "G2", score: 14 },
   { playerId: "PlayerC", gameId: "G2", score: 66 },
   { playerId: "PlayerD", gameId: "G2", score: 80 }
])

3 つの最高の`Scores`を見つける

$topNアキュムレータを使用すると、単一のゲームで最もスコアの高いプレイヤーを見つけることができます。

db.gamescores.aggregate( [
   {
      $match : { gameId : "G1" }
   },
   {
      $group:
         {
            _id: "$gameId",
            playerId:
               {
                  $topN:
                  {
                     output: ["$playerId", "$score"],
                     sortBy: { "score": -1 },
                     n:3
                  }
               }
         }
   }
] )

サンプルパイプライン:

$matchを使用して結果を単一のgameIdでフィルタリングします。この場合は、 G1ます。
$groupを使用して結果をgameIdでグループ化します。この場合は、 G1ます。
結果を降順で並べ替えるには、 { "score": -1 }による並べ替えを使用します。
を使用して$topN output : ["$playerId"," $score"]から出力するフィールドを指定します。
$topNを使用して、 n : 3を持つG1のゲームでscoreが最も高い上位 3 つのドキュメントを返します。

この操作は次の結果を返します。

[
   {
      _id: 'G1',
      playerId: [ [ 'PlayerC', 99 ], [ 'PlayerB', 33 ], [ 'PlayerA', 31 ] ]
   }
]

このクエリに相当する SQL は次のとおりです。

SELECT T3.GAMEID,T3.PLAYERID,T3.SCORE
FROM GAMESCORES AS GS
JOIN (SELECT TOP 3
         GAMEID,PLAYERID,SCORE
         FROM GAMESCORES
         WHERE GAMEID = 'G1'
         ORDER BY SCORE DESC) AS T3
            ON GS.GAMEID = T3.GAMEID
GROUP BY T3.GAMEID,T3.PLAYERID,T3.SCORE
   ORDER BY T3.SCORE DESC

複数のゲームにわたる最高スコアドキュメントの検索

$topNアキュムレータを使用して、各ゲームで最もスコアの高いプレイヤーを見つけることができます。

db.gamescores.aggregate( [
      {
         $group:
         { _id: "$gameId", playerId:
            {
               $topN:
                  {
                     output: [ "$playerId","$score" ],
                     sortBy: { "score": -1 },
                     n: 3
                  }
            }
         }
      }
] )

サンプルパイプライン:

$groupを使用して結果をgameIdでグループ化します。
を使用して$topN output : ["$playerId", "$score"]から出力するフィールドを指定します。
結果を降順で並べ替えるには、 { "score": -1 }による並べ替えを使用します。
$topNを使用して、 n: 3を持つゲームごとに、最も高いscoreを持つ上位 3 つのドキュメントを返します。

この操作は次の結果を返します。

[
   {
      _id: 'G1',
      playerId: [ [ 'PlayerC', 99 ], [ 'PlayerB', 33 ], [ 'PlayerA', 31 ] ]
   },
   {
      _id: 'G2',
      playerId: [ [ 'PlayerD', 80 ], [ 'PlayerC', 66 ], [ 'PlayerB', 14 ] ]
   }
]

このクエリに相当する SQL は次のとおりです。

SELECT PLAYERID,GAMEID,SCORE
FROM(
   SELECT ROW_NUMBER() OVER (PARTITION BY GAMEID ORDER BY SCORE DESC) AS GAMERANK,
   GAMEID,PLAYERID,SCORE
   FROM GAMESCORES
) AS T
WHERE GAMERANK <= 3
ORDER BY GAMEID

`$group`のグループキーに基づいて`n`を計算

また、 nの値を動的に割り当てることもできます。この例では、 $cond式はgameIdフィールドで使用されています。

db.gamescores.aggregate([
   {
      $group:
      {
         _id: {"gameId": "$gameId"},
         gamescores:
            {
               $topN:
                  {
                     output: "$score",
                     n: { $cond: { if: {$eq: ["$gameId","G2"] }, then: 1, else: 3 } },
                     sortBy: { "score": -1 }
                  }
            }
      }
   }
] )

サンプルパイプライン:

$groupを使用して結果をgameIdでグループ化します。
を使用して$topN output : "$score"から出力するフィールドを指定します。
gameIdがG2の場合、 nは 1、それ以外の場合、 nは 3 になります。
結果を降順で並べ替えるには、 { "score": -1 }による並べ替えを使用します。

この操作は次の結果を返します。

[
   { _id: { gameId: 'G1' }, gamescores: [ 99, 33, 31 ] },
   { _id: { gameId: 'G2' }, gamescores: [ 80 ] }
]

戻る

$top

$toString

$topN（集計アキュムレータ）

定義

構文

動作

NULL および欠損値

BSON データ型ソート順

制限事項

ウィンドウ関数と集計式のサポート

メモリ制限に関する考慮事項

例

3 つの最高のScoresを見つける

複数のゲームにわたる最高スコア ドキュメントの検索

$groupのグループキーに基づいてnを計算

3 つの最高の`Scores`を見つける

複数のゲームにわたる最高スコアドキュメントの検索

`$group`のグループキーに基づいて`n`を計算