Indexer
public protocol Indexer
Creates a Database index used to then query the database
-
Defines the function that will be called to add an index row to the index. May be called multiple times to create multiple index rows for a single document, or not at all to skip a document from being included in the index.
See also the
Indexer.Map
typealias.Declaration
Swift
typealias AddIndexRow = (_ key: Any, _ value: Any?) -> Void
Parameters
key
The index key. When indexes are queried, the key is used to match documents and sort the query results. This key can be single values or be made up of arrays for more complex querying and sorting needs.
value
An optional value associated with the key. When indexes are queried, the value is included in the result set. Retreiving these values is therefore a more efficient operation than retrieving an entire document if only a few properties from the document are needed.
-
Defines the function that will be called when the database documents are indexed.
See also the
Indexer.map
property.Declaration
Swift
typealias Map = (_ document: Document, _ addIndexRow: @escaping AddIndexRow) -> Void
Parameters
document
The document being indexed.
addIndexRow
closure to be called to add an index row to the index. May be called multiple times to create multiple index rows for a single document, or not at all to skip a document from being included in the index. First parameter is the index key. When indexes are queried, the key is used to match documents and sort the query results. This key can be single values or be made up of arrays for more complex querying and sorting needs. Second parameter is an optional value associated with the key. When indexes are queried, the value is included in the result set. Retreiving these values is therefore a more efficient operation than retrieving an entire document if only a few properties from the document are needed.
-
Defines the function used for MapReduce style indexes to perform the reduce sumarization.
See also the
Index.reduce
property.If provided, the reduce function will be called when the index is queried to summarize the results of a index.
Declaration
Swift
typealias Reduce = (_ keys: [Any], _ values: [Any], _ rereduce: Bool) -> (Any)
Parameters
keys
An array of keys to be reduced (or empty if this is a rereduce)
values
A parallel array of values to be reduced, corresponding 1::1 with the keys.
rereduce
true if the input values are the results of previous reductions.
Return Value
The reduced value; almost always a scalar or small fixed-size object.
-
Name of the index
Declaration
Swift
var name: String
-
Version of the index
Important
Versioning:Once created, a database index is stored as part of the data in database, for this reason, changes to an index require special handling.
If any part of the
map
function orreduce
function change, theversion
string should be changed to ensure the index is properly updated. Failure to change this value when updating the code will lead to unpredicable results.Declaration
Swift
var version: String
-
Map
function that will be called when the database documents are indexed.A map function’s job is to analyze a database document’s contents and identify key/value pairs to be indexed. A given document can add a single key/value pair, multiple key/value pairs, or no key/value pairs to the index, depending on the data and what the index is indexing. The index is then used to query the database, and controls the sort order of the query results.
Important rules for the map function:
- The function must always produce the same output for the same input
The function’s output must be consistant or the index it builds will not produce valid queries. The function must not use external information or state to make it’s decisions.
- The function must be thread-safe
The indexing will take place in a background thread or threads, and may be parallelized.
- The function cannot change external state
In addition to causing thread-safty issues, it is unpredicable as to when the function is called or what documents are sent to it.
Both the key and value passed to
emit
can be any JSON-compatible objects such as strings, numbers, booleans, arrays, dictionaries, and null (NSNull)Sorting:
Sorting in the index depends on the data type of the key. Sorting is done in this order:
- Null
- Boolean: false, then true
- Numbers, in numeric order
- Strings, case-insensitive, following the Unicode Collation Algorithm
- Arrays, compared item-by-item.
- Dictionaries/maps, also compared item-by-item. See warning below.
Details:
Arrays are particularly useful to achieve multiple layers of sorting, since elements are compared in order of their array index. So, all the first array elements are compaired, then all the second elements, etc. The net effect of this type of index can be a sorted structure like:
| Ingredient | Type | |------------|-----------------| | Olive | Black | | Olive | Green | | Olive | Pimento stuffed | | Vermouth | Dry | | Vermouth | Extra Dry | | Vermouth | French | | Vodka | Grey Goose | | Vodka | Stoli | | Vodka | Tito's |
Dictionaries/maps are a particularly poor type to use for sorting since JSON doesn’t specify any ordering of keys, so the order of items is ambiguous. So while using dictionaries as key is technically possible, it is not recommended.
Values:
Emitting a value is not required, but can improve performance when querying large documents. Values are generally a subset of one or more document properties needed when a query is run against the index; for example to display search results in a grid, or to summarize a child record in a master view. This can avoid the cost of retreiving the entire document to paint a UI needing only a few values. Unlike the key element, dictionaries/maps is very useful type as a value, since they can be used to return multiple document properties in the single value emitted element.
It is not recommended that the entire document be returned as a value, even in cases where the entire document will be needed by the consumer of the index. When querying the index, the Id of the document that generated the index row is always returned as part of the query results. Thus the Id can be used to retrive selected documents. Additionally, there is also the option to return the entire document in the query results as well. Retrieving the document during the query is more efficent in both storage and processing time than adding the entire docuement to the index as a value.
There can be instances when it is useful for the query’s document id to be the id of a different docuement than that which emitted the index row. This can be acheived by emiting a dictionary value where the dictionary contains a key/value pair where the key is
_id
, and the value is the document id of the document you want to associate with the index row, rather than the source document id.For example:
let myMap: Indexer.Map = { (document, addIndexRow) in // if the source document contains a "related_documents" property if let relatedDocuments = document["related_documents"] as? [String] { // for each value in "related_documents" (which are related document ids) for relatedDocument in relatedDocuments { // add an index row with a key of type and id, and a value that changes the query results document id // to the related document. let key = [type, document.id] let value = ["_id": relatedDocument] addIndexRow(key, value) } } } let myIndex = Index(name: "myIndex", version: "1", map: myMap)
Using the map above, a query could then be run that would pass a given document Id, then receive back all the documents related to that document.
Declaration
Swift
var map: Map
-
An optional
Reduce
function that will be called when a database index is queried to summarize the results of the query.Reduce functions are the second half of the map/reduce technique. They’re optional, and less commonly used. A reduce function post-processes the indexed key/value pairs generated by the map function, by aggregating the values together and is commonly used to provide counts, totals, or averages.
A reduce function takes an ordered list of key/value pairs, the keys and values from the index, and aggregates them together into a single object, then returns that object.
For indexes that do not emit a value, the value array will be an array of NSNull objects.
Rereduce
The rereduce flag is used when querying large data sets. When the data set is large the underlying system will break the map/reduce into smaller chucks, run the reduce on each chunk, then run reduce again on the reduced chunks. When this happens, the rereduce flag will true, the key array will be empty, and the value array will contain the partial reduced values.
Example
// Given a map function that emits the "type" of each document... let myMap: Indexer.Map = { (document, addIndexRow) in if let type = document["type"] as? String { addIndexRow(type, nil) } } // This reduce function will summarize those types, and output a dictionary // which contains each unique type and the total number of documents of that type. let myReduce: Indexer.Reduce = = { ( keys, values, rereduce) in var result: [String: Int] = [:] // if this is not a rereduce if !rereduce { // count each unique key value if let sKeys = keys as? [String] { for key in sKeys { var count = result[key] ?? 0 count += 1 result[key] = count } } } else { // This is a rereduce, then our value array will be an array of // dictionaries of unique key values and their counts from above. if let counts = values as? [[String: Int]] { // for each result array for count in counts { // for each key in the result for key in count.keys { // count and compile a final result dictionary var count = result[key] ?? 0 count += 1 result[key] = count } } } } // return the unique key values // Note that regardless of the rereduce flag the result is the same data type. This must always be the case. return result } let myIndex = Index(name: "myIndex", version: "1", map: myMap, reduce: myReduce)
Declaration
Swift
var reduce: Reduce?