# Archivist Archivist is a key-value abstraction layer generally used with state objects. The player module created through the previous sections already uses Archivist to store data on the local file system; more specifically, the auth module used in the [Actors & Sessions](./index.html#actors-sessions) section of this user guide uses Archivist behind the scenes to store credentials for newly created users. ## Vaults ### List, read and write order > ./config/default.yaml ```yaml archivist: # When doing "list" operations, will attempt each mentioned vault until successful listOrder: - userVault - itemVault # When doing "get" operations, will attempt each mentioned vault until successful readOrder: - userVault - itemVault # When doing "add/set/touch/del" operations, will write to each mentioned vault in the given order writeOrder: - userVault - itemVault ``` listOrder, readOrder, and writeOrder properties **have to** be defined in your configuration file to specify the order when reading or writting data.
For read operations (listOrder, readOrder), mage will attempt the mentioned vaults until one is successful. However, for write operations (writeOrder), mage will write to each mentioned vault, in the given order. > The item won't be written before the player because it will follow the order of the writeOrder configuration ```javascript state.archivist.set('item', { userId: userId }, playerData); state.archivist.set('player', { itemId: itemId }, itemData); ``` Please note that if you are doing multiple operations in a single state transaction, the order in which the operations will be done correspond to the order provided by your configuration, not the order of the actual calls. ### Vaults types > ./config/default.yaml ```yaml archivist: vaults: userVault: type: file config: path: ./filevault/userVault itemVault: type: file config: path: ./filevault/itemVault ``` As mentioned, vaults are used by archivist to store data. Currently, the following backend targets are supported: | Backend | Description | | --------------------- | ------------------------------------------------------------ | | file | Store data to the local disk, in JSON files. | | memory | Keep data in memory (does not persist). | | client | Special vault type (client-side archivist support required). | | couchbase | [Couchbase](https://www.couchbase.com/) interface | | mysql | [MySQL](https://www.mysql.com/) interface. | | redis | [Redis](https://redis.io/) interface. | Vaults can have different configuration for different environments, as long as the Archivist API set used in your project is provided by the different vault backends you wish to use. ### File vault backend The file vault can be used to store data directly in your project. A ommon case for the use of the file vault backend is static data storage. ```yaml type: file config: path: ./filevault disableExpiration: true # optional (default: false) ``` operation | supported | implementation ----------|:---------:|--------------- list | ✔ | `fs.readdir(config.path);` get | ✔ | `fs.readFile('myfile.filevault' and 'myfile.json');` add | ✔ | `fs.writeFile('myfile.filevault' and 'myfile.json');` set | ✔ | `fs.writeFile('myfile.filevault' and 'myfile.json');` touch | ✔ | `fs.readFile('myfile.filevault'); fs.writeFile('myfile.filevault');` del | ✔ | `fs.readFile('myfile.filevault'); fs.unlink('myfile.filevault' and 'myfile.json');` `archivist:create` support is done via `mkdirp` and just requires that the user running the command has enough rights to create folders in the project's path. ### Memory ```yaml type: memory ``` The memory vault backend can be used to keep data in-memory for the duration of the execution of your MAGE instance. Data will not be persisted to disk. operation | supported | implementation ----------|:---------:|--------------- list | ✔ | `for (var trueName in cache) { }` get | ✔ | `deserialize(cache[trueName(fullIndex, topic)])` add | ✔ | `cache[trueName(fullIndex, topic)] = serialize(data)` set | ✔ | `cache[trueName(fullIndex, topic)] = serialize(data)` touch | ✔ | `setTimeout()` del | ✔ | `delete cache[trueName(fullIndex, topic)]` ### Client This vault is used to send updates to the player, so that their data is always synchronized in real time. This vault is always created when an archivist is instantiated by a `State` object, using a name identical to the type: `client`. This vault type requires no configuration. operation | supported | implementation ----------|:---------:|--------------- list | | get | | add | ✔ | `state.emitToActors('archivist:set')` set | ✔ | `state.emitToActors('archivist:set' or 'archivist:applyDiff')` touch | ✔ | `state.emitToActors('archivist:touch')` del | ✔ | `state.emitToActors('archivist:del')` ### Couchbase ```yaml type: couchbase config: options: # List of hosts in the cluster hosts: [ "localhost:8091" ] # Only for Couchbase Server >= 5.0 # User credentials username: Administrator password: "password" # Only for Couchbase Server < 5.0 # Bucket password (optional) password: "toto" # optional bucket: default # optional, useful if you share a bucket with other applications prefix: "bob/" # optional, can use any option specified in https://developer.couchbase.com/documentation/server/5.1/sdk/nodejs/client-settings.html#topic_pkk_vhn_qv__d397e189 options: # usefull to debug network errors (eg. authentication errors) detailed_errcodes: 1 # options only used with archivist:create create: adminUsername: admin adminPassword: "password" bucketType: couchbase # can be couchbase or memcached ramQuotaMB: 100 # how much memory to allocate to the bucket ``` For **Couchbase Server >= 5.0**, `options.user` and `options.password` have to be set to a user who has access to the `options.bucket`. For **Couchbase Server < 5.0**, however, you just need to configure `options.password` which correspond to the bucket password and is **optional**. `create.adminUsername` and `create.password` need to be configured only if you wish to create the underlying bucket through `archivist:create`, or to create views and query indexes through `archivist:migrate`. operation | supported | implementation ----------|:---------:|--------------- list | | get | ✔ | `couchbase.get()` add | ✔ | `couchbase.add()` set | ✔ | `couchbase.set()` touch | ✔ | `couchbase.touch()` del | ✔ | `couchbase.remove()` `archivist:create` support requires a separate `create` entry in the `config`, meanwhile views should be created/managed in migration scripts.

### MySQL ```yaml type: mysql config: options: host: "myhost" user: "myuser" password: "mypassword" database: "mydb" ``` The available connection options are documented in the [node-mysql readme](https://github.com/felixge/node-mysql#connection-options). For pool options please look at [Pool options](https://github.com/felixge/node-mysql#pool-options). operation | supported | implementation ----------|:---------:|--------------- list | ✔ | `SELECT FROM table WHERE partialIndex` get | ✔ | `SELECT FROM table WHERE fullIndex` add | ✔ | `INSERT INTO table SET ?` set | ✔ | `INSERT INTO table SET ? ON DUPLICATE KEY UPDATE ?` touch | | del | ✔ | `DELETE FROM table WHERE fullIndex` `archivist:create` support requires that the user is allowed to create databases. > Sample query to create a basic "people" topic table store. ```sql CREATE TABLE people ( personId INT UNSIGNED NOT NULL, value TEXT NOT NULL, mediaType VARCHAR(255) NOT NULL, PRIMARY KEY (personId) ); ``` Queries against your database are done through a combination of the generated keys and serialized values. A generated key must yield a table name and a primary key. A serialized value must yield a number of column names with their respective values. For instance, given a topic `people` and index `{ personId: 1 }`, the destination table will need to have a `personId` field, but also a `value` field to store data ad a `mediaType` field so that MAGE may know how to process the stored value. > Overriding the serializer ```javascript exports.people.vaults.mysql.serialize = function (value) { return { value: value.setEncoding(['utf8', 'buffer']).data, mediaType: value.mediaType, lastChanged: parseInt(Date.now() / 1000) }; }; ``` If you want to change how this information is stored, by adding columns, etc, you can overload the serializer method to do so. For example, consider the following example if you want to add a timestamp to a `lastChanged INT UNSIGNED NOT NULL` column. ### Redis ```yaml type: redis config: port: 6379 host: "127.0.0.1" options: {} prefix: "key/prefix/" ``` The `options` object is described in the [node-redis readme](https://npmjs.org/package/redis). Both `options` and `prefix` are optional. The option `return_buffers` is turned on by default by the Archivist, because the default serialization will prepend values with meta data (in order to preserve mediaType awareness). operation | supported | implementation ----------|:---------:|--------------- list | | get | ✔ | `redis.get()` add | ✔ | `redis.set('NX')` set | ✔ | `redis.set()` touch | ✔ | `redis.expire()` del | ✔ | `redis.del()` ## Topics > lib/archivist/index.js ```javascript exports.player = { index: ['userId'], vaults: { userVault: {} } }; ``` Topics are essentially Archivist datatypes; they define which vault(s) to use for storage, the key structure for accessing data, and so on. In this example, we simply specify a new topic, called player, in which we will be identifying by userId. ### Add an expiration time In your topic config, you can specify a `ttl` to make your data expires after a certain amount of time. `ttl` should be a string matching one of the following formats: - days: "[num]d" - hours: "[num]h" - minutes: "[num]m" - seconds: "[num]s" > Add an expiration time ```javascript exports.player = { // ... ttl: '1m' // Expire the data after 1 minute }; ``` ## Store & retrieve topics > lib/modules/players/index.js ```javascript exports.create = function (state, userId, playerData) { state.archivist.set('player', { userId: userId }, playerData); }; exports.list = function (state, callback) { var topic = 'player'; var partialIndex = {}; state.archivist.list(topic, partialIndex, function (error, indexes) { if (error) { return callback(error); } var queries = indexes.map(function (index) { return { topic: topic, index: index }; }); state.archivist.mget(queries, callback); }); }; ``` > lib/modules/players/usercommands/register.js ```javascript var mage = require('mage'); exports.acl = ['*']; exports.execute = function (state, username, password, callback) { mage.players.register(state, username, password, function (error, userId) { if (error) { return state.error(error.code, error, callback); } mage.players.create(state, userId, { coins: 10, level: 1, tutorialCompleted: false }); state.respond(userId); return callback(); }); }; ``` > lib/modules/players/usercommands/list.js ```javascript var mage = require('mage') exports.acl = ['*']; exports.execute = function (state, callback) { mage.players.list(state, function (error, players) { // We ignore the error for brievety's sake state.respond(players); callback(); }); }; ``` Again, in this example we are leaving the ACL permissions entirely open so that you may try to manually access them; in the real world, however, you would need to make sure to put the right permissions in here. In this example, we augment the players module we have previously created with two methods: `create`, and `list`. In each method, we use `state.archivist` to retrieve and store data. We then modify the `players.register` user command, and have it create the player's data upon successful registration. Finally, we add a new user command called `players.list`, which will let us see a list of all players' data. You may notice that `players.list` actually calls two functions: `state.archivist.list` and `state.archivist.mget`; this is because `list` will return a list of indexes, which we then feed into `mget` (remember, Archivist works with key-value). You may also notice that while `state.archivist.list` is asynchronous (it requires a callback function), `state.archivist.set` is not; because states act as transactions, writes are not executed against your backend storage until the transaction is completed, thus making write operations synchronous. This will generally be true of all `state.archivist` APIs; reads will be asynchronous, but writes will be synchronous. ## Testing storage ```shell curl -X POST http://127.0.0.1:8080/game/players.list \ --data-binary @- << EOF [] {} EOF ``` ```powershell Invoke-RestMethod -Method Post -Uri "http://127.0.0.1:8080/game/players.list" -Body '[] {}' | ConvertTo-Json ``` We can re-use the previous command to create a new user; once we have done so, we can use the following command to retrieve the data we have just created. ## Key-based filtering > lib/archivist/index.js ```javascript exports.item = { index: ['userId', 'itemId'], vaults: { itemVault: {} } }; ``` > lib/modules/items/index.js ```javascript exports.getItemsForUser = function (state, userId, callback) { var topic = 'item'; var partialIndex = { userId: userId }; state.archivist.scan(topic, partialIndex, callback); }; ``` With certain APIs you can provide only a portion of an index and search for all indexes who have the same value for that option. These indexes are referred to as *partial indexes*. There are a few ways by which you can split and filter the data stored in your topics: 1. You can use [archivist.list](https://mage.github.io/mage/api/classes/archivist.html#list) to list all the indexes matching a given partial index 2. You can use [archivist.mget](https://mage.github.io/mage/api/classes/archivist.html#mget) to fetch multiple indexes at once 3. You can use [archivist.scan](https://mage.github.io/mage/api/classes/archivist.html#scan), which combines both operations mentioned above into one You will generally want to use `scan` for most of your operations, but in some cases, you will find that manually fetching the list of indexes using `list` and applying your own custom filtering before calling `mget` will give you a better result. ## Limiting access > lib/archivist/index.js ```javascript exports.item = { index: ['userId', 'itemId'], vaults: { client: { shard: function (value) { return value.index.userId; }, acl: function (test) { test(['user', 'test'], 'get', { shard: true }); test(['cms', 'admin'], '*'); } }, inventoryVault: {} } }; ``` In most cases, you will want to make sure that a given user will only be able to access data they have the permission to access. There are primarily two ways to limit access to topics: * **shard function**: used to filter what data can be viewed; * **acl function**: used to determine if the data can be accessed; In this example, we use the shard function to limit returned data to only data which matches the userId. We then use the acl function to only allow users and tests access to the `get` API, but full access to CMS users and administrators. ## Database Creation Some vaults also support database/vault creation via the `archivist:create` command as long as admin credentials are properly configured, see each vault's configuration on how to use it. List of `archivist:create` enabled backends: * `file` * `couchbase` * `mysql` It is not recommended to use those features in production. ## Migrations MAGE supports database migration scripts similar to [Ruby on Rails 2.1](http://api.rubyonrails.org/classes/ActiveRecord/Migration.html) which are aligned with your `package.json` version and exposed via the `archivist:migrate` command. Running the `archivist:migrate` command will go through each migrations in order up to the current `package.json` version and apply them. The command also allows specifying an exact version allowing for testing new migrations or reverting to a previous version. ### How to write migration scripts Migration scripts are single files per vault and per version. These files are JavaScript modules and should export two methods: `up` and `down`, to allow migration in two directions. You are strongly encouraged to implement a `down` migration path, but if it's really impossible, you may leave out the `down` method. Keep in mind that **this will block rollback operations**. See the sample on the right side for more details. ```javascript exports.up = function (vault, cb) { var sql = 'CREATE TABLE inventory (\n' + ' actorId VARCHAR(255) NOT NULL PRIMARY KEY,\n' + ' value TEXT NOT NULL,\n' + ' mediaType VARCHAR(255) NOT NULL\n' + ') ENGINE=InnoDB'; vault.pool.query(sql, null, function (error) { if (error) { return cb(error); } return cb(null, { summary: 'Created the inventory table' }); }); }; exports.down = function (vault, cb) { vault.pool.query('DROP TABLE inventory', null, cb); }; ``` The migration file goes to your game's `lib/archivist/migrations` folder into a subfolder per vault. This folder should have the exact same name as your vault does. The migration file you provide should be named after the version in `package.json` and have the extension `.{js,ts}`. Other file extensions are ignored. Some typical examples: * `lib/archivist/migrations//v0.1.0.js` * `lib/archivist/migrations//v0.1.1.js` * `lib/archivist/migrations//v0.2.0.js` The callback of the `up` method allows you to pass a report, that will be stored with the migration itself inside the version history. In MySQL for example, this is all stored in a `schema_migrations` table, which is automatically created. ### How to execute migrations Migrations can be executed by calling some specific CLI commands, which are detailed when you run `npm run help` or `mage --help`. They allow you to create a database, drop a database, and run migrations. This is what they look like: ```plaintext archivist:create [vaults] create database environments for all configured vaults archivist:drop [vaults] destroy database environments for all configured vaults archivist:migrate [version] migrates all vaults to the current version, or to the version requested ``` ## Maximum data size (warning and errors) ```yaml archivist: size: # Default set to 1Mb (1024 kilobytes); set to false to disable warning: 256 # Disabled by default error: 512 ``` MAGE will keep track of all data that is written into vaults, and automatically log a warning when the data size reaches a certain threshold. Optionally, you can also configure MAGE to throw an error should the data reach a certain size. You may configure this behavior by configuring `archivist.size.warning` and `archivist.size.error` in your configuration files. By default, MAGE will never throw an error no matter how big your data gets, but will log a warning should your data be bigger than 1 megabyte.