Author: | James R. McIntosh |
---|---|
Contact: | j.mcintosh@columbia.edu |
Version: | 0.0.8 |
To allow rapid prototyping of SSM extensions, for example for conflict tasks. Code is a synthesis of tricks that I have been using to run drift-diffusion based models for a few years, as well as some taken from the HDDM toolbox.
- Easily extensible framework:
- Inherit from a single function (ssm_def.m)
- Define all your additional model parameters
- Specify the link between data (e.g. trial-trial eeg, visual stimuli) and model inputs
- Write your likelihood function making use of model parameters and model inputs (examples provided)
- Because this is all specified directly, the model structure is very clear
- Model is specified as two binary numbers representing each model parameter. One number represents all the set parameters for this evaluation, the other number represents all the parameters over which to optimise.
- For example, if our full model is s,a,t,v,st then we could define the following submodel: model_id = [1,1,1,1,0], model_search = [0,1,1,1,0], which would initialise a search by setting st to 0 (unused), s to its default value (1) and randomly initialise and then optimise a, t and v.
- This means it's easy to setup complex models and systematically search submodels, or freeze certain parts of the model while allowing some parameters to be free.
- Simple to start optimisation of complex models based on simpler models
- Easy to keep track of many candidate moels
- MCMC estimation available, and cost function minimisation uses a pattern search algorithm (easily modifiable)
- Written in Matlab (a feature for some!). Minimisation of cost function
- Fits are saved as matlab objects - makes it very easy to load previous fits, extend, re-fit, and keep track of the model fitting history.
- Three likelihood calculation methods are currently included as examples:
- Core functions of HDDM (Navarro and Fuss analytical solutions) for speed - when initialising models or only using simple models (although see todo, regarding a slight inconsistency in NF/HDDM method implementation).
- Brute force simulation with density estimation (using matlab's ksdensity) - easy to extend, but slow and prone to local minima due to stochastic nature (although pattern search generally does a good job)
- Calculation decision-variable path with transition matrix iterative multiplication - easy to extend, quite fast with reasonable discretization (apart from for specific parameters, e.g. including noise in drift is a x10 slow down).
Without using an analytical solution we do not have the advantage of being able to generate likelihoods without generating the whole p(rt,accuracy|parameters,conditions). For each trial we consequently create a structure of parameters and conditions e.g. parameters: p(ix_trial).s = 1;p(ix_trial).v = 1; condition:p.(ix_trial).conflict = 1; Then this structure is stripped of duplicates, the likelihood density is estimated, and the relevant accuracies/rt are evaluated from it.
It's not hierarchical. Straying from the SSM means you lose the advantage of analytical solutions - i.e. things get slow(er)! For reasonable computational speed, the transition matrix approach is not quite as accurate as the analytical solutions of NF/HDDM - although considerably better than using brute force simulation. The main deviation of the transition matrix approach to the analytical solution comes from setting the time step too low (see todo, for potential fix).
For example, adding a conflict dependent bias is straight forwards:
- inherit from ssm_def
- modify the method ssm_cost_add_stim_dependencies so that the core SSM functions can make use of conflict
- (e.g. one line: p_mat.c = obj.data.stim_conflict;)
- overload ssm_def_instance with the introduced bias parameter. Set its prior, default and bounds.
- write the core ssm_pdf function (the current implemented version uses a transition matrix approach)
The structure is like this: the modelclass defines the space of all parameters you want to include in this specific version of the SSM. Then there is an id_model and id_search. id_model sets the specific subset of parameters that you want to be non-default (I hope not non-zero). id_search sets the which of these parameters are then optimised (for example, you want noise in the model, but you don't want it to be a free parameter). id_model and id_search are fundamentally binary strings that represent the inclusion of the model, and are converted to decimals and used to save/restore files.
- Download.
- Add the folder containing ssm_def.m to path, and subfolders: addpath(genpath(fullfile('pathtofolder','malleable-SSM'));
- Move the contents of the testing folder to somewhere where you will be working, and delete the empty testing folder.
McIntosh, J. R., & Sajda, P. (2019). Decomposing Simon task BOLD activation using a drift-diffusion model framework. bioRxiv, DOI:809947.
The following functions in the testing folder can be run directly, and should clarify usage: test_compare_likelihoods.m - Plots the likelihood of data after model fitting with the analytical method, and compared it to likelhood of other methods. Deviation is due to discretization of decision variable and time for transition matrix method, and time for brute force method. test_compare_pdf.m - Similar to above, but shows the PDF. test_ssm_run.m - fit the basic SSM with the various methods. test_ssm_run_conflict.m - fit a conflict SSM by initializing with the analytical methods, and then running the transition matrix approach to capture conflict.
To generate your own model with new parameters test_ssm_run_conflict.m is a good start - shows how to include trial conditions (presence of conflict in this case, but could be stimulus strength etc.). Includes the formulation for the transition matrix method, and shows how to add non-standard SSM parameters.
- MCMC could be implemented
- Transition matrix approach (like brute force approach, most commonly used for SSM extensions where there is no analytical solution), suffers if the discretisation is too relaxed. In particular, the peak of the RT distribution is not fully captured if the timestep (dt) is too low. This could be improved substantially, by having dt increment more finely near the peak of the distribution.