µCell - Interdisciplinary Research in Modelling and Simulation of Cell Spatial Behaviour
|
by Dominic Orchard, Jonathan Gover, Lee Lewis Herrington, James Lohr, Duncan Stead, Cathy Young and Sara Kalvala,[1] Department of Computer Science, University of Warwick
AbstractA central aim of systems biology is the strengthening of quantitative and qualitative knowledge of biological systems by studying the interactions between components and processes that lead to emerging properties and behaviours. Systems biology proliferated over the latter half of the twentieth century with the aid of technological advances and subsequent interdisciplinary research between natural scientists, computer scientists, and mathematicians. In this paper we present μCell, an interdisciplinary research project undertaken by undergraduates at the University of Warwick, seeking to aid systems biology intuition. The project's main contribution is a modelling and simulation tool for multi-cellular environments aimed at simulating higher-level cellular behaviours based on the interoperation of biochemical signalling pathway models and procedural models of cell components and structures, such as flagella. Based on these interoperated models, μCell is able to simulate spatial properties and behaviours of cells, such as chemotaxis. This paper introduces μCell, gives a case study model and simulation of flagella-based chemotaxis in E. coli, and discusses the pedagogical outcomes of the project for the students. Keywords: μCell, systems biology, biological modelling, simulation, model interoperation, chemotaxis.
IntroductionTechnological and scientific progress has yielded successively higher resolution techniques for the observation and manipulation of cells, aiding biological research. However, there is still much to be understood. The interaction of cells with their environment and with each other, via processes such as adhesion, movement, and quorum sensing, induces behaviour such as cell migration, aggregation of cells into tissues, precision growth across relatively large distances, and group behaviour such as fruiting body formation (Hynes and Lander, 1992). A thorough understanding of the cause and control of such behaviours is difficult due to the complexity of cells and their interactions. Studying the components and processes of a cell independently from other components and processes often fails to expose the full spectrum of a cell's properties and behaviour. The study of emergent behaviours and properties at the cellular and organismal level requires an understanding of the dynamic interactions and causal relationships between individual lower-level processes within a cell, and between cells. Such study is the focus of the burgeoning field of systems biology (Kitano, 2002). In the last decade, interdisciplinary research between natural scientists, computer scientists, and mathematicians has greatly improved understanding through computational analyses, modelling, and simulation. We present here the μCell modelling and simulation tool for multi-cellular environments where spatial behaviours can be simulated from models of biochemical signalling pathways interoperated with abstract procedural models of cell components, such as flagella. The models are procedural in the sense that they describe the operations of a component in the form of a program procedure. This model-interoperation approach contrasts with the approach of deriving spatial simulations purely from abstract statistical models, such as in the Cellular Potts Model (François and Glazier, 1992). μCell's main contribution is its approach to interoperation between signalling models and procedural models of cell components. The aim of μCell is to improve understanding of the mediating role of cell biochemistry to spatial behaviour. This paper introduces μCell and, as an example, discusses the modelling and simulation of flagella-based chemotaxis, a form of cell motility, in E. coli bacteria. μCell is the result of interdisciplinary research undertaken by a group of six final-year undergraduate students in the Department of Computer Science at the University of Warwick during 2007/2008 for the fulfilment of the final-year group project requirement for the MEng Computer Science course[2]. The students involved chose to use this opportunity to learn about a different scientific discipline and do significant research rather than concentrate on software development. The University of Warwick is a leading university in terms of interdisciplinary biological research, with the Warwick Systems Biology Centre[3], Warwick Complexity Complex[4], and the Molecular Organisation and Assembly in Cells Centre[5]. Furthermore, the University of Warwick is a forerunner in undergraduate research with the Undergraduate Research Scholarship Scheme[6], and the Reinvention Centre for Undergraduate Research[7]and its journal[8], with opportunities for research-based projects in many undergraduate courses. The project was supervised by Sara Kalvala, a computer scientist with research interests in the field of computational biology and conducted under the advisement of David Whitworth, a biologist involved in systems biology research. Central aims of the MEng group project are to:
This paper is structured as follows. The Materials and Methods section describes the materials and methods, including the computer science methods and software development techniques employed in developing μCell, the key components of the μCell software, and the example models used to simulate chemotaxis. The results section describes the results of chemotaxis simulations in μCell and the learning outcomes of the project. We then give a discussion of related work, followed by further work and conclusions.
Materials and MethodsThe Development of μCellThe team had a diverse range of software development and computer science skills that were employed in developing μCell, incorporating 2D and 3D graphical programming, XML data handling, parsing and handling syntax trees, object-oriented data patterns, and functional programming. The biological aspect of the project presented the team with a significant challenge due to insufficient understanding of the problem domain as no member of the team had a background in biology. The agile software development methodology (in particular extreme programming (XP) (Beck 1999)) was therefore employed to minimise the risk of not satisfying the requirements of the software. Agile methodologies such as XP promote multiple, short development iterations throughout a project with frequent evaluation phases. Change is facilitated as the team develops a deeper conceptual understanding of the task, which is informed by their own development and exploration. Prototyping was used to learn the biological background and explore the core component requirements of μCell. Learning was enhanced through enrolment in a level M computational biology module[9] by all team members. The foundational components of the μCell program (such as the data structures, data parser, formula parser, and simulator) were developed using the test-driven development approach, where the design process for a program component's functionality is followed by development of test code, giving a procedural specification for the intended functionality of a component prior to implementation. Extensive tests form a basis for testing the functional equivalence of code between development iterations, thus enabling safe refactoring and reimplementation. The team used the object-oriented language C# for its speed and cross-platform capabilities. The team employed standard collaborative development tools such as Subversion for source code and documentation version control, BaseCamp project management for online collaborative design and decision making, and the Eventum bug tracking system for collecting bugs and assigning members to resolve the issues. The Visual Studio integrated development environment and the MonoDevelop environment were used as development platforms.
|
|
|
Control |
Gradient |
|
|
Concentration (moles x10-6) |
0 |
10 densley centred sphere |
|
|
Mean tumble duration (s) |
1.52 +/- 2.77 |
1.52 +/- 2.73 |
|
|
Mean run duration (s) |
2.25 +/- 6.25 |
4.55 +/- 0.42 |
|
|
Max run duration (s) |
5.82 +/- 1.10 |
29.68 +/- 6.76 |
|
|
Max tumble duration (s) |
2.05 +/- 1.33 |
2.05 +/- 1.33 |
Figure 11. Experiment 1 results (to 2 decimal place) +/- mean deviation
Comparing Results of Experiment 1 to Real Experimental Data
We compared the results from experiment 1 with real-world experimental data for chemotaxis of an E. coli "wild type" in a capillary tube with a concentration gradient of 10 micro moles of serine from (Rojdestvenski, 2003):
|
|
Control |
Gradient |
|
|
Concentration (moles x10-6) |
0 |
10 |
|
|
Mean run duration (s) |
0.83 +/- 0.88 |
1.67 +/- 2.56 |
|
Although these measurements differ from our simulation results, both exhibit the same behaviour of run duration roughly doubling in the presence of a gradient. In fact, we can use experimental data such as this to adjust the model parameters. In this case, the tumble update frequency can be scaled according to the magnitude difference between the μCell results and the real-world results. Using a tumble update frequency of 1.355 instead of the previous 0.5, μCell measures the following:
|
|
Control |
Gradient |
|
|
Concentration (moles x10-6) |
0 |
10 |
|
|
Mean run duration (s) |
0.83 +/- 1.44 |
1.74 +/- 0.33 |
Note, however, that the average deviation is very different between the experimental data and the data collected from μCell. We conjecture that this is a result of the approximated adaption pathway in our pathway model for chemotaxis. The approximated adaption pathway results in overly quick adaption to the magnitude of the relevant concentration. Therefore, the mean deviation of a cell's tumble duration when in a gradient is lower than expected as the pathway adapts more quickly to a higher concentration. The probability weightings of the flagella model may also require further tuning. Further collaboration with biologists would be needed to tune the models based on results from real-world, in vivo experiments.
Experiment 2
Figure 12 shows that in our simulation the mean attractant over all cells, in all three experiments, increased in the presence of a concentration gradient and the first derivative of the curve decreased as the cells reached the area of greatest nutrient concentration. The curve is irregular at its limit due to tumbling and short runs resulting in occasional forays into areas of slightly lower concentration.

Figure 12. Mean attractant across all cells increased as they moved up the concentration gradient, eventually reaching a plateau as cells converge at the area of highest concentration.
Figure 13 shows one set of visual results from the 3D spatial environment for the first 400 seconds of the simulated time. In the control situation (Figure 13(b)), the cells wander randomly with no attractant to promote directed movement. Conversely, in the presence of a concentration gradient (Figure 13(a)), almost all the cells converge in the high concentration area in the environment's centre. There are a few cells that have not reached the centre due to the fact that chemotaxis, even though it is a biased walk, is a random walk and therefore can be unsuccessful for a long time.
| 0s | ![]() |
![]() |
| 100s | ![]() |
![]() |
| 200s | ![]() |
![]() |
| 300s | ![]() |
![]() |
| 400s | ![]() |
![]() |
| (a). Cells in 10 micromoles of centre weighted attractant presenting chemotaxis towards the high concentration area. | (b). Cells in 0.0 moles of attractant presenting no chemotaxis. |
Figure 13. Screen captures of the 3D spatial environment shown from the front during simulation at approximately 100 second intervals.
The µCell software
The μCell tool developed during the project was successful in its goal to be practical and usable for the modelling and simulation of cell behaviour in spatial environments. The tool is easy to use and may also be useful as a teaching tool in biology. The tool is now freely available as open-source software and has a public code repository and 'Wiki’[11]. A video is available showing μCell being used in a chemotaxis experiment[12]. Development of the tool has continued, albeit slowly, since the formal end of the project, and we expect to collaborate further with biologists to improve its usefulness. The μCell tool will be discussed in the context of related work in the next section.
Learning outcomes
The team's study of the undergraduate computational biology course and the work on the μCell project were mutually beneficial, promoting research as learning, and providing domain-specific knowledge for interdisciplinary research. Biological insights were developed parallel to the computer science skills acquired and developed during the project, including: collaborative software development, use of the C# language, and the use of tools for software development, version control, and bug tracking. Additionally, skills in project management, communication, writing, and collaboration were learnt and reinforced.
The team thoroughly enjoyed the project and benefited greatly from being involved in a different area of scientific research from computer science. The team was exposed to biological research, and more generally to methodologies and common practices in another scientific field. Group members gained an understanding of common cell processes, biochemistry, signalling pathways, diffusion mechanics, flagella mechanics, SBML, and of the issues in computational modelling and simulation of cells.
The project was deemed to have satisfied the requirements for the MEng group project and all students received a first class grade for the work.
Related Work
There are numerous tools that provide modelling and cell design abilities and/or simulation features to aid research in biology. A few projects that share some commonality with μCell will be discussed here.
The modelling and simulation of signalling pathways can be performed with many existing tools; the SBML website provides an extensive list of software supporting SBML models[13]. μCell does not claim to improve upon these tools in terms of speed of simulation or SBML-support but is novel, as far the authors are aware, in providing a tool for combining pathway models with pre-defined models of cell components for the simulation of spatial behaviour.
Virtual Cell, an advanced tool for cell simulations based on layers of models, performs complex spatial simulations of compartments and membranes within a single cell derived from real geometric data (Moraru et al., 2008). The spatial simulations within Virtual Cell are at a finer granularity than μCell's spatial simulations, which are at the multi-cellular level.
The BioSPICE open source framework for modelling and simulation in systems biology is an extendable framework with a multitude of pluggable modules that can be configured to interoperate (Kumar & Feidler, 2003). Systems Biology Workbench (SBW) is a grand effort to combine the best tools from all aspects of biological modelling into a single complete framework to enable interoperation between tools via standardisation (Hucka et al., 2001). The SBML standard is at the heart of this endeavour, originally introduced by the same organisation. BioSPICE and SBW are both formidable systems that are at times difficult to use, with a high barrier of entry for inexperienced users. μCell is more specific and succinct in its purposes and features, although the spirit of interoperation is shared at a smaller scale of model interoperation.
Another approach to modelling and simulation of spatial behaviours is through the programming of custom models, either through generalised modelling tools or programming languages. For example, NetLogo (Tisue & Wilensky, 2004) provides a generalised programmable modelling environment into which μCell-like models and simulations could be programmed. Other spatial models have been implemented using custom programming languages, such as the simulation of aggregation in Dictyostelium molds using a specially developed "cell" programming language (Agarwal, 1994). μCell provides specialised modelling and simulation tools for emerging behaviours of cells without requiring any programming experience from the user.
The CompuCell3D modelling environment provides 2D and 3D visualisations of various cell models (Cickovski et al., 2005), including cellular Potts model simulations of morphogenesis – the shaping of cells. CompuCell3D is also capable of simulating other processes, such as clustering, cell death, cell haptotaxis (the directional growth of cells), and chemotaxis. CompuCell3D uses custom modules written in C++ and Python, requiring programming expertise from the user and custom rule-bases to produce models and simulations, as opposed to providing pre-defined models and using lower-level biochemical dynamics to drive these processes as in μCell.
Spatial behaviours have been modelled and simulated using statistical models, such as extended Potts models (François & Glazier, 1992), whose behaviours are not mediated by signalling pathways as in μCell.
Further Work
There is a considerable amount of potential further work. A more extensive list with outlined research proposals for student projects is available on the μCell Wiki[14]. Here we briefly discuss a few areas of work.
Currently, the numerical accuracy of measurements within μCell is correct up to the definition of the models, but measurements taken from simulations may not match measurements from in vivo experiments. As suggested in Comparing Results of Experiment 1 to Real Experimental Data above, the parameters of models can be tweaked to give data that matches experimental data to an extent, but further collaboration with biologists is required if more realistic measurements are to be achieved. An example usage of μCell in research may be the modification of a signalling pathway due to a genetic fault and to test the effects of this modification on the emerging behaviours of the cell. For example, in E. coli chemotaxis, if CheY cannot phosphorylate quickly enough due to some unusual inhibition relation, will chemotaxis occur?
Currently, there are only procedural models for flagella bundles, receptors, and cell bodies (which facilitate cell collisions). This set should be extended to include components for modelling further processes such as cell growth, death, mitosis and cytokinesis (cell division), haptotaxis, and cell excretion. Furthermore, there should be tools for the user to develop their own procedural models of cell components. This may involve some form of synthetic language, or construction tool that should be sufficiently flexible to allow the definition of current and future models.
Additionally there are many improvements that can be made to μCell within current requirements to improve the user experience, such as more comprehensive error messages, improved saving and loading facilities, and the fixing of known bugs in the user interface.
Conclusions
This paper has introduced the μCell software tool for modelling and simulation of multi-cellular environments with a particular focus on the spatial behaviour of cells. The spatial behaviour of cells can be modelled in μCell via the interoperation of signalling pathway models (defined in the SBML data format) with pre-defined procedural models of cell components, such as flagella bundles. This approach to spatial simulations, controlled by signalling pathways, differs from stochastic approaches or cellular automata based approaches. The capabilities of this tool were demonstrated with two experiments showing μCell's ability to model and simulate E. coli chemotaxis. A user of μCell is not required to do any programming of their own as the procedural models of cell components are built-in and configurable.
At a meta-level, this paper has also presented a case study of interdisciplinary research at undergraduate level. The research project was undertaken by a group of six Level 4 computer science students at the University of Warwick. Their initial lack of biological knowledge required them to undertake their own learning, through courses, books, papers, and discussions with biologists in order to understand the problem domain. Thus, an outcome of this project, apart from the μCell software, has been the increase in the students’ scientific knowledge, and proliferation of skills in research.
It is our hope that the successful development of μCell and the outcomes described in this paper inspire further students to take on interdisciplinary research at both undergraduate and postgraduate level.
Acknowledgements
Many thanks are due to David Whitworth for all his help during the project. Thank you to all the biologists and biochemists who endured our many questions. Thanks also to the helpful comments of the anonymous referees.
Appendix A - Interpolation using Gaussian Weighting
The discretisation of the continuous simulation environment into cubes produces discontinuous changes in concentration between cubes and a constant concentration inside a cube. Chemotaxis relies on continuous changes in the concentration gradient thus the discontinuity and quiescence poses a significant problem for the chemotaxis simulation. The problem is overcome by interpolation using Gaussian weighting.
A co-ordinate (x, y, z) in the simulation environment has a mapping to a cube with co-ordinates (i, j, k) in the approximated concentration space. Each cube has 26 neighbours that are connected either by an edge, face, or vertex. The 27 cubes surrounding (i, j, k), including itself, are sampled, taking a weighted average of each where the weighting is based on the distance from the (x, y, z) to the centroid of each cube (illustrated in Figure 14 with 5 cubes as opposed to 27). A Gaussian function is applied to each distance to provide smooth interpolation:

Where diis the distance from the point P to the centre of the ithcube centre in the array of 3x3x3 cubes surrounding point P, qiis the quantity in the ithcube, and finally cLis the length of the cube's side. Note that in general:

Hence, the weighting kernel will always sum to 1. For performance reasons, the nested summation is computed separately in an initial pass of the 27 cubes, stored, and then used in a second pass to avoid nested computation.

Figure 14. Nutrient field sampling interpolation scheme
Notes
[1] Author information will be available shortly.
[2] http://www2.warwick.ac.uk/fac/sci/dcs/teaching/modules/cs407
[3] http://go.warwick.ac.uk/systemsbiology
[4] http://go.warwick.ac.uk/complexity
[5] http://go.warwick.ac.uk/moac
[6] http://go.warwick.ac.uk/urss
[7] http://go.warwick.ac.uk/reinvention
[8] http://go.warwick.ac.uk/reinventionjournal
[9] http://www2.warwick.ac.uk/fac/sci/dcs/teaching/modules/cs904/
[10] http://github.com/dorchard/mucell/wikis/
[11] http://github.com/dorchard/mucell/wikis
[12] http://github.com/dorchard/mucell/wikis/screenshots-and-videos
[13] http://sbml.org/SBML_Software_Guide/SBML_Software_Summary
[14] http://github.com/dorchard/mucell/wikis/further-work
References
Agarwal, P. (1994), 'Simulation of aggregation in Dictyostelium using the cell programming language’, Computer Applications in the Biosciences, 6(10), 647-655
Alon, U., M. Surette, N. Barkai and S. Leibler (1999), 'Robustness in bacterial chemotaxis', Nature, 397(6715), 168-171
Beck, K. (1999), 'Embracing Change with Extreme Programming', Computer, 32(10), 70-77
Berg, H. C. and D. A. Brown (1972), 'Chemotaxis in Escherichia coli Analysed by Three-dimensional Tracking', Nature, 239(5374), 500-504
CellDesigner, Celldesigner website, http://www.celldesigner.org/, accessed December 2008
Cickovski, T. M., C. Huang, R. Chaturvedi, T. Glimm, H. G. E. Hentschel, M. S. Alber, J. A. Glazier, S. A. Newman and J. A. Izaguirre (2005), 'Compucell3d', IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), 2, 273-288
Falke, J. J., R. B. Bass, S. L. Butler, S. A. Chervitz and M. A. Danielson (1997), 'The Two-component Signalling Pathway of Bacterial Chemotaxis: A Molecular View of Signal Transduction by Receptors, Kinases, and Adaptation Enzymes', Annual Review of Cell and Developmental Biology 13, 457-512
Hucka, M., A. Finney, H. Sauro, and H. Bolouri (2001), 'Introduction to the Systems Biology Workbench', JST ERATO Kitano Symbiotic Systems Project
Hucka, M., A. Finney, H. M. Sauro, H. Bolouri, J. C. Doyle, H. Kitano, et. al. (2003), 'The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models', Bioinformatics 19 (4), 524-531
Hynes, R. and A. Lander (1992), 'Contact and adhesive specificities in the associations, migrations, and targeting of cells and axons', Cell 68, 303-322
Kanehisa, M. and S. Goto (2000), 'KEGG: Kyoto Encyclopedia of Genes and Genomes', Nucl. Acids Res. 28 (1), 27-30
KEGG, Bacterial Chemotaxis Escherichia coli K-12 MG1655, http://www.genome.jp/dbget-bin/show pathway?eco02030, accessed December 2008
Kitano, H. (2002), 'Systems Biology: A Brief Overview', Science 295 (5560), 1662-1664
Kumar, S. P. and J. C. Feidler (2003), 'BioSPICE: a computational infrastructure for integrative biology', OMICS: A Journal of Integrative Biology, 7 (3), 223-225
Macnab R. M. (1996), Flagella and motility p123-145. in Escherichia coli and Salmonella: Cellular and Molecular Biology, F. C. Neidhart et al (eds.), 2nded, ASM Press, Washington, DC
Moraru, I., J. Schaff, B. Slepchenko, M. Blinov, F. Morgan, A. Lakshminarayana, F. Gao, Y. Li, and L. Loew (2008), 'Virtual cell modelling and simulation software environment', Systems Biology, IET 2 (5), 352-362
Rojdestvenski, I. (2003), 'Metabolic pathways in three dimensions’, Bioinformatics19 (18), 2436-2441
Sourjij, V. (2004), 'Receptor clustering and signal processing in E. coli chemotaxis', TRENDS in Microbiology 12
Sowa Y and R. M. Berry (2008), 'Bacterial Flagellar Motor’, Quarterly Review of Biophysics, 41, 103-132
Tisue, S. and U. Wilensky (2004), 'NetLogo: A simple environment for modelling complexity’, International Conference on Complex Systems (ICCS 2004), Boston, MA, pp. 16-21
To cite this paper please use the following details: Orchard, D., Gover, J., Herrington, L., Lohr, J., Stead, D., Young, C. and Kalvala, S. (2009), 'muCell - Interdisciplinary Research in Modelling and Simulation of Cell Spatial Behaviour', Reinvention: a Journal of Undergraduate Research, Volume 2, Issue 1, http://www2.warwick.ac.uk/go/reinventionjournal/issues/volume2issue1/orchard Date accessed [insert date].
© Reinvention: a Journal of Undergraduate Research (2009). Full copyright remains with the author.






















