Peter Wills Bioinformatic Centre

Vincent


To advance our understanding of the genetics of neurological disorders we need to bring genotype and phenotype information together. Information management techniques for genotype data are well known and not particularly difficult to implement. Information management techniques for phenotype data, particularly in neurological disorders, are more difficult. Paper-based systems, such as DIGS, have the limitation that each time we want to look at different aspects of disease presentation, we need to pull the paper questionnaire and examine the answers for the new aspects for each participant of the study and then re-derive our diagnosis. Online systems, such as VISDIGS, address this to an extent, but do not integrate with genotype data.


We have developed the data model of an integrated information system, called Vincent, to bring genotype and phenotype data together for our bipolar study.

The key features of this system are:

  • secure web client allowing on-line data collection from our remote clinics
  • integration with the VISDIGS systems, allowing full upload of the VISDIGS data for participants into our centralised database
  • perform initial diagnosis from VISDIGS data to present preliminary diagnosis
  • ability to generate pedigree diagram data for import into diagramming tools such as Cyrillic and GenoPedigree
  • integrated marker and genotype data management
  • ability to define a cohort to be a subset of all participants
  • ability to define the disease model for linkage analysis
  • ability to define the diagnostic model to vary the features from VISDIGS that determine affection status
  • directly output .pre and .dat files ready for analysis with systems such as Linkage
  • integrated sample location management for blood and blood product
  • flexible, powerful reporting feature to generate output to suit all research needs
  • security model to ensure appropriate access by authorised staff


Participants visit our clinics and are interviewed using DIGS and the data are entered into VISDIGS running on the clinic computer. Using a web interface, the clinic staff export data from VISDIGS and upload it to our central database. Our scientists upload marker and genotype data into the same database. Our scientists can then define a subset of the study participants for analysis, the parameters associated with the disease models (penetrance levels) and how to translate clinical diagnosis to affection status (the disease model). Vincent can then produce .pre and .dat files for the participants in the analysis with all associated details such as allele values for each marker for each participant, overall allele frequencies and penetrance levels. In order to consider different disease presentation, we only need to alter the component that produces the diagnosis, create new .pre and .dat files and rerun the analysis.
This system saves huge amounts of research time and maintains a clean, single-source version of our invaluable research data. It allows us to consider alternate presentation of the diseases we study without the need to hire staff to re-evaluate paper copies of DIGS questionnaires. It integrates our remote clinics. In the future we will look to enable Vincent to produce linkage analysis files and actually run linkage unattended.

 

Vincent is built with REALbasic and released as a native binary (Win32, MacOS 8/9 and MacOS X) and is also accessible via a web interface. The data is warehoused in Sybase hosted on Solaris technology.


©2003 PWBC