In: Enterprise Biology Software, Version 7.0  © 2007 Robert P. Bolender                           

 

Enterprise Biology Software: VIII. Research (2007)

Robert P. Bolender

Enterprise Biology Software Project, P. O. Box 303, Medina, WA  98039-0303, USA

http://enterprisebiology.com

 


 

Biological data differ remarkably from those of physics and chemistry in that their properties are intimately tied to their physical locations.  A hierarchical arrangement - consisting of parts contained within parts - defines biology as a complex set of interacting complexities, scaling in size from molecules to organisms.  None of this uniqueness becomes apparent until we start putting the parts back together.  If, for example, we don’t return the parts to their original in vivo settings before attempting an interpretation, we run the risk of allowing our data to become meaningless or at best severely limited.  In effect, a failure to recognize this special property of biology leads inevitably to a semiquantitative science.  Our task this year therefore becomes one of understanding how biology can become broken experimentally and then showing how we can fix it.  The reward for our efforts takes us one step closer to a quantitative biology. 

 

The purpose of the report this year is to explore ways of extending what we have learned from the stereology literature to disciplines often reduced to relying on semiquantitative approaches, such as biochemistry and molecular biology.  We begin by defining the boundary conditions for experimentation in biology, explore the unstable foundations of semiquantitative data, and finally suggest workable alternatives.  For example, a new category of hybrid hierarchy equations can integrate biological data across disciplines and provide methodological gold standards.  The main product to emerge from this effort is a rule book, one that carefully addresses the uniqueness of biology.  The software package for 2007/8 (EBS, Version 7.0) includes new data harvested largely from years 2004-5, expands the biology blueprint, introduces cluster analysis, and offers guidelines for a mathematical biology.

 


 

Introduction

 

Transforming biology into a quantitative science begins by changing the way we manage our research data.  An essential first step consists of moving published data from the pages of journal articles into the tables of relational databases where they can be standardized and used to look for mathematical patterns.  Such patterns can be readily found in the connections that occur between parts and captured as data pairs and repertoire equations.  In turn, these data pairs and equations can become the raw material for building a mathematical biology.  Fundamental to the success of this approach includes an ability to manage complexity in biology by mathematically unfolding and refolding structural relationships - throughout the biological hierarchy of size.  By leveraging the mathematical order inherent in biological systems, we now have a workable strategy for reverse and forward engineering structures of all sizes – one based largely on our ability to tap into the mathematical core of biology.  From this we learn that mathematics and technology can become powerful discovery tools when we use them in harmony with the intrinsic order of biology.

 

THE MODELS

 

Models define the ways in which we explore, interpret, and discover biology.  Two models, reductionism and change, largely define the research biology of today and have been eminently successful in producing a descriptive and semiquantitative biology.  However, the major limitation of these models is their inability to address biology as a complex adaptive system.  One approach to overcoming this limitation is to upgrade our current models and add new ones.  To make this transition, however, we will need a mathematical stepping-stone to take us from semiquantitative to quantitative.   Stereology can serve remarkably well in this role and connection, integration, and engineering models offer new discovery platforms that can be designed to operate comfortably in a complex setting.

 

Reductionism

 

Reductionism reduces or eliminates complexity by extracting specific parts of biology so that they can be carefully analyzed and studied.  The extraction process, however, usually forfeits the structural order of the part, severs connections with other parts, and produces isolated data.  When such data are interpreted out of context, they often become semiquantitative.  In other words, perfectly good quantitative data can collapse into a semiquantitative state by simply assuming they can do something they are unable to do.

 

Change

 

The change model compares two data points – usually a control and experimental – with the goal of detecting a significant change.  Detecting a significant change in isolated data is extremely easy to do in a semiquantitative setting, whereas detecting a significant change – that is also valid - in a biological setting requires the power of a quantitative biology.  Fortunately, the change model can be readily upgraded to quantitative by merely designing experiments as equations that contain all the required variables.  

 

Connectionism

 

The connection model maintains that all parts of biology are connected by rule, which means that the structural order of biology can be captured with equations.  Here the practical solution consists of standardizing published research data by moving them into a relational database and by forming data pairs.  In turn, these data pairs can be assembled into repertoire equations that define the connections mathematically.  The connection model can therefore offer a robust solution to the problem of unfolding complexity.       

 

Integration

 

The integration model operates by storing standardized data of all biological disciplines in a single database table and then using them to form data pairs and decimal repertoire equations.  This cleaning and summarizing process leads to a universal biology database.  Such a database can effectively integrate published research data across all disciplines and most experimental settings.   

 

Engineering

 

The engineering model uses biological data to unfold (reverse engineer) and refold (forward engineer) biological complexity.  This can be done locally with hierarchy equations and locally and globally with decimal repertoire equations.  This model serves as a quantitative foundation for diagnosis and prediction.  

 

 

MATHEMATICAL BIOLOGY – A Brief Introduction

 

The mathematical biology described herein depends wholly on published data to generate the empirical equations that we can use to explore an information space of uncommon complexity.  To qualify, these data must be gathered with unbiased sampling methods within the framework of a rule-based approach.  The brief introduction that follows considers three basic components of a mathematical biology that will serve to illustrate how this quantitative approach to biology works.  The Rule Book: Guidelines to a Mathematical Biology continues this process, but in greater detail (Bolender, 2007). 

 

Unbiased Sampling

 

In biology, sampling is everything.  Unbiased sampling requires that all parts of the structure being sampled must have an equal chance of being sampled.  Any other sampling scheme automatically becomes suspect.  Samples collected for biochemical analyses that do not come from a total cell or tissue homogenate, or come from isolated cell or tissue fractions will also fail this test – unless the data are collected and interpreted within the framework of analytical fractionation (de Duve, 1974).

 

Experimental Design

 

Research experiments can become consistent with the organizing principles of biology when they are designed as hierarchy equations.  The process is surprisingly straightforward.  The equations define the problem in terms of variables, which, in turn, are collected as data in the laboratory by applying unbiased sampling methods.  Finding a solution to an experiment consists of entering data into an equation and evaluating it.  The challenge for the reader will be to learn how to write balanced hierarchy equations.  However, the reward for such an effort can be substantial.  Such a skill will save countless hours in designing experiments, in reading research papers, and in reviewing manuscripts and research proposals.     

 

Biological Data

 

Data appearing in the biology literature can be quantitative, semiquantitative, or descriptive.  To qualify as quantitative data in a mathematical biology, they must be clearly identified, satisfy the unbiased sampling requirement, and detect differences and changes unambiguously.  Being quantitative also depends on how, where, and when the data are being used.  For example, quantitative data in one setting can quickly become semiquantitative in another. 

 

Let’s begin.  Most biological data represent directly or are derived from four basic quantities: volume (V), surface (S), length (L), and number (N).  Recall that weight includes the product of a volume (V) and a density (ρ): W=V x ρ.

 

In biology, however, these four basic quantities can become linked mathematically in curious ways because of the hierarchical organization of the parts.  When parts are contained within parts, with cells serving as the basic unit of construction, dependencies occur that become mathematically inseparable.  This means that the parts cannot be separated from one another when they belong to a quantitative unit.  Let’s look at an example.

 

Biological parts arranged in a structural hierarchy become a function of three variables: volume (V), mean volume (meanV), and number (N).  This gives three relationships. 

 

V = meanV ∙ N

N = V / meanV

Mean V = V / N

 

If we want to know at any given time what biology and its parts are up to, we need to know what’s happening to these three variables.  Notice that in practice we have to measure (or estimate) only two of the variables, because the equation allows us to solve for the third.  What happens when we decide to separate these three variables in our experimental design?  We get exactly what we don’t want, namely an incomplete or semiquantitative result.  In effect, by breaking them up, we break the rules.  Reductionism tells us we can, but we really cannot.  Why?  We can take the data out of the hierarchy, but not the hierarchy out of the data.  This turns out to be a fundamental property of biology, behaving as a complex hierarchical system.  It is a principle of experimental biology perhaps universally unknown and yet enormously important. 

 

Now, let’s work through a few examples.  We begin with the general equation for volume, wherein a compartmental volume is the product of the mean volume of a part and the number of parts:

 

V = meanV ∙ N.

 

By adding subscripts for a cell (cell), we can focus on the behavior of a specific part, namely a cell:

 

Vcell = meanV(cell) ∙ N(cell), where for convenience we can assign centimeter units:

 

cm3 = cm3 ∙ cm0 ; recall that cm0 = 1.

 

This equation tells us that to get the total volume of a compartment of cells, we need to know both the mean cell volume and the cell number.  Alternatively, we can get the same information by knowing the concentration (Vcell / Vstructure) of the cells in the containing structure (Vstructure):

 

Vcell = Vstructure ∙ (Vcell / Vstructure), where cm3 = cm3 ∙ (cm3 / cm3).

 

Now, lets compare the information content of these two equations.

 

(1) Vcell = meanV(cell) ∙ N(cell)

(2) Vcell = Vstructure ∙ (Vcell / Vstructure)

 

Equation (1) contains information about the volumes and numbers of the parts (i.e., cells), whereas equation (2) contains information only about the volumes of the parts.  To interpret a change in the cells (Vcell) unambiguously, equation (1) will work but not equation (2).  Why?  Because Vcell = meanV(cell) ∙ N(cell).  A change in the volume of cells can be influenced by a change in the mean cell volume, a change in the number of cells, or a change in some combination of the two.  Moreover, a change in equation (2) can be influenced by a change in the volume of the parent structure (Vstructure) plus all the changes that can occur in equation (1). 

 

By combining equations (1) and (2), we get:

 

meanV(cell) ∙ N(cell) = Vstructure ∙ (Vcell / Vstructure) , 

 

We can then solve for the concentration of cells (Vcell / Vstructure):

 

(Vcell / Vstructure) = (meanV(cell) ∙ N(cell)) / Vstructure , where (cm3 ∙ cm0) / cm3 = cm0.

 

Why is this equation important?  It unveils the complexity of a cell concentration.  Measuring a concentration is easy, explaining why a concentration has changed is considerably more difficult because many variables come in play.  Here’s the point.  When comparing experimental to control concentrations, we are dealing with two variables in the numerators and four in the denominators for a grand total of 6 variables - all of which can contribute to the outcome.  Remember that more than two variables in play effectively render an experimental result ambiguous (see Rules #4 and #5 in the Rule Book (Bolender 2007)).

 

Consider this.  When we add counts of molecules to our experiment, the cells and all the containing structures (parts in parts) continue to influence the outcome of the experiment.  This explains why it’s so important to express all experiments as hierarchy equations. 

 

Now let’s see what happens when we want to detect changes in molecules.  We begin by writing the equation of our experiment, one that will give us the number of molecules in a population of cells contained within a structure (Vstructure).  The same equation applies to control and experimental data.  Color-coding identifies the variables with units and phenotypes that can cancel.

 

The usual stereological equation is given as:

 

N(molecules,cell) = Vstructure ∙ Vv(cell/structure) ∙ Nv(molecules/cell) .

 

Next, we can unfold it to view the inherent complexity:

 

N(molecules,cell) = Vstructure ∙ {(meanV(cell) ∙ N(cell)) / V(structure)} ∙ {N(molecules) / (meanV(cell) ∙ N(cell))}, where

 

cm0 = cm3 ∙ (cm3 ∙ cm0 / cm3) ∙ (cm0 / cm3 ∙ cm0), where cm0 = 1.

 

Now we are ready to design an experiment that will let us detect a change in the number of molecules in a biological setting and to explain the sources of the change. 

 

In Vivo and In Vitro Experiments

 

The experiment will compare the number of molecules in the experimental (Nmolecules(E)) to those of the control (Nmolecules(C)):

 

(Nmolecules(E)) / (Nmolecules(C)) =

{Vstructure(E) ∙ {(meanVcell(E) ∙ Ncell(E)) / Vstructure(E)}∙ {Nmolecules(E) / (meanVcell(E) ∙ Ncell(E))} / {Vstructure(C) ∙ {(meanVcell(C) ∙ Ncell(C)) / Vstructure(C)}∙ {Nmolecules(C) / (meanVcell(C) ∙ Ncell(C))}.

 

What does this equation tell us?  When we wish to count molecules in an in vivo setting several variables come into play.  Recall that a common way of counting molecules in biochemistry and molecular biology is to measure an optical density (OD) in a homogenate or a sample isolated there from.  Such a measure represents a molecular concentration - the number of molecules contained within a unit of containing volume.  Now let’s ask the telling questions.  If we count molecules with optical densities, will our results be the same as those coming from the equation above?  Will we be able to explain the changes in terms of actual biological events?  Will the equation below work in a biological setting?  Is the following equation correct?

 

(Nmolecule(E)) / (Nmolecule(C)) = (ODmolecule(E)) / (ODmolecule(C)) = Correct?

 

The answers to the four questions above are no, no, no, and no.  Why?  Because an optical density generates only a molecular concentration, it ignores what’s happening to everything else, and therefore produces an ambiguous outcome.  In large part, optical densities are responsible for creating an environment where in the quantitative data of biochemistry and molecular biology are being routinely – and regrettably - downgraded to a semiquantitative status when used to detect changes. 

 

In an experimental setting, an optical density experiment simply compares two molecular concentrations, one control and one experimental.  The method assumes that comparing optical densities is equivalent to comparing molecular numbers and that the following equation must be correct.

 

ODmolecule(E) / ODmolecule(C)  =  (Nmolecule(E) / Vreference(E)) / (Nmolecule(C) / Vreference(C))  =  Nmolecule(E) / Nmolecule(C))

 

To accept this assumption, what must be true?  For the above equation to work, the reference volumes and the reference phenotypes must cancel.

 

Reference volume: Vreference(E) / Vreference(C) = 1, where, for example, cm3 / cm3 = 1.

 

Reference phenotype: Vreference phenotype(E) / Vreference phenotype(C) = 1, where the contents of the cm3 reference in the control (meanVcell(C) ∙ Ncell(C)) is exactly the same as contents of a cm3 reference in the experimental (meanVcell(E) ∙ Ncell(E)).  Recall that V = meanV ∙ N.

 

If these control and experimental reference volumes fail to cancel, then the optical density equation produces an ambiguous result because it is being influenced by the behavior of four variables:

 

ODmolecule(E) / ODmolecule(C)  =  (Nmolecule(E) / Vreference(E)) / (Nmolecule(C) / Vreference(C)),

 

which can be unfolded to give six variables:

 

ODmolecule(E) / ODmolecule(C)  =  (Nmolecule(E) / (meanVcell(E) ∙ Ncell(E))) / (Nmolecule(C) / (meanVcell(C) ∙ Ncell(C))).

 

The point being made here is that experiments in biology cannot be well designed in the absence of a realistic and correctly balanced equation.

 

Unrealistic Experiment

Nmolecule(E) / Nmolecule(C) = ODmolecule(E) / ODmolecule(C)

 

Realistic Experiment

ODmolecule(E) / ODmolecule(C)  =  (Nmolecule(E) / (meanVcell(E) ∙ Ncell(E))) / (Nmolecule(C) / (meanVcell(C) ∙ Ncell(C))).

 

If a unit of reference volume is always a standard unit of volume (e.g., one cm3), why does it fail to cancel?  It a biological setting, a cm3 of reference volume can cancel only when the contents of the cm3 in the control and experimental settings are exactly the same.  What can change in a cm3?  The number of molecules, the number of cells, the number of molecules in the mean cell, the size of the cells, the shapes of the cells, the amount of interstitial material, and a host of changes that can occur in all the other parts that may be present.  It turns out that a cm3 of reference volume represents a reference phenotype, one that will be unique to each control and experimental data point.  This means that in such a setting a cm3 or gram of tissue cannot be expected to cancel.  Understanding the role of the reference phenotype will allow both biochemistry and molecular biology to prevent their research data from becoming semiquantitative – just as it has already done for the densities of biological stereology. 

 

Just how important is this reference phenotype rule?  It turns out to be very important.  Is there, for example, any convincing way of judging quantitatively the impact of the reference phenotypes on the results of an experiment?  Yes.  If we turn to the Stereology Literature Database and collect data from studies that reported both the number (N) and concentration (N/V) for the same part(s), then we can see firsthand how often the two estimates agree. 

 

To this end, I have updated the percentage change table, relabeled it the Concentration Trap, and included a fresh copy in the current software package.  This software tool can be quite helpful in that it allows us to estimate the amount of damage to expect when concentrations are being compared in an experimental setting – both locally and globally. 

 

Some examples may help.  Globally, changes in the same part detected as a total number of parts (N) and as a concentration (N/V) agree only about 50% of the time.  Since we can expect a similar outcome for OD data, how might we expect to interpret an experiment based just on optical densities?  If the OD data are reported to be significantly different at the 95% level (p<0.05), what might this really mean?  Since such a comparison – on average - enjoys only about a 50:50 chance of being correct, this could reduce the overall significance level of the outcome to a probability of less than 50%.  Such an outcome is clearly an ambiguous one.  It tells us that concentration (OD) data become unreliable when they are pulled out of the equation of an experiment and used on their own to detect a change.  They simply cannot do it alone because they need the help of other variables, along with directions from a rule-based equation.   

 

On close inspection, OD comparisons – control vs. experimental – clearly do not carry enough information to detect a biological change credibly.  If we can demonstrate convincingly that the concentrations being detected with optical densities suffer the same fate as the concentrations (called densities) of stereology, then OD data coming from in vivo experiments become suspect and will most likely warrant a similar downgrading to semiquantitative. 

 

Let’s look for solutions to this OD problem, because it seems quite unlikely that that these data can survive the downgrade - when put to the test described above.  To use OD data productively, we must figure out how to reduce the mischief being caused by the variability of the reference phenotypes.  Recall that the reference phenotype of the control data will not cancel the one of the experimental because they are different parts phenotypically.  This leaves us with at least two workable options.  We can use the approach illustrated by the equations given at the beginning of this discussion, or we can eliminate the reference phenotype locally – at the level of the control and experimental data points.  This is accomplished by forming data pairs, each part of which must be related to the same reference phenotype.

 

Local Fix:  Form data pairs to upgrade semiquantitative OD data to quantitative.

 

When both molecule A and B are related to the same reference phenotype, then the following data pairs can be formed.

 

For control data

ODmolecule A(C) / ODmolecule B(C)  =  (Nmolecule A(C) / Vreference ph (C)) / (Nmolecule B(C) / Vreference ph(C)) =  Nmolecule A(C) / Nmolecule B(C)

 

For experimental data

ODmolecule A(E) / ODmolecule B(E)  =  (Nmolecule A(E) / Vreference ph(E)) / (Nmolecule B(E) / Vreference ph(E)) =  Nmolecule A(E) / Nmolecule B(E)

 

Within the framework of a connection model, these optical density data can now provide reliable information about the proportions of molecules (e.g., A:B) in both control and experimental settings.  However, these data provide no information about the total number of molecules.  The point to be made here is that semiquantitative OD data can be transformed into a reliable form of quantitative data.  Once expressed as data pairs, the OD data can be added to the Universal Biology Database, interact with other data types, and contribute to forming equations and uncovering mathematical patterns. 

 

Global Fix: Design the experiment as a hierarchy equation and collect all the essential variables.

 

 

Mathematical Phenotypes

 

The term mathematical phenotype describes the relationship of biological parts, from two (data pairs) to many (repertoire networks).  Here we consider two ways of assembling such phenotypes.

 

Decimal Repertoire Equations: Parts sharing the same (or adjacent) decimal repertoire equations display stronger associations than with those located in more distant equations.  Such information becomes useful when looking for related parts or when assembling networks of equations.  A further advantage of these repertoire data (and equations) is that they can be searched and sorted very quickly in the database table. 

 

Cluster Analysis: Cluster analysis sorts data into groups (clusters) according to stronger or weaker associations.  It can be especially useful for comparing graphically similar sets of biological parts in different experimental settings.  It extends our current ability from following the behavior of a few variables to one of looking at the larger patterns being produced by many variables.

 

Cluster analysis is a multivariate technique that provides graphical output as trees (dendrograms).  Parts on the same branch are more closely related than those on distant branches.  The results given in the software were calculated with StatistiXL (Version 1.6).  An Internet search will provide enough background and examples to introduce the method to the reader (see, for example, the Clustan website).

 

When looking for mathematical patterns in the large data sets coming from microarrays, molecular biologists often use cluster analysis.  A paper by Eisen et.al., (1998) includes an illustration and a worked example.   

 

 


 

Methods and Results

 

 

Enterprise Biology Software Package – 2007

 

The software package includes a main menu sitting atop eight program modules.  Each module provides access to a collection of programs, along with an introductory document (Read).  The installation program installs the relational database, front-end tools, and includes supporting documents and programs.  To run a program module, click on an item in the menu.  Note that the text documents will display only after the new reader has been installed (see Setup 8.).   Since several of the programs were described in detail last year (Bolender, 2006), that effort will not be duplicated here.  For more information about a given program, click on the Read button.

 

..\..\..\2007 UBD\Images\2007PR001.BMP

 

1. Read Progress Reports 2007/2008: Current and past reports can be called from this screen.

 

..\..\..\Program Files\EBSTicket 2006\Files\2007cluster101.BMP

 

 

 

2. Find Research Papers: References can be found by searching on methods and on data.  The read document provides examples.

 

..\..\..\2007 UBD\Images\2007PR002.BMP

 

By Citation:

..\..\..\2007 UBD\Images\2007PR003.BMP

 

By Method(s):

..\..\..\2007 UBD\Images\2007PR004.BMP

 

By Data:

..\..\..\2007 UBD\Images\2007PR005.BMP

 

 

3. Explore with Data & Equations: The Universal Biology Database includes roughly 40,000 data pairs fitted to regression curves and summarized as decimal repertoire equations.  This year, new data were added from about 100 papers.  The module offers a variety of options for using these data in a discovery mode.  The programs use a SQL (structured query language) front-end, one that can be mastered quickly even by beginners.

     

..\..\..\2007 UBD\Images\2007PR006.BMP

 

Data table (UBD):

..\..\..\2007 UBD\Images\2007PR007.BMP

 

SQL – control data:

..\..\..\Program Files\EBSTicket 2006\Files\2007cluster105.BMP

 

SQL – experimental data:

..\..\..\2007 UBD\Images\2007PR008.BMP

 

SQL – control & experimental data:

 

..\..\..\2007 UBD\Images\2007PR009.BMP

 

 

4. Find Quantitative Patterns with a Biology Blueprint:  The blueprint summarizes the 40,000 decimal repertoire equations as a connection matrix.  It shows how biological parts are connected quantitatively and how these parts can change in an experimental setting.  In effect, it uses the proportion of parts (stoichiometry) to summarize the range of design options being employed by biology.  This year, the biology blueprint was updated to include experimental data.  

 

..\..\..\2007 UBD\Images\2007PR010.BMP

 

Blueprint out: The new version (3.0) includes both control (green) and experimental data (blue).

..\..\..\2007 UBD\Images\2007PR011.BMP

 

Blueprint SQL: A new SQL interface quickly locates specific information and patterns.  Such tools should simplify the task of assembling networks and may become helpful in deciphering genetic switching and control mechanisms.

 

..\..\..\2007 UBD\Images\2007PR012.BMP

 

 

5. Viewing Mathematical Phenotypes:  Decimal repertoire equations and cluster analysis sort data pairs into groups (or clusters) according to strong or weak associations.  They offer effective ways of finding patterns in large and small data sets.

 

..\..\..\2007 UBD\Images\2007PR013.BMP

 

Comparing patterns:  This module includes examples of tree graphs (dendrograms) and suggests a wide range of applications for the method.   The figure at the left illustrates the phenotype of the hippocampus of alcoholics, whereas the one at the right shows the different strains of mice used earlier in the connection matrix (Bolender, 2005).

 

..\..\..\2007 UBD\Images\2008clust033.BMP     ..\..\..\2007 UBD\Images\2008clust034.BMP

 

6. Add New Data:  These data entry screens allow the user to enter new data pairs and blueprint data.

 

..\..\..\2007 UBD\Images\2007PR014.BMP

 

Data point data - in: Use this screen to enter new data pairs.

 

..\..\..\2007 UBD\Images\2007PR015.BMP

 

 

Blueprint in: Use this screen to enter new data into the blueprint.

 

..\..\..\2007 UBD\Images\2007PR016.BMP

 

7. Rule Book: The Rule Book introduces the ground rules for a mathematical biology based largely on the data and principles harvested from the literature of biological stereology.  It also includes a software tool – called the concentration trap - that offers assistance in assessing the risk associated with using solo concentrations for detecting biological changes. 

 

..\..\..\2007 UBD\Images\2007PR018.BMP

 

Rule Book: Guidelines for a mathematical biology.

 

..\..\..\2007 UBD\Images\2007PR017.BMP

 

Concentration trap: What happens to the results of an experiment when it is based on an equation or on a single variable plucked out of that equation?  For example, an outcome labeled V, S, L, and N all come from an equation, whereas Vv, Sv, Lv, And Nv would represent plucked variables.  Recall that the color-coding in this table includes: red an increase, blue a decrease, and green no change.

 

..\..\..\2007 UBD\Images\2007PR019.BMP

 

A global view: For example, click on the global button marked Vv vs. V to see how often an isolated variable (Vv) – a concentration - gives the same result as that of the equation to which it belongs (V).  Scroll through the screens - by clicking on them with the mouse - and become convinced that V does not always share the same background color with Vv.  This means that an isolated variable cannot be trusted to give the same result as that of a properly written equation.  On average, the two different outcomes agree only about 50% of the time.     

 

A local view: All cells and tissues do not necessarily share the same risk of falling into the concentration trap.  For example, counting the number of glomeruli in the kidney (Nv vs. N) is accompanied by 86% error (the concentration data will be wrong 86% of the time (24 disagree/28 total count)), whereas counting neurons in the hippocampus (Nv vs. N) has only a 22% error (18 disagree/82 total).  In any case, comparing densities or concentrations becomes a risky business and all such comparisons should be considered highly questionable.  Why?  A single isolated variable can never be as effective at detecting a change reliably as a group of variables working together in a properly designed equation.      

 

Here is the filter script for the glomerulus (match(ex_datapoint_ex_datapoint_name, '[g][l][o][m][e][r]') and ex_data_1_ex_n >0 and ex_data_1_ex_nv >0) and the one for the hippocampus (match(ex_datapoint_ex_datapoint_name, '[h][i][p][p][o]') and ex_data_1_ex_n >0 and ex_data_1_ex_nv >0).  Direction for writing such scripts can be found in the software (click on Read).