Variable

From Opasnet
Revision as of 07:15, 17 September 2015 by Heta (talk | contribs) (→‎Rationale)


<section begin=glossary />

A variable is a description of a particular piece of reality. It can describe a physical phenomenon or a value judgement. Decisions included in an assessment are also described as variables. Variables are continuously existing descriptions of reality that develop over time as knowledge about the topic increases. Variables are therefore not tied to any single assessment and can be included in other assessments as well. A variable is the basic building block for describing reality.<section end=glossary />

Question

What should be the structure of a variable such that it

  • is able to systematically handle all kinds of information about the particular piece of reality that the variable is describing, especially
    • it is generic enough to be a standard building block in decision support work (including interpretation of scientific information and political discussions),
  • is able to systematically describe causal relationships between phenomena and variables that describe them,
  • enables both quantitative and qualitative descriptions,
  • is suitable for any kinds of variables, especially physical phenomena, decisions, and value judgements,
  • inherits its main structure from universal objects,
  • complies with the PSSP ontology,
  • can be operationalised in a computational model system,
  • results in variables that are independent of the assessment(s) they belong to,
  • results in variables that pass the clairvoyant test,
  • can be implemented on a website, and
  • is easy enough to be usable and understood by interested non-experts?

Answer

A variable is implemented as a web page in the Opasnet wiki web-workspace. A variable page has the following structure.

The attributes of a variable.
Attribute | Sub-attribute | Comments specific to the variable attributes
Name | | An identifier for the variable. Each Opasnet page has two kinds of identifiers: the name of the page (e.g. Variable) and the page identifier (e.g. Op_en2022). The former is used e.g. in links, the latter in R code.
Question | | Gives the question that is to be answered. It defines the scope of the variable. The question should be defined so that it is relevant in many different situations, i.e. so that the variable is re-usable. (Compare to an assessment question, which is more specific to time, place and user need.)
Answer | | The answer presents an understandable and useful response to the question. Its essence is often a machine-readable and human-readable probability distribution (which can in a special case be a single number), but an answer can also be non-numerical, such as "very valuable", or a descriptive table like the one on this page. The units of interconnected variables need to be coherent with each other given the functions describing causal relations. The units of variables can therefore be used to check the coherence of the causal network description. This is a so-called unit test. Typically the answer contains R code that fetches the ovariable created under Rationale/Calculations and evaluates it.
Rationale | | Rationale contains anything that is necessary to convince a critical reader that the answer is credible and usable. It presents to the reader the information required to derive the answer and explains how the answer is formed. Typically it has the following sub-attributes, but others are also possible. Rationale may also contain lengthy discussions about relevant topics.
Rationale | Data | Data describes direct observations (or expert judgements) about the variable itself.
Rationale | Dependencies | Dependencies tell what we know about how upstream variables (i.e. causal parents) affect the variable. In other words, we attempt to estimate the answer indirectly based on information about causal parents. Sometimes reverse inference based on causal children is also possible. Dependencies list the causal parents and express their functional relationships (the variable as a function of its parents) or probabilistic relationships (conditional probability of the variable given its parents).
Rationale | Calculations | Calculations is an operationalisation of how to calculate or derive the answer. The formula uses algebra, computer code, or other explicit methods if possible. Typically it is R code that produces and stores the ovariables needed to compute the current best answer to the question.
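
As a rough illustration of the structure above, the following Python sketch models a variable with its Name, Question, unit, Data, Dependencies (causal parents) and Calculations (a formula). It is not the Opasnet implementation, which uses R ovariables. The class `Variable`, the helper `multiply_units`, the example variables and all numbers are hypothetical and chosen only for illustration. The answer is approximated as a list of samples drawn from the data or computed from the parents. The unit check shown covers only the simple case where the formula is a product of the parents.

```python
import random
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

# A unit is a map from base unit to exponent, e.g. ug/m3 -> {"ug": 1, "m3": -1}.
Unit = Dict[str, int]


def multiply_units(*units: Unit) -> Unit:
    """Combine the units of multiplied quantities by adding exponents."""
    result: Unit = {}
    for unit in units:
        for base, exponent in unit.items():
            result[base] = result.get(base, 0) + exponent
    # Drop base units that cancelled out.
    return {base: exp for base, exp in result.items() if exp != 0}


@dataclass
class Variable:
    """Hypothetical stand-in for a variable page (not an Opasnet API)."""
    name: str                                   # Name
    question: str                               # Question
    unit: Unit                                  # unit of the Answer
    data: List[float] = field(default_factory=list)            # Rationale/Data
    parents: List["Variable"] = field(default_factory=list)    # Rationale/Dependencies
    formula: Optional[Callable[..., float]] = None             # Rationale/Calculations

    def answer(self, n: int, rng: random.Random) -> List[float]:
        """Return n samples that approximate the answer distribution."""
        if self.formula is None:
            # No causal model: resample the direct observations.
            return [rng.choice(self.data) for _ in range(n)]
        parent_samples = [p.answer(n, rng) for p in self.parents]
        return [self.formula(*values) for values in zip(*parent_samples)]

    def units_coherent(self) -> bool:
        """Unit check, valid only when the formula is a product of the parents."""
        return multiply_units(*(p.unit for p in self.parents)) == self.unit


# Made-up example: intake = concentration * breathing rate.
concentration = Variable(
    name="Concentration",
    question="What is the concentration of the chemical in air?",
    unit={"ug": 1, "m3": -1},
    data=[8.0, 10.0, 12.0],
)
breathing = Variable(
    name="Breathing rate",
    question="How much air does a person inhale per day?",
    unit={"m3": 1, "day": -1},
    data=[20.0],
)
intake = Variable(
    name="Intake",
    question="How much of the chemical does a person inhale per day?",
    unit={"ug": 1, "day": -1},
    parents=[concentration, breathing],
    formula=lambda c, b: c * b,
)

samples = intake.answer(n=1000, rng=random.Random(1))
print(intake.units_coherent(), min(samples), max(samples))
```

Because each variable carries its own question, data and formula, the same `concentration` object could serve as a parent in several different calculations, which mirrors the idea that variables are not tied to a single assessment.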

It is also practical to have further subheadings on a variable page. These are not attributes, though.

  • See also
  • Keywords (not always used)
  • References
  • Related files

Rationale

The structure is based on extensive discussions between Mikko Pohjola and Jouni Tuomisto in 2006-2008 and intensive application in Opasnet ever since.


Variables, or more accurately open variables, give the current best answer to a specific research question based on the crowd's interpretation of existing information. Variables are the basic elements of assessments. They always describe some phenomenon in the real world: for example, a physical phenomenon such as exposure to some chemical, but also the distribution of opinions about immigration in the population. It is part of the nature of variables that they are never fully finished; their content evolves with new knowledge and with the work done to improve them. Variables are also not bound to any one assessment but can be used as parts of many different assessments. Note that the word variable has many other meanings, but in this context it means precisely the variables used in assessments.

Variables contain scientific knowledge, but they differ from the classic products of scientific research. Here is a short description and comparison.

  • A scientific article is the basic unit of doing science today. In it, a researcher or research group reports a study that produces observational data. The data are analysed, and interpretations and conclusions are then drawn from the new results and from previous scientific articles. The goal is to publish the article in a peer-reviewed journal, meaning that a few researchers in the field have read the manuscript and endorsed it before publication. The peer review system aims to raise the quality of manuscripts and to weed out bad research. It is widely agreed that the system is not especially efficient for either purpose, but no one has come up with anything better.
  • Expert reports are compiled by an expert who knows the field in question well, and they usually address a specific question, such as the topic of an upcoming decision. They usually do not produce new knowledge and are usually not peer reviewed, so they are not well respected among researchers and research funders. However, they are much better suited to decision support, because they answer precisely the questions that are relevant to the decision at hand.
  • Open data is usually measured data that has been made public in raw form for anyone to use. Whether the data is well curated and quality-checked depends on the case, but usually it is not. The practices of open data have only begun to take shape in the last few years, because researchers have not been in the habit of publishing raw data. The problem for decision support is that open data contains no interpretations or conclusions, let alone ones about the issues relevant to the decision. Open data is great raw material for someone who knows how to analyse and interpret it and has the time, but quite useless to anyone else.
  • A variable aims to combine the features of the other information products that are useful for decision support and to avoid their weaknesses. The idea is to build an information product around a specific research question. The question can be purely scientific, but in decision support it can be phrased to serve precisely the upcoming decision. To answer the question, experts gather all available material that helps answer it. This includes research articles, expert reports and open data, as well as the experts' tacit knowledge that is not found in written form. The variable is worked on from the beginning in an open web-workspace with the help of crowdsourcing, and all the information it contains is free to use. The material is structured, assessed and interpreted. The result is an answer that has withstood all the critique raised during the work. The answer is thus the best current interpretation of how the matter in question actually is in reality. Open criticism of the variable during the work ensures that the result is scientifically sound. The answer is usually given both in a computer-readable form for models and as text and pictures for humans. The strengths of a variable are that it uses all relevant information (not only the authors' own data, as in an article), interprets the data (unlike open data) and is produced following the principles of openness and critique (unlike an expert report).


See also

References


Related files