summaryrefslogtreecommitdiff
path: root/docs/design.org
blob: 41713ebef7c38ffd49345ce155c1244654f466fb (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
#+TITLE: 7Units Design Document
#+SUBTITLE: For version 0.3.1
#+DATE: 2021 August 24
#+LaTeX_HEADER: \usepackage[a4paper, lmargin=25mm, rmargin=25mm, tmargin=25mm, bmargin=25mm]{geometry}
#+LaTeX_HEADER: \usepackage{xurl}

* Introduction
  7Units is a program that can convert between units.  This document details the internal design of 7Units, for current and future developers.

  The frontend code is currently subject to change, so it is not included in the current version of this document.
* Unit System Design
  Any code related to the backend unit system is stored in the ~sevenUnits.unit~ package.
** Dimensions
   Dimensions represent what a unit is measuring, such as length, time, or energy.  Dimensions are represented as an [[*ObjectProduct][ObjectProduct]]<BaseDimension>, where ~BaseDimension~ is a very simple class (its only properties are a name and a symbol) which represents the dimension of a base unit; these base dimensions can be multiplied to create all other Dimensions.
** Unit Classes
   Units are internally represented by the abstract class ~Unit~.  All units have an [[*ObjectProduct][ObjectProduct]]<BaseUnit> (referred to as the base) that they are based on, a dimension (ObjectProduct<BaseDimension>), one or more names and a symbol (these last two bits of data are contained in the ~NameSymbol~ class).  The dimension is calculated from the base unit when needed; the variable is just a cache.  It has two constructors: a package-private one used to make ~BaseUnit~ instances, and a protected one used to make general units (for other subclasses of ~Unit~).  All unit classes are immutable.

   Units also have two conversion functions - one which converts from a value expressed in this unit to its base unit, and another which converts from a value expressed in the base unit to this unit.  In ~Unit~, they are defined as two abstract methods.  This allows you to convert from any unit to any other (as long as they have the same base, i.e. you aren't converting metres to pounds).  To convert from A to B, first convert from A to its base, then convert from the base to B.

   ~BaseUnit~ represents a unit that all other units are defined by.  All of the units used by this system are defined by seven SI ~BaseUnit~ instances (metre, second, kilogram, ampere, kelvin, mole, candela; this is what 7Units is named after) and two non-SI ~BaseUnit~ instances (US dollar and bit).  Because base units are themselves units (and should be able to be used as units), ~BaseUnit~ is a subclass of ~Unit~, using its own package-private constructor.

   However, most units are instances of ~LinearUnit~, another subclass of ~Unit~.  ~LinearUnit~ represents a unit that is /a product of a base unit and a constant called the *conversion factor*/.  Most units you've ever used fall under this definition, the only common exceptions are degrees Celsius and Fahrenheit.  This simplicity allows the ~LinearUnit~ to do many things:
   - It can implement conversion to and from the base as multiplying and dividing respectively by the conversion factor
   - You can easily create new units by multiplying or dividing a ~LinearUnit~ by a number (for example, kilometre = metre * 1000).  This can be easily implemented as multiplying this unit's conversion factor by the multiplier and returning a new ~LinearUnit~ with that conversion factor factor.
   - You can add or subtract two ~LinearUnit~ instances to create a third (as long as they have the same base) by adding or subtracting the conversion factor.
   - You can multiply or divide any two ~LinearUnit~ instances to create a third by multiplying or dividing the bases and conversion factors.
   - Note that any operations will return a unit without name(s) or a symbol.  All unit classes have a ~withName~ method that returns a copy of them with different names and/or a different symbol (all of this info is contained in the ~NameSymbol~ class)

   There are a few more classes which play small roles in the unit system:
   - Unitlike :: A class that is like a unit, but its "value" can be any class.  The only use of this class right now is to implement ~MultiUnit~, a combination of units (like "foot + inch", commonly used in North America for measuring height); its "value" is a list of numbers.
   - FunctionalUnit :: A convenience class that implements the two conversion functions of ~Unit~ using ~DoubleUnaryOperator~ instances.  This is used internally to implement degrees Celsius and Fahrenheit.  There is also a version of this for ~Unitlike~, ~FunctionalUnitlike~.
   - UnitValue :: A value expressed as a certain unit (such as "7 inches").  This class is used by the simple unit converter to represent units.  You can convert them between units.  There are also versions of this for ~LinearUnit~ and ~Unitlike~.
   - Metric :: A static utility class with instances of all of the SI named units, the 9 base dimensions, SI prefixes, some common prefixed units like the kilometre, and a few non-SI units used commonly with them.
   - BritishImperial :: A static utility class with instances of common units in the British Imperial system (not to be confused with the US Customary system, which is also called "Imperial"; it has the same unit names but the values of a few units are different).  This class and the US Customary is divided into static classes for each dimension, such as ~BritishImperial.Length~.
   - USCustomary :: A static utility class with instances of common units in the US Customary system (not to be confused with the British Imperial system; it has the same unit names but the values of a few units are different).
** Prefixes
   
** The Unit Database
* Utility Classes
  7Units has a few general "utility" classes.  They aren't directly related to units, but are used in the units system.
** ObjectProduct
   An ~ObjectProduct~ represents a "product" of elements of some type.  The units system uses them to represent coherent units as a product of base units, and dimensions as a product of base dimensions.

   Internally, it is represented using a map mapping objects to their exponents in the product.  For example, the unit "kg m^2 / s^2" (i.e. a Joule) would be represented with a map like ~[kg: 1, m: 2, s: -2]~.
** ExpressionParser
   The ~ExpressionParser~ class is used to parse the unit, prefix and dimension expressions that are used throughout 7Units.  An expression is something like "(2 m + 30 J / N) * 8 s)".  Each instance represents a type of expression, containing a way to obtain values (such as numbers or units) from the text and operations that can be done on these values (such as addition, subtraction or multiplication).  Each operation also has a priority, which controls the order of operations (i.e. multiplication gets a higher priority than addition).

   ~ExpressionParser~ has a parameterized type ~T~, which represents the type of the value used in the expression.  The expression parser currently only supports one type of value per expression; in the expressions used by 7Units numbers are treated as a kind of unit or prefix.  Operators are represented by internal types; the system distinguishes between unary operators (those that take a single value, like negation) and binary operators (those that take 2 values, like +, -, * or /).

   Expressions are parsed in 2 steps:
   1. Convert the expression to [[https://en.wikipedia.org/wiki/Reverse_Polish_notation][Reverse Polish Notation]], where operators come *after* the values they operate on, and brackets and the order of operations are not necessary.  For example, "2 + 5" becomes "~2 5 +~", "(1 + 2) * 3" becomes "~1 2 + 3 *~" and the example expression earlier becomes "~2 m * 30 J * N / + 8 s * *~".  This makes it simple to evaluate - early calculators used RPN for a good reason!
   2. Evaluate the RPN expression.  This can be done simply with a for loop and a stack.  For each token in the expression, the progam does the following:
      - if it is a number or unit, add it to the stack.
      - if it is a unary operator, take one value from the stack, apply the operator to it, and put the result into the stack.
      - if it is a binary operator, take two values from the stack, apply the operator to them, and put the result into the stack.
      After evaluating the last token, there should be one value left in the stack - the answer.  If there isn't, the original expression was malformed.
** Math Classes
   There are two simple math classes in 7Units:
   - ~UncertainDouble~ :: Like a ~double~, but with an uncertainty (e.g. \(2.0 \pm 0.4\)).  The operations are like those of the regular Double, only they also calculate the uncertainty of the final value.  They also have "exact" versions to help interoperation between ~double~ and ~UncertainDouble~.
   - ~DecimalComparison~ :: A static utility class that contains a few alternate equals() methods for ~double~ and ~UncertainDouble~.  These methods allow a slight (configurable) difference between values to still be considered equal, to fight roundoff error.
** Collection Classes
   The ~ConditionalExistenceCollections~ class contains wrapper implementations of ~Collection~, ~Iterator~, ~Map~ and ~Set~.  These implementations ignore elements that do not pass a certain condition - if an element fails the condition, ~contains~ will return false, the iterator will skip past it, it won't be counted in ~size~, etc. even if it exists in the original collection.  Effectively, any element of the original collection that fails the test does not exist.