Linear Functions
- Linear and affine functions
- First-order approximation of non-linear functions
- Other sources of linear models
Linear and affine functions
Definition
Linear functions are functions which preserve scaling and addition of the input argument. Affine functions are ‘‘linear plus constant’’ functions.
Formal definition, linear and affine functions. A function is linear if and only if
preserves scaling and addition of its arguments:
- for every
, and
,
; and
- for every
,
.
A function is affine if and only if the function
with values
is linear.
An alternative characterization of linear functions is given here.
Examples: Consider the functions with values
,
,
.
The function is linear;
is affine; and
is neither.
Connection with vectors via the scalar product
The following shows the connection between linear functions and scalar products.
A function is affine if and only if it can be expressed via a scalar product:
data:image/s3,"s3://crabby-images/d457e/d457eafc4510c99e74cb7fe3f2f7b127422f2df8" alt="Rendered by QuickLaTeX.com f(x) = a^Tx+b,"
for some unique pair , with
and
, given by
, with
the
-th unit vector in
,
, and
. The function is linear if and only if
.
The theorem shows that a vector can be seen as a (linear) function from the ‘‘input“ space to the ‘‘output” space
. Both points of view (matrices as simple collections of numbers, or as linear functions) are useful.
Gradient of an affine function
The gradient of a function at a point
, denoted
, is the vector of first derivatives with respect to
(see here for a formal definition and examples). When
(there is only one input variable), the gradient is simply the derivative.
An affine function , with values
has a very simple gradient: the constant vector
. That is, for an affine function
, we have for every
:
data:image/s3,"s3://crabby-images/54f1d/54f1dde934ef12f6d2fe1b37e42845ea691eaac5" alt="Rendered by QuickLaTeX.com \nabla f(x) = a"
Example: gradient of a linear function.
Interpretations
The interpretation of are as follows.
- The
is the constant term. For this reason, it is sometimes referred to as the bias, or intercept (as it is the point where
intercepts the vertical axis if we were to plot the graph of the function).
- The terms
,
, which correspond to the gradient of
, give the coefficients of influence of
on
. For example, if
, then the first component of
has much greater influence on the value of
than the third.
Example: Beer-Lambert law in absorption spectrometry.
First-order approximation of non-linear functions
Many functions are non-linear. A common engineering practice is to approximate a given non-linear map with a linear (or affine) one, by taking derivatives. This is the main reason for linearity to be such a ubiquitous tool in Engineering.
One-dimensional case
Consider a function of one variable , and assume it is differentiable everywhere. Then we can approximate the values function at a point
near a point
as follows:
data:image/s3,"s3://crabby-images/d7304/d73048db603fe8510ff29531b14e7a65dc9ec837" alt="Rendered by QuickLaTeX.com f(x) \simeq l(x):= f(x_0) + f'(x_0)(x-x_0),"
where denotes the derivative of
at
.
Multi-dimensional case
With more than one variable, we have a similar result. Let us approximate a differentiable function by a linear function
, so that
and
coincide up and including to the first derivatives. The corresponding approximation
is called the first-order approximation to
at
.
The approximate function must be of the form
data:image/s3,"s3://crabby-images/f0f8c/f0f8c79e3a613f3890d2c4db24ab393ac25f758d" alt="Rendered by QuickLaTeX.com l(x) = a^Tx+b,"
where and
. Our condition that
coincides with
up and including to the first derivatives shows that we must have
data:image/s3,"s3://crabby-images/d1d1e/d1d1eed0ddda06dbfb93c895082e7b3afe284f64" alt="Rendered by QuickLaTeX.com \nabla l(x) = a =\nabla f(x_0), a^Tx_0 + b = f(x_0),"
where the gradient, of
at
. Solving for
we obtain the following result:
The first-order approximation of a differentiable function at a point
is of the form
data:image/s3,"s3://crabby-images/44b68/44b680cfae3b05e2a89dcd196afbb296f6712af1" alt="Rendered by QuickLaTeX.com f(x) \approx l(x) = f(x_0) + \nabla f(x_0)^T(x-x_0),"
where is the gradient of
at
.
Example: a linear approximation to a non-linear function.
Other sources of linear models
Linearity can arise from a simple change of variables. This is best illustrated with a specific example.
Example: Power laws.