I want to talk about several parts this
week, including the difference between how machine learning and statistics work,
and considerations for Machine Learning.
First of all, what’s the difference between
these two seemingly similar yet different analysis? Linear regression analysis
in statistics use a method called ordinary least squares estimation, it is a
representative method; it finds the regression coefficients by minimizing the
distance between each observation and hyperplane and verifies if they are important
in the statistics. The equation was actually quite overwhelming to me, but I find
that it’s not that hard to understand when I looked up the definition, equation,
and how it’s done. To understand the whole process in statistics, understanding
what a regression coefficient is is the first and foremost step. It’s the most crucial
part of trying to make a prediction, and it is not possible to do it until we
have this coefficient which is a number indicating how much influence the
independent variable has to the dependent variable; we then use the independent
variable to calculate, predict the dependent variable and after finishing it, we
can get the result! The equation we use for calculating this result is as follows:
.
here is the dependent variable we want, you
can see the small arrow above the character y, for it is a number that is a predictive
number calculated by the regression analysis model rather than an actual
observation data, and
stands for the regression coefficient we are
introducing, which is also accompanied by
, a dependent
variable, when combined together, it signifies
that
is the expected value when the dependent
variable
There are two main considerations for performing
Machine Learning, one is underfitting, and the other one is overfitting.
Underfitting means that the model is too simple to adequately explain the input
data and is thus less predictive. I consider it quite severer because you
simply can’t predict things with this little amount of data. The judgement of
whether the model is underfitted is by examining the cost, which is the error
between the model’s predicted value and its actual value. The closer it is, the
better predictions it can offer. Adding additional training data is one way of
compensating the problem of underfitting. Overfitting, on the other hand, is a
case in which the model is too closely fitted to the training data. You may
consider it a good thing, and it’s what I thought when first looked at the
concept, having high in predictive power, it sounds pretty good and appealing
to me. However, when the model is applied to actual fields, the predictive
power is greatly reduced. I have come up with a simple example of the concept:
overfitting is the kind of player who always does good in practice but always gives
poor numbers and performances in actual games. The reason for this overfitting
problem is quite easy to understand, because major machine learning algorithms
build models in an inductive manner. As a result, how should we fix the problem?
The solution is to go through a series of model validation procedures to check
the validity and the feasibility of the model.

No comments:
Post a Comment