Cost Function
How is it different from a loss function?
Loss function $\subset$ Cost function $\subset$ Objective function (what the optimizer minimizes)
| Term | Loss Function | Cost Function |
|---|---|---|
| Scope | Individual sample | Entire dataset |
| Definition | Measures how wrong the prediction is for one example | Measures the total/average error of the model |
| Used for | Per-sample error | Overall model performance (what optimization minimizes) |
| Example | Squared error, absolute error, cross-entropy (for one sample) | Mean Squared Error over the whole dataset |
- Loss function tells you how wrong the prediction is at one point
- Cost function tells you how bad your model is overall
- During training, optimizers like SGD or Adam minimize the cost function (see the sketch below)
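A minimal NumPy sketch of the distinction (the data values here are made up purely for illustration):

```python
import numpy as np

# Toy regression data: m = 4 samples (hypothetical values)
y_true = np.array([3.0, -0.5, 2.0, 7.0])
y_pred = np.array([2.5,  0.0, 2.0, 8.0])

# Loss function: squared error for ONE sample
def squared_error_loss(y, y_hat):
    return (y - y_hat) ** 2

# Cost function: average of the per-sample losses over the WHOLE dataset (MSE)
def mse_cost(y, y_hat):
    return np.mean(squared_error_loss(y, y_hat))

print(squared_error_loss(y_true[0], y_pred[0]))  # loss at one point -> 0.25
print(mse_cost(y_true, y_pred))                  # cost over the dataset -> 0.375
```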
Put more simply,
- Loss function = score on one question of a test
- Cost function = overall average score on the whole test
- Loss Function
- Optima (local minima problem)
Example
Loss function (per sample)
$ \mathcal{L}^{(i)} = \left( y^{(i)} - \hat{y}^{(i)} \right)^2 $
Cost function (entire dataset)
$ J(\theta) = \frac{1}{m} \sum_{i=1}^{m} \mathcal{L}^{(i)} $
$J(\theta) = \frac{1}{m} \sum_{i=1}^{m} \left( y^{(i)} - \hat{y}^{(i)} \right)^2 $
$J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( y^{(i)} - \hat{y}^{(i)} \right)^2 $
- The 1/2 is added to make differentiation cleaner: it cancels the 2 that comes from the power rule, as worked out below
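Differentiating the 1/2-scaled cost shows the cancellation (assuming, for concreteness, a linear model $\hat{y}^{(i)} = \theta^\top x^{(i)}$, which these notes do not specify):

$ \frac{\partial J}{\partial \theta} = \frac{1}{2m} \sum_{i=1}^{m} 2 \left( \hat{y}^{(i)} - y^{(i)} \right) x^{(i)} = \frac{1}{m} \sum_{i=1}^{m} \left( \hat{y}^{(i)} - y^{(i)} \right) x^{(i)} $

The constant does not change where the minimum is; it only tidies the gradient that SGD or Adam uses.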