I cannot understand from these slides why the SVD is applied to the least squares problem.
And then this follows:
And here I don't understand why the derivative of the residuals was taken. Also, is the idea in that graph to take the projection of y in order to minimize the error?
Here is my humble attempt to explain this...
The first slide does not yet explain how the SVD is related to LS. It states that any ordinary matrix X can be factored, via the singular value decomposition, into two orthogonal matrices and a diagonal matrix (only its diagonal elements, the singular values, are nonzero), which is convenient for computation.
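In symbols (this is the standard statement of the SVD, not something specific to these slides):

$$ X = U \Sigma V^\top, $$

where $U$ and $V$ are orthogonal and $\Sigma$ is diagonal, its diagonal entries being the singular values of $X$.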
Slide 2 shows the computation that is then carried out using this diagonal matrix, as in the sketch below.
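As a concrete illustration, here is a minimal numpy sketch of solving least squares via the SVD. The variable names (X, y, beta) and the synthetic data are my own, not taken from the slides:

```python
import numpy as np

# Small synthetic problem, purely for illustration
rng = np.random.default_rng(0)
X = rng.normal(size=(10, 3))   # design matrix
y = rng.normal(size=10)        # response vector

# Thin SVD: X = U @ diag(s) @ Vt, with only the diagonal of Sigma nonzero
U, s, Vt = np.linalg.svd(X, full_matrices=False)

# Least-squares solution via the pseudoinverse:
# beta = V @ diag(1/s) @ U.T @ y; inverting Sigma is just 1/s elementwise,
# which is why the diagonal form is so convenient
beta = Vt.T @ ((U.T @ y) / s)

# Sanity check against numpy's built-in least-squares solver
beta_ref, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.allclose(beta, beta_ref))  # True
```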
The explanation is on slide 3: minimizing the norm of r is equivalent to minimizing its square, which is the RSS (because x -> x^2 is an increasing function for x > 0). Minimizing the RSS works like minimizing any "nice" function: you differentiate it and set the derivative equal to 0.
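Spelling out that differentiation (a standard derivation; I am writing $\beta$ for the coefficient vector, which is my notation, not necessarily the slides'):

$$ \mathrm{RSS}(\beta) = \|r\|^2 = (y - X\beta)^\top (y - X\beta), \qquad \frac{\partial\,\mathrm{RSS}}{\partial \beta} = -2\,X^\top (y - X\beta) = 0 \;\Longrightarrow\; X^\top X \hat\beta = X^\top y. $$

This also answers your question about the graph: the condition $X^\top (y - X\hat\beta) = 0$ says the residual is orthogonal to every column of $X$, so the fitted value $\hat y = X\hat\beta$ is exactly the orthogonal projection of $y$ onto the column space of $X$. Yes, that is the idea.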