Here's a mnemonic to help you remember the formula for arc length: from calculus, you should recognize the "formula"
Δf≈f′(t)Δt.
This still works for vector functions:
Δr≈r′(t)Δt⟹Δs≈∥r′(t)∥Δt,
after taking magnitudes. Thus, this becomes
ds=∥r′(t)∥dt⟹L=∫0Lds=∫ab∥r′(t)∥dt
after integrating both sides. This should hopefully give a little intuition on why arc length is defined the way it is.
Arc Length Parametrization
A big source of confusion for me when I first learned the material is the difference between t and s. This is because when we write r′(s), we really mean dsdr(s) and notdtdr(s) (notice that the first derivative is taken with respect to arc length, s, instead of t).
The distinction is really important when doing certain calculations. For example, to calculate curvature, you have
κ=∥∥dsdT∥∥=∥∥dtdT∥∥.
If you want to use the t-derivative to calculate arc length, then you will want to sue the chain rule instead: