My question is about the logistic regression cost function from Andrew's ML course (http://feature-space.com/en/document50.pdf, page 5):
$\text{cost} = \frac{1}{m}\left[ -y \times \log\!\left(h_\theta(x)\right) - (1-y) \times \log\!\left(1-h_\theta(x)\right) \right]$
The vector $y$ holds the digit labels (1–10), so if we plug these values into the cost function, it takes on values that don't make sense to me. For instance, if $y=4,ドル then the cost function contains both log terms:
$\text{cost}(y=4) = \frac{1}{m}\left[ -4 \times \log\!\left(h_\theta(x)\right) - (1-4) \times \log\!\left(1-h_\theta(x)\right) \right]$
From Andrew's lecture, I remember him saying that only one of the log terms remains in the cost function: if the example belongs to the positive class, the $\log(h_\theta(x))$ term remains; otherwise the $\log(1-h_\theta(x))$ term remains.
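In other words, for a single training example with $y \in \{0, 1\}$ (this is my reading of the lecture, written out explicitly), only one case applies at a time:

$$\text{cost} = \begin{cases} -\log\!\left(h_\theta(x)\right) & \text{if } y = 1 \\ -\log\!\left(1-h_\theta(x)\right) & \text{if } y = 0 \end{cases}$$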
Please help me see where I'm going wrong.
Comment: Remember that $y$ is a vector in this case, not a scalar. – Bar, Jun 10, 2015
1 Answer
$y$ always takes on values of 1 or 0, as you noted. For the multi-class problem, you're going to solve the "one vs. all" case. You'll need to transform your $y$ vector into a vector of 1's and 0's depending on the class you are minimizing for. So for the number 5, you'll solve for $P(y=5)$ vs. $P(y \ne 5)$. You repeat that for all your digits, and you come up with 10 different hypotheses $h_\theta^{(i)},ドル i.e. $h_\theta^{(1)},ドル $h_\theta^{(2)},ドル ...
Also, the following is from your link at the bottom of page 8:
When training the classifier for class $k \in \{1, \ldots, K\},ドル you will want an m-dimensional vector of labels $y,ドル where $y_j \in \{0, 1\}$ indicates whether the j-th training instance belongs to class $k$ ($y_j = 1$), or if it belongs to a different class ($y_j = 0$).
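To make that transformation concrete, here is a minimal sketch in Python/NumPy (not from the course itself, which uses Octave; the names `one_vs_all_labels`, `y_bin`, and `cost` are just for illustration) of turning the digit labels into the 0/1 vector described above and plugging it into the binary cost:

```python
import numpy as np

# Digit labels for m = 6 training instances; the course codes the digit 0 as the label 10.
y = np.array([5, 10, 1, 5, 3, 7])

def one_vs_all_labels(y, k):
    """Return a 0/1 vector: 1 where the instance belongs to class k, 0 otherwise."""
    return (y == k).astype(int)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(theta, X, y_bin):
    """Binary logistic regression cost for one of the K classifiers."""
    m = len(y_bin)
    h = sigmoid(X @ theta)  # h_theta(x) for every instance
    return (1.0 / m) * np.sum(-y_bin * np.log(h) - (1 - y_bin) * np.log(1 - h))

# Training the classifier for class k = 5: y is replaced by a 0/1 vector first.
y_bin = one_vs_all_labels(y, 5)
print(y_bin)  # [1 0 0 1 0 0]
```

You repeat this for $k = 1, \ldots, 10,ドル training one $\theta$ per class, and at prediction time pick the class whose classifier gives the highest $h_\theta(x)$.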