1
$\begingroup$

The motivation for k-d trees and ball trees is that k-nearest neighbors involves eliciting the distances between a particular data point and every other data point, which becomes inefficient as the number of data points grows large.

However, a step in the ball tree algorithm involves finding the least-near neighbor to a data point, and also the least-near neighbor to that least-near neighbor. This also feels like it should involve eliciting distances over the entire set of data points (twice). Furthermore, any clever shortcut to one of these feels like it should also be a clever shortcut to the other, by symmetry.

How does this address the inefficiencies in k-nearest neighbors rather than making them worse?

asked Aug 2, 2024 at 17:03
$\endgroup$

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.

Your Answer

Draft saved
Draft discarded

Sign up or log in

Sign up using Google
Sign up using Email and Password

Post as a guest

Required, but never shown

Post as a guest

Required, but never shown

By clicking "Post Your Answer", you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.