1,757 questions
- Bountied 0
- Unanswered
- Frequent
- Score
- Trending
- Week
- Month
- Unanswered (my tags)
0
votes
0
answers
46
views
rapidminer attribute not found but it shows in display
I am trying to filter my data based on the attribute outlier_flag but it says it does not exist even though it clearly does:
As you can see RapidMiner is refusing to recognize outlier_flag as an ...
0
votes
0
answers
86
views
Performance of OpenSearch K-NN Search with Additional Filters (e.g., Metadata Conditions)
I came across this AWS blog on OpenSearch K-NN — very helpful!
I'm exploring K-NN queries with filters (e.g., status = 'active') and wondering about performance implications.
In pgvector, there's an &...
0
votes
1
answer
164
views
How to apply KNN vector search in Elasticsearch only on documents matching a prior query?
I'm using Elasticsearch for semantic search with dense vectors (dense_vector + knn_search). I also perform a traditional keyword/fuzzy search to filter relevant documents.
Now I want to apply KNN ...
0
votes
1
answer
183
views
How to validate search terms when using embedding to look for objects in images
I have a search on my site that does both tradition full text search and searches using embeddings. So, for example, when you search 'red balloon' I want both the text and image results. The problem ...
0
votes
0
answers
48
views
elasticSearch vector retrieval(ES KNN) has no slow logs
The knn queries in the ES client are extremely slow. Although I have enabled the slow log configuration, no slow logs for knn queries are being generated.
Below is the slow log configuration
PUT /...
0
votes
0
answers
55
views
Elastic Search KNN Semantic Search with pre stored embeddings returing the same score for every hit
I was following this documentation page from Elastic Search
https://www.elastic.co/guide/en/elasticsearch/reference/current/bring-your-own-vectors.html
I have stored the vectors already and tried to ...
2
votes
1
answer
98
views
KNN algorithm with range estimation almost always returns more than K points
I am writing an algorithm on spatial data processing using an R-tree for a university project, where we are tasked to write a KNN algorithm as a range query using range estimation. We were not ...
1
vote
1
answer
233
views
How to implement KNNImputer in GPU?
I’m working with a large dataset on Kaggle and want to speed up the imputation process by using GPU acceleration for KNN imputation. My current approach uses the CPU-based KNNImputer from sklearn, but ...
2
votes
1
answer
121
views
Apache Sedona ST_KNN reports java.lang.NoClassDefFoundError
I use Apache Sedona to process map data in my work. After reading the docs about Sedona KNN, I really want to have a try to use KNN. however, I even not able to run the example code in the doc.
Below ...
0
votes
0
answers
59
views
KDTree buildBalancedTree infinite recursion issue
I tried to speed up my KD tree by implementing balancing and bounding boxes, but now it can't even build the tree, and I don't understand why.
Input Example
Here's how I provide the input:
n and dim = ...
1
vote
1
answer
142
views
KNeighborsClassifier predict throws "Expected 2D array, got 1D array instead"
I am writing an image similarity algorithm. I am using cv2.calcHist to extract image features. After the features are created I save them to a json file as a list of numpy.float64:
list(numpy.float64(...
0
votes
1
answer
262
views
Elastic KNN search num candidates. How are the candidates selected?
I'm trying to understand the "num_candidates" parameter when using KNN search in elastic search.
What I undertand reading some posts/documentation I will list at the end is that ES select &...
1
vote
2
answers
320
views
Custom k-Nearest Neighbor (kNN) slow implementation
I'm writing for an advice about a kNN implementation I made.
I'm studying a dataset representing a multiple choice question exam and my variable are an identifier of the student, the course, the year ...
0
votes
1
answer
285
views
Using an iterator to get the next(k) in FAISS similarity search
In FAISS after constructing an index (IVF, HNSW, or LSH) you can obtain the k-nearest neighbour based on a query q_x.
After my initial vector search in FAISS of let's say k = 50, I want to get more NN ...
0
votes
0
answers
91
views
Numba JIT code runs twice as slow as original code
I am attempting to optimize a function I wrote to get the k Nearest Neighbors of a vector in python using Numba JIT. However, the Numba version of my code runs twice as slow as the original code. I ...