\$\begingroup\$

I wanted to do the following:

1. Count the frequencies of words in a text (over 5 letters).
2. Invert the map of words to frequencies, grouping together words that have the same frequency.
3. Sort the inverted map by key in descending order and take the top 25.

Here is the code I came up with. Did I reinvent the wheel with map-invert-preserve-dups? Is there a more concise way to do any of this? Am I doing anything unnecessary (e.g. `(~k))?

(defn map-invert-preserve-dups
  [m]
  (reduce
    (fn [m [k v]]
      (if (contains? m v)
        (assoc m v (cons k (get m v)))
        (assoc m v `(~k))))
    {}
    m))

(->> "http://www.weeklyscript.com/Pulp%20Fiction.txt"
     (slurp)
     (re-seq #"\w{5,}")
     (frequencies)
     (map-invert-preserve-dups)
     (sort)
     (reverse)
     (take 25))

edited Mar 24, 2016 at 22:44

Jamal

asked Dec 10, 2011 at 2:22

noahz

\$\endgroup\$

1 Answer

\$\begingroup\$

Well, the most obvious thing to fix is indeed map-invert-preserve-dups. The whole thing can be written more simply as:

(defn map-invert-preserve-dups [m]
  (apply merge-with into
         (for [[k v] m]
           {v [k]})))

The for expression yields a sequence of single-entry maps like ({a [1]} {b [2]} {a [5]}). apply then passes all of those maps to merge-with as arguments, with into as the combining function. If you look up the definition of merge-with, you can see that this basically means: "Merge all of these maps together, and if the same key occurs in more than one map, with values x and y, make its value (into x y)". For the maps above, the result is {a [1 5], b [2]}.
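To make the merge-with behaviour concrete, here is a minimal sketch that traces it on a tiny hand-made map. The names sample-freqs, merged, inverted and top-n are made up for illustration and appear in neither the question nor the answer. The sort-by step is only a suggestion for shortening the sort + reverse pair in the question, not something the answer proposes.

```clojure
;; The answer's definition, repeated so this block runs on its own
;; (named to match the call in the question's pipeline).
(defn map-invert-preserve-dups [m]
  (apply merge-with into
         (for [[k v] m]
           {v [k]})))

;; Hypothetical sample data standing in for the output of `frequencies`.
(def sample-freqs {"vincent" 3, "jules" 3, "marsellus" 1})

;; What merge-with does with the single-entry maps produced by `for`:
;; `into` is only called when a key appears in more than one map.
(def merged
  (merge-with into {3 ["vincent"]} {3 ["jules"]} {1 ["marsellus"]}))
;; merged is {3 ["vincent" "jules"], 1 ["marsellus"]}

;; Words sharing a frequency end up in one vector. Their order inside the
;; vector follows the input map's iteration order, so treat it as unspecified.
(def inverted (map-invert-preserve-dups sample-freqs))

;; Assumption: sorting the entries by key with > as the comparator
;; can stand in for the (sort) (reverse) steps of the question.
(defn top-n [n inverted-map]
  (take n (sort-by key > inverted-map)))

(map key (top-n 25 inverted))
;; => (3 1)
```

Two differences from the reduce-based version in the question are worth keeping in mind. The grouped words come back as vectors rather than lists built with cons, and for an empty input map this version returns nil instead of {}, because merge-with called with no maps returns nil. In the pipeline above that should be harmless, since sort and take treat nil as an empty collection.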

edited Mar 24, 2016 at 22:51

answered Dec 10, 2011 at 3:42

amalloy

\$\endgroup\$

Clojure code adapted from map-invert
