692 Top K Frequent Words

Given a non-empty list of words, return the k most frequent elements.

Your answer should be sorted by frequency from highest to lowest. If two words have the same frequency, then the word with the lower alphabetical order comes first.

Example 1:

Input: ["i", "love", "leetcode", "i", "love", "coding"], k = 2
Output: ["i", "love"]
Explanation: "i" and "love" are the two most frequent words.
    Note that "i" comes before "love" due to a lower alphabetical order.

Example 2:

Input: ["the", "day", "is", "sunny", "the", "the", "the", "sunny", "is", "is"], k = 4
Output: ["the", "is", "sunny", "day"]
Explanation: "the", "is", "sunny" and "day" are the four most frequent words,
    with the number of occurrence being 4, 3, 2 and 1 respectively.

Note: You may assume k is always valid, 1 ≤ k ≤ number of unique elements. Input words contain only lowercase letters. Follow up: Try to solve it in O(n log k) time and O(n) extra space. Can you solve it in O(n) time with only O(k) extra space?

The Idea: Python makes this really easy. First accumulate the frequencies and then sort by value descending, and then secondarily by key in ascending. Finally return the slice of the first k values from the sorted list of tuples.

Complexity: O(N + NlogN + k) time and O(N) space

def topKFrequent(self, words, k):
    """
    :type words: List[str]
    :type k: int
    :rtype: List[str]
    """

    m = collections.defaultdict(int)
    for word in words:
        m[word] += 1

    m_sorted = sorted(m.items(),key=lambda x:(-x[1], x[0]))
    first_k = m_sorted[:k]
    return [first for first, _ in first_k]

Last updated