CS-Notes/docs/notes/Leetcode 题解 - 排序.md

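<!-- GFM-TOC -->
* [Quickselect](#quickselect)
* [Heap](#heap)
    * [1. Kth Element](#1-kth-element)
* [Bucket Sort](#bucket-sort)
    * [1. Top K Frequent Elements](#1-top-k-frequent-elements)
    * [2. Sort Characters by Frequency](#2-sort-characters-by-frequency)
* [Dutch National Flag Problem](#dutch-national-flag-problem)
    * [1. Sort Colors](#1-sort-colors)
<!-- GFM-TOC -->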
# Quickselect

Quickselect solves the **Kth Element** problem, that is, finding the element that would sit at position K if the array were sorted.

It can be implemented with quicksort's partition(). Shuffle the array first; otherwise an unlucky input order, such as an already sorted array, leads to the worst-case time complexity of O(N<sup>2</sup>).
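The shuffling step could look like the following minimal sketch. The class name `ShuffleSketch` and the helper `shuffle` are hypothetical names chosen for illustration; they are not part of the solutions in these notes.

```java
import java.util.Random;

final class ShuffleSketch {
    // Fisher-Yates shuffle: every permutation of the array is equally likely.
    static void shuffle(int[] a, Random random) {
        for (int i = a.length - 1; i > 0; i--) {
            int j = random.nextInt(i + 1); // uniform index in [0, i]
            int t = a[i];
            a[i] = a[j];
            a[j] = t;
        }
    }
}
```

Calling `ShuffleSketch.shuffle(nums, new Random())` once before the first partition() call is one way to make the expected running time independent of the input order.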
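# Heap

A heap solves the **TopK Elements** problem, that is, finding the K smallest elements. The K smallest elements are kept in a max-heap, whose top is the largest element currently in the heap. The procedure is to keep inserting new elements into the max-heap and, whenever the heap holds more than K elements, to remove the top, which is the largest element in the heap. The elements that remain are the K smallest of all elements added so far. Inserting an element and removing the top each take O(log<sub>2</sub>N) time, where N is the number of elements in the heap (at most K + 1 here).

A heap can also solve the Kth Element problem. Once the heap of the K smallest elements has been built, its top is the K-th smallest element, because the heap is a max-heap. Symmetrically, a min-heap that keeps the K largest elements has the K-th largest element at its top.

Quickselect can also solve the TopK Elements problem: after finding the Kth Element, traverse the array once more, and every element less than or equal to the Kth Element belongs to the TopK Elements. If the Kth Element's value occurs more than once, take only as many copies as are needed to reach K elements.

So both quickselect and a heap can solve the Kth Element and TopK Elements problems.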
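The max-heap procedure for the K smallest elements can be sketched as follows. This is only an illustration under assumptions: `TopKSketch`, `kSmallest` and `kthSmallest` are hypothetical names, and `kthSmallest` assumes 1 <= k <= nums.length.

```java
import java.util.Collections;
import java.util.PriorityQueue;

final class TopKSketch {
    // Returns the k smallest values of nums, largest first.
    static int[] kSmallest(int[] nums, int k) {
        if (k <= 0) {
            return new int[0];
        }
        // reverseOrder() turns Java's default min-heap into a max-heap.
        PriorityQueue<Integer> maxHeap = new PriorityQueue<>(Collections.reverseOrder());
        for (int val : nums) {
            maxHeap.add(val);
            if (maxHeap.size() > k) {
                maxHeap.poll(); // drop the largest of the k + 1 candidates
            }
        }
        int[] result = new int[maxHeap.size()];
        for (int i = 0; i < result.length; i++) {
            result[i] = maxHeap.poll();
        }
        return result;
    }

    // The top of the max-heap (first value polled) is the k-th smallest value.
    static int kthSmallest(int[] nums, int k) {
        return kSmallest(nums, k)[0];
    }
}
```

The heap never holds more than K + 1 elements, which is where the O(NlogK) time and O(K) space bounds for the heap approach come from.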
## 1. Kth Element
215\. Kth Largest Element in an Array (Medium)

[Leetcode](https://leetcode.com/problems/kth-largest-element-in-an-array/description/) / [力扣](https://leetcode-cn.com/problems/kth-largest-element-in-an-array/description/)
```text
Input: [3,2,1,5,6,4] and k = 2
Output: 5
```
Problem: find the k-th largest element, that is, the k-th element counting from the end of the sorted array.

**Sorting**: time complexity O(NlogN), space complexity O(1).
```java
import java.util.Arrays;

public int findKthLargest(int[] nums, int k) {
    Arrays.sort(nums); // ascending order
    return nums[nums.length - k]; // the k-th largest sits k places from the end
}
```
**Heap**: time complexity O(NlogK), space complexity O(K).

```java
import java.util.PriorityQueue;

public int findKthLargest(int[] nums, int k) {
    PriorityQueue<Integer> pq = new PriorityQueue<>(); // min-heap
    for (int val : nums) {
        pq.add(val);
        if (pq.size() > k) { // keep the heap size at K
            pq.poll();
        }
    }
    return pq.peek(); // smallest of the K largest elements, i.e. the k-th largest
}
```
**Quickselect**: time complexity O(N) on average, space complexity O(1).

```java
public int findKthLargest(int[] nums, int k) {
    k = nums.length - k; // index of the k-th largest element in ascending order
    int l = 0, h = nums.length - 1;
    while (l < h) {
        int j = partition(nums, l, h);
        if (j == k) {
            break;
        } else if (j < k) {
            l = j + 1;
        } else {
            h = j - 1;
        }
    }
    return nums[k];
}

// Partitions a[l..h] around the pivot a[l] and returns the pivot's final index.
private int partition(int[] a, int l, int h) {
    int i = l, j = h + 1;
    while (true) {
        while (a[++i] < a[l] && i < h) ;
        while (a[--j] > a[l] && j > l) ;
        if (i >= j) {
            break;
        }
        swap(a, i, j);
    }
    swap(a, l, j);
    return j;
}

private void swap(int[] a, int i, int j) {
    int t = a[i];
    a[i] = a[j];
    a[j] = t;
}
```
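To see roughly where the linear average of quickselect comes from, assume (as a simplification, not a proof) that each partition() call halves the range that still has to be searched. The total work is then bounded by a geometric series:

$$
N + \frac{N}{2} + \frac{N}{4} + \cdots \le 2N = O(N)
$$

In the worst case each call shrinks the range by only one element, and the sum becomes

$$
N + (N - 1) + \cdots + 1 = \frac{N(N + 1)}{2} = O(N^2)
$$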
# Bucket Sort

## 1. Top K Frequent Elements

347\. Top K Frequent Elements (Medium)

[Leetcode](https://leetcode.com/problems/top-k-frequent-elements/description/) / [力扣](https://leetcode-cn.com/problems/top-k-frequent-elements/description/)

```html
Given [1,1,1,2,2,3] and k = 2, return [1,2].
```
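Set up a number of buckets, each holding the numbers that occur with the same frequency. The bucket index is the frequency, so bucket i holds the numbers that occur i times.

Once every number has been placed in its bucket, traverse the buckets from back to front; the first k numbers encountered are the k most frequent ones.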
```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public int[] topKFrequent(int[] nums, int k) {
    // Count how often each number occurs.
    Map<Integer, Integer> frequencyForNum = new HashMap<>();
    for (int num : nums) {
        frequencyForNum.put(num, frequencyForNum.getOrDefault(num, 0) + 1);
    }
    // buckets[i] holds the numbers that occur exactly i times.
    List<Integer>[] buckets = new ArrayList[nums.length + 1];
    for (int key : frequencyForNum.keySet()) {
        int frequency = frequencyForNum.get(key);
        if (buckets[frequency] == null) {
            buckets[frequency] = new ArrayList<>();
        }
        buckets[frequency].add(key);
    }
    // Walk the buckets from the highest frequency down until k numbers are collected.
    List<Integer> topK = new ArrayList<>();
    for (int i = buckets.length - 1; i >= 0 && topK.size() < k; i--) {
        if (buckets[i] == null) {
            continue;
        }
        if (buckets[i].size() <= (k - topK.size())) {
            topK.addAll(buckets[i]);
        } else {
            topK.addAll(buckets[i].subList(0, k - topK.size()));
        }
    }
    int[] res = new int[k];
    for (int i = 0; i < k; i++) {
        res[i] = topK.get(i);
    }
    return res;
}
```
## 2. Sort Characters by Frequency

451\. Sort Characters By Frequency (Medium)

[Leetcode](https://leetcode.com/problems/sort-characters-by-frequency/description/) / [力扣](https://leetcode-cn.com/problems/sort-characters-by-frequency/description/)

```html
Input:
"tree"
Output:
"eert"
Explanation:
'e' appears twice while 'r' and 't' both appear once.
So 'e' must appear before both 'r' and 't'. Therefore "eetr" is also a valid answer.
```
```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public String frequencySort(String s) {
    // Count how often each character occurs.
    Map<Character, Integer> frequencyForNum = new HashMap<>();
    for (char c : s.toCharArray()) {
        frequencyForNum.put(c, frequencyForNum.getOrDefault(c, 0) + 1);
    }
    // frequencyBucket[f] holds the characters that occur exactly f times.
    List<Character>[] frequencyBucket = new ArrayList[s.length() + 1];
    for (char c : frequencyForNum.keySet()) {
        int f = frequencyForNum.get(c);
        if (frequencyBucket[f] == null) {
            frequencyBucket[f] = new ArrayList<>();
        }
        frequencyBucket[f].add(c);
    }
    // Emit characters from the highest frequency down, each repeated i times.
    StringBuilder str = new StringBuilder();
    for (int i = frequencyBucket.length - 1; i >= 0; i--) {
        if (frequencyBucket[i] == null) {
            continue;
        }
        for (char c : frequencyBucket[i]) {
            for (int j = 0; j < i; j++) {
                str.append(c);
            }
        }
    }
    return str.toString();
}
```
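# Dutch National Flag Problem

The Dutch national flag has three colors: red, white and blue.

Given balls of three colors, the goal of the algorithm is to arrange them in the correct color order. It is a variant of three-way partitioning quicksort. In three-way partitioning quicksort, each partition splits the array into three ranges: less than, equal to, and greater than the pivot. This algorithm instead splits the array into three ranges: red, white and blue.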
<div align="center"> <img src="https://cs-notes-1256109796.cos.ap-guangzhou.myqcloud.com/7a3215ec-6fb7-4935-8b0d-cb408208f7cb.png"/> </div><br>
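For comparison, three-way partitioning quicksort itself can be sketched as below. This is an illustrative sketch rather than code from these notes: the class name `ThreeWayQuickSortSketch` is hypothetical, and the first element of each range is assumed as the pivot.

```java
final class ThreeWayQuickSortSketch {
    static void sort(int[] a) {
        sort(a, 0, a.length - 1);
    }

    private static void sort(int[] a, int lo, int hi) {
        if (lo >= hi) {
            return;
        }
        int pivot = a[lo];
        // Invariant: a[lo..lt-1] < pivot, a[lt..i-1] == pivot, a[gt+1..hi] > pivot.
        int lt = lo, i = lo + 1, gt = hi;
        while (i <= gt) {
            if (a[i] < pivot) {
                swap(a, lt++, i++);
            } else if (a[i] > pivot) {
                swap(a, i, gt--); // the swapped-in element is still unexamined
            } else {
                i++;
            }
        }
        // Elements equal to the pivot are already in place; recurse on the rest.
        sort(a, lo, lt - 1);
        sort(a, gt + 1, hi);
    }

    private static void swap(int[] a, int i, int j) {
        int t = a[i];
        a[i] = a[j];
        a[j] = t;
    }
}
```

With only three distinct values and the middle value acting as the pivot, a single partitioning pass already sorts the array, which is the idea behind the Dutch national flag algorithm.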
## 1. Sort Colors

75\. Sort Colors (Medium)

[Leetcode](https://leetcode.com/problems/sort-colors/description/) / [力扣](https://leetcode-cn.com/problems/sort-colors/description/)

```html
Input: [2,0,2,1,1,0]
Output: [0,0,1,1,2,2]
```

Problem: the array contains only three colors, represented by 0, 1 and 2.
```java
public void sortColors(int[] nums) {
    // nums[0..zero] holds 0s, nums[zero+1..one-1] holds 1s, nums[two..] holds 2s.
    int zero = -1, one = 0, two = nums.length;
    while (one < two) {
        if (nums[one] == 0) {
            swap(nums, ++zero, one++);
        } else if (nums[one] == 2) {
            swap(nums, --two, one); // the swapped-in element is still unexamined
        } else {
            ++one;
        }
    }
}

private void swap(int[] nums, int i, int j) {
    int t = nums[i];
    nums[i] = nums[j];
    nums[j] = t;
}
```