How the Redis SCAN Command Implements Its Limited Guarantees (2)

Date: 2020-06-02  Category: Programmer's Life

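But if the returned cursor is not one of these four values, say it is 7, then masking it with the size-4 table's mask (7 & 11 in binary, i.e. 7 & 3) yields 3. Traversal therefore resumes at bucket 3 of the size-4 table. That bucket holds the contents of both bucket 3 and bucket 7 of the size-8 table, so the entries that came from bucket 3 of the size-8 table are read a second time.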
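The folding described above can be sketched in a few lines of C. This is only an illustration: `fold_cursor` and `print_merged_buckets` are hypothetical helper names, not Redis functions, and the sketch assumes table sizes are powers of two so that the mask is `size - 1`.

```c
#include <stdio.h>

/* Hypothetical helper: fold a cursor taken from a larger table into a
 * smaller table by masking it with the smaller table's mask (size - 1). */
unsigned long fold_cursor(unsigned long cursor, unsigned long small_size) {
    return cursor & (small_size - 1);
}

/* Hypothetical helper: print every bucket of the larger table that ends up
 * in small_bucket after shrinking, and return how many such buckets exist. */
int print_merged_buckets(unsigned long small_bucket,
                         unsigned long small_size,
                         unsigned long large_size) {
    int count = 0;
    for (unsigned long b = 0; b < large_size; b++) {
        if (fold_cursor(b, small_size) == small_bucket) {
            printf("large bucket %lu -> small bucket %lu\n", b, small_bucket);
            count++;
        }
    }
    return count;
}
```

Calling `print_merged_buckets(3, 4, 8)` lists buckets 3 and 7 of the size-8 table, which is why resuming at cursor 7 re-reads what used to be bucket 3.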

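So when a Redis hash table is rehashed from a smaller size to a larger one, SCAN neither repeats nor misses elements. When it is rehashed from a larger size to a smaller one, SCAN may return some elements more than once, but it still will not miss any.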
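To check this claim on a small case, here is a simplified simulation. It is a toy model, not Redis code, and it rests on several assumptions: the resize completes entirely between two SCAN calls, the table holds exactly one key per bucket of the larger table (the key is identified by that bucket index), and the names `rev64`, `next_cursor` and `simulate_resize` are made up for this sketch.

```c
#include <stdint.h>

/* Reverse the bits of a 64-bit value (a simple loop, written for clarity). */
uint64_t rev64(uint64_t v) {
    uint64_t r = 0;
    for (int i = 0; i < 64; i++) {
        r = (r << 1) | (v & 1);
        v >>= 1;
    }
    return r;
}

/* Next cursor when no rehash is in progress: set the bits above the mask,
 * reverse, increment, reverse again (reverse-binary increment). */
uint64_t next_cursor(uint64_t v, uint64_t mask) {
    v |= ~mask;
    v = rev64(v);
    v++;
    v = rev64(v);
    return v;
}

/* Scan a table of old_size buckets from cursor 0. As soon as SCAN returns
 * the cursor switch_at, the table is assumed to have been fully resized to
 * new_size. visits[k] counts how often key k is returned; the caller must
 * pass a zeroed array with max(old_size, new_size) entries. */
void simulate_resize(uint64_t old_size, uint64_t new_size,
                     uint64_t switch_at, int *visits) {
    uint64_t large = old_size > new_size ? old_size : new_size;
    uint64_t size = old_size;
    uint64_t v = 0;
    do {
        uint64_t bucket = v & (size - 1);
        /* A bucket of the current table holds every key that maps to it. */
        for (uint64_t key = 0; key < large; key++) {
            if ((key & (size - 1)) == bucket) visits[key]++;
        }
        v = next_cursor(v, size - 1);
        if (v == switch_at) size = new_size;
    } while (v != 0);
}
```

With `simulate_resize(8, 4, 7, visits)` every key is visited at least once and key 3 is visited twice. With `simulate_resize(4, 8, 2, visits)` every key is visited exactly once. A single scenario is of course not a proof, but it matches the behavior described above.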

While a rehash is in progress, the cursor is computed as follows:

    /* Make sure t0 is the smaller and t1 is the bigger table */
    if (t0->size > t1->size) {
        t0 = &d->ht[1];
        t1 = &d->ht[0];
    }

    m0 = t0->sizemask;
    m1 = t1->sizemask;

    /* Emit entries at cursor */
    if (bucketfn) bucketfn(privdata, &t0->table[v & m0]);
    de = t0->table[v & m0];
    while (de) {
        next = de->next;
        fn(privdata, de);
        de = next;
    }

    /* Iterate over indices in larger table that are the expansion
     * of the index pointed to by the cursor in the smaller table */
    do {
        /* Emit entries at cursor */
        if (bucketfn) bucketfn(privdata, &t1->table[v & m1]);
        de = t1->table[v & m1];
        while (de) {
            next = de->next;
            fn(privdata, de);
            de = next;
        }

        /* Increment the reverse cursor not covered by the smaller mask.*/
        v |= ~m1;
        v = rev(v);
        v++;
        v = rev(v);

        /* Continue while bits covered by mask difference is non-zero */
    } while (v & (m0 ^ m1));

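The algorithm makes sure that t0 is the smaller hash table, swapping t0 and t1 if it is not. It first traverses the bucket in t0 that the cursor points to, and then traverses the larger table t1.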

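Computing the next cursor works almost the same way as in the non-rehashing case, except that m0 is replaced by m1, the mask of the larger table, and one extra loop condition is added: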

v & (m0 ^ m1)

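For size 4, m0 is 00000011; for size 8, m1 is 00000111. m0 ^ m1 evaluates to 00000100, which selects exactly the bits where the two masks differ. The condition checks whether the cursor has a 1 in any of those bits.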
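The mask arithmetic above can be written out as a small sketch. `mask_diff` and `must_continue` are hypothetical names chosen for illustration, and the code assumes both sizes are powers of two, so that each mask is simply `size - 1`.

```c
#include <stdint.h>

/* Hypothetical helper: the bits covered by the larger table's mask but not
 * by the smaller table's mask. Sizes are assumed to be powers of two. */
uint64_t mask_diff(uint64_t size0, uint64_t size1) {
    uint64_t m0 = size0 - 1;
    uint64_t m1 = size1 - 1;
    return m0 ^ m1;
}

/* Hypothetical helper: non-zero while the cursor still has a bit set inside
 * the mask difference, i.e. while the loop over the larger table goes on. */
int must_continue(uint64_t v, uint64_t size0, uint64_t size1) {
    return (v & mask_diff(size0, size1)) != 0;
}
```

For sizes 4 and 8, `mask_diff` returns 4 (00000100). A cursor of 6 (00000110) keeps the loop going, while a cursor of 1 (00000001) ends it.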

Suppose the cursor returned is 2 and a rehash is in progress, with the size changing from 4 to 8. The two masks then differ in the third-lowest bit.

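First bucket 2 in t0 is traversed, then bucket 2 in t1. The formula gives 6 (00000110) as the next cursor; its third-lowest bit is 1, so the loop continues and bucket 6 in t1 is traversed. The next cursor is then computed as 1, and the loop ends.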
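The walk-through above can be reproduced with a stripped-down version of the loop. This is a sketch modeled on the excerpt, not the Redis function itself: it records bucket indices instead of walking entries, `rev64` is a simple stand-in for `rev`, and `scan_step_rehashing` with its parameters is a hypothetical name.

```c
#include <stdint.h>
#include <stdio.h>

/* Reverse the bits of a 64-bit value (simple stand-in for rev()). */
uint64_t rev64(uint64_t v) {
    uint64_t r = 0;
    for (int i = 0; i < 64; i++) {
        r = (r << 1) | (v & 1);
        v >>= 1;
    }
    return r;
}

/* One SCAN step while a rehash is in progress. small_size and large_size
 * are the sizes of t0 and t1 (powers of two). The t1 buckets visited are
 * stored in t1_visited, which must have room for large_size / small_size
 * entries; their count is written to *n_visited. Returns the next cursor. */
uint64_t scan_step_rehashing(uint64_t v, uint64_t small_size,
                             uint64_t large_size,
                             uint64_t *t1_visited, int *n_visited) {
    uint64_t m0 = small_size - 1;
    uint64_t m1 = large_size - 1;
    int n = 0;

    /* The smaller table contributes exactly one bucket. */
    printf("t0 bucket %llu\n", (unsigned long long)(v & m0));

    /* The larger table contributes every bucket that expands from it. */
    do {
        t1_visited[n++] = v & m1;
        printf("t1 bucket %llu\n", (unsigned long long)(v & m1));

        /* Reverse-binary increment over the bits of the larger mask. */
        v |= ~m1;
        v = rev64(v);
        v++;
        v = rev64(v);
    } while (v & (m0 ^ m1));

    *n_visited = n;
    return v;
}
```

Calling it with `v = 2`, `small_size = 4` and `large_size = 8` visits bucket 2 of t0, then buckets 2 and 6 of t1, and returns 1 as the next cursor, matching the steps described above.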

So while a rehash is in progress, both hash tables are traversed, which prevents elements from being missed.
