Insertion and promotion for tree-based PseudoLRU last-level caches


DOI:10.1145/2540708.2540733
Corpus ID: 1268408

@article{Jimnez2013InsertionAP, title={Insertion and promotion for tree-based PseudoLRU last-level caches}, author={Daniel A. Jim{\'e}nez}, journal={2013 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)}, year={2013}, pages={284-296}, url={https://api.semanticscholar.org/CorpusID:1268408}}

Daniel A. Jiménez
Published in MICRO, 7 December 2013
Computer Science

This paper presents a novel last-level cache replacement algorithm with approximately the same complexity and storage requirements as tree-based PseudoLRU, but with performance matching state-of-the-art techniques such as dynamic re-reference interval prediction (DRRIP) and the protecting distance policy (PDP).
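For orientation, the baseline that the summary refers to can be sketched as follows. This is a minimal Python sketch of plain tree-based PseudoLRU only, not the paper's insertion and promotion scheme; the class name `TreePLRU`, the array layout of the tree, and the bit convention (0 means the next victim lies in the left subtree) are assumptions made for illustration. A set with k ways needs k - 1 bits under this layout.

```python
class TreePLRU:
    """Plain tree-based PseudoLRU state for one cache set (illustrative only)."""

    def __init__(self, ways=16):
        # The implicit binary tree needs a power-of-two number of ways.
        assert ways >= 2 and ways & (ways - 1) == 0
        self.ways = ways
        # One bit per internal node: 0 = victim is on the left, 1 = on the right.
        self.bits = [0] * (ways - 1)

    def victim(self):
        """Follow the bits from the root to a leaf and return that way."""
        node = 0
        while node < self.ways - 1:
            node = 2 * node + 1 + self.bits[node]
        return node - (self.ways - 1)

    def touch(self, way):
        """On a hit or fill, point every bit on the path away from `way`."""
        node = way + self.ways - 1
        while node > 0:
            parent = (node - 1) // 2
            came_from_right = node == 2 * parent + 2
            self.bits[parent] = 0 if came_from_right else 1
            node = parent


plru = TreePLRU(ways=4)
order = []
for _ in range(4):
    way = plru.victim()
    order.append(way)
    plru.touch(way)
print(order)  # [0, 2, 1, 3]
```

Repeatedly evicting and refilling the victim in a 4-way set visits ways 0, 2, 1, 3 in this sketch, which shows how the tree spreads replacements across both halves of the set.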

Topics

PseudoLRU, Dynamic Re-reference Interval Prediction, Set Dueling, Least Recently Used, Cache Replacement Policies, Cache Block, Last-level Cache, State-of-the Art Replacement Policies, Power-delay Products, Cache Replacement Algorithms

64 Citations

Performance and area aware replacement policy for GPU architecture

Fatemeh Kazemi Hassan Abadi, Saeed Safari

Computer Science, Engineering

2014 4th International Conference on Computer and…

2014

This work proposes using the older tree-based PLRU policy in two-level GPU caches, reporting performance that matches or exceeds LRU, together with high-accuracy profiling logic and cache-partitioning hardware for the scheme.

2 citations, Highly Influenced

Block value based insertion policy for high performance last-level caches

Lingda Li, Junlin Lu, Xu Cheng

Computer Science

ICS '14

31 References

Insertion policy selection using Decision Tree Analysis

S. Khan, Daniel A. Jiménez

Computer Science

2010 IEEE International Conference on Computer…

2010

This work uses decision tree analysis of multi-set-dueling to choose the optimal insertion position in the LRU stack, which reduces misses by 5.16% and achieves 7.19% IPC improvement over LRU.

Adaptive insertion policies for high performance caching

Moinuddin K. Qureshi, A. Jaleel, Y. Patt, S. Steely, J. Emer

Computer Science

ISCA '07

2007

This work proposes a Dynamic Insertion Policy (DIP) that chooses between the Bimodal Insertion Policy (BIP) and the traditional LRU policy depending on which incurs fewer misses, and shows that DIP reduces the average MPKI of the baseline 1MB 16-way L2 cache by 21%, bridging two-thirds of the gap between LRU and OPT.
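The choice between the two policies is commonly made by set dueling: a few leader sets always use one policy, a saturating counter tracks which leaders miss more, and the remaining follower sets adopt the current winner. The following is a minimal Python sketch of that idea, not the authors' implementation; the class name `DIPCache`, the leader-set layout (`idx % 32`), the 10-bit counter, and the 1/32 probability of inserting at the MRU position are illustrative assumptions.

```python
import random


class DIPCache:
    """Toy set-associative cache choosing LRU or BIP insertion by set dueling."""

    def __init__(self, num_sets=64, ways=4, psel_bits=10, epsilon=1 / 32, seed=0):
        self.num_sets, self.ways = num_sets, ways
        self.sets = [[] for _ in range(num_sets)]  # each list is ordered LRU -> MRU
        self.psel_max = (1 << psel_bits) - 1
        self.psel = self.psel_max // 2             # saturating policy selector
        self.epsilon = epsilon                     # assumed BIP MRU-insert probability
        self.rng = random.Random(seed)
        self.misses = 0

    def _leader(self, idx):
        # Hypothetical leader assignment: one LRU and one BIP leader per 32 sets.
        if idx % 32 == 0:
            return "LRU"
        if idx % 32 == 1:
            return "BIP"
        return None

    def access(self, addr):
        idx = addr % self.num_sets
        lines = self.sets[idx]
        if addr in lines:
            lines.remove(addr)
            lines.append(addr)                     # hit: promote to MRU
            return True
        self.misses += 1
        leader = self._leader(idx)
        if leader == "LRU":                        # a miss under LRU favors BIP
            self.psel = min(self.psel + 1, self.psel_max)
        elif leader == "BIP":                      # a miss under BIP favors LRU
            self.psel = max(self.psel - 1, 0)
        # Leaders keep their fixed policy; followers take the current winner.
        policy = leader or ("BIP" if self.psel > self.psel_max // 2 else "LRU")
        if len(lines) == self.ways:
            lines.pop(0)                           # evict the LRU line
        if policy == "BIP" and self.rng.random() >= self.epsilon:
            lines.insert(0, addr)                  # BIP: usually insert at LRU position
        else:
            lines.append(addr)                     # LRU (or rare BIP case): insert at MRU
        return False


cache = DIPCache()
for _ in range(200):
    for addr in range(64 * 6):                     # cyclic working set larger than the cache
        cache.access(addr)
print(cache.psel > cache.psel_max // 2)            # selector has moved toward BIP
```

On a cyclic working set that exceeds the cache, the LRU leaders miss on every access while the BIP leaders retain part of the working set, so the selector drifts toward BIP and the follower sets start to see hits.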

715 citations, Highly Influential

Counter-Based Cache Replacement and Bypassing Algorithms

Mazen Kharbutli, Yan Solihin

Computer Science

IEEE Transactions on Computers

2008

This work proposes a counter-based approach to cache pollution that predicts lines that have become dead and replaces them early from the L2 cache, and that identifies never-reaccessed lines. Each line is augmented with an event counter that is incremented when an event of interest, such as certain cache accesses, occurs.

Cache replacement based on reuse-distance prediction

G. Keramidas, Pavlos Petoumenos, S. Kaxiras

Computer Science

2007 25th International Conference on Computer…

2007

This work proposes to directly predict reuse distances via instruction-based (PC) prediction and to use this information for cache-level optimizations, and evaluates a reuse-distance-based replacement policy for the L2 cache using a subset of the most memory-intensive SPEC2000 benchmarks.

Improving Cache Management Policies Using Dynamic Reuse Distances

Nam Duong, Dali Zhao, Taesu Kim, Rosario Cammarota, M. Valero, A. Veidenbaum

Computer Science

2012 45th Annual IEEE/ACM International Symposium…

2012

This work proposes a new way to use dynamic reuse distances to further improve cache management policies: a cache line is protected from replacement until a certain number of accesses to its cache set, called the Protecting Distance (PD), has occurred.
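The protection rule can be illustrated with a small Python sketch of a single cache set. It is a simplification under stated assumptions, not the paper's design: the class name `ProtectedSet`, the fixed PD value, the per-line remaining-distance counter `rpd`, and the choice to bypass the cache when every resident line is still protected are all hypothetical here.

```python
class ProtectedSet:
    """One cache set whose lines are protected for `pd` accesses to the set."""

    def __init__(self, ways=4, pd=6):
        self.ways, self.pd = ways, pd
        self.rpd = {}  # tag -> remaining protecting distance

    def access(self, tag):
        # Every access to the set ages all resident lines by one step.
        for t in self.rpd:
            self.rpd[t] = max(self.rpd[t] - 1, 0)
        if tag in self.rpd:
            self.rpd[tag] = self.pd          # hit: renew protection
            return "hit"
        if len(self.rpd) < self.ways:
            self.rpd[tag] = self.pd          # free way available
            return "fill"
        unprotected = [t for t, r in self.rpd.items() if r == 0]
        if not unprotected:
            return "bypass"                  # assumption: do not insert at all
        del self.rpd[unprotected[0]]         # replace a line whose protection expired
        self.rpd[tag] = self.pd
        return "replace"


s = ProtectedSet(ways=2, pd=3)
print([s.access(t) for t in "abcad"])
# ['fill', 'fill', 'bypass', 'hit', 'replace']
```

In the short trace, `c` is bypassed because both resident lines are still protected, `a` hits and renews its protection, and `d` then replaces `b`, whose protection has expired.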

A Case for MLP-Aware Cache Replacement

Moinuddin K. Qureshi, Daniel N. Lynch, O. Mutlu, Y. Patt

Computer Science

33rd International Symposium on Computer…

2006

Evaluations with the SPEC CPU2000 benchmarks show that MLP-aware cache replacement can improve performance by as much as 23%. A novel, low-hardware-overhead mechanism called sampling-based adaptive replacement (SBAR) is proposed to dynamically choose between an MLP-aware and a traditional replacement policy, depending on which one is more effective at reducing the number of memory-related stalls.

Sampling Dead Block Prediction for Last-Level Caches

S. Khan, Yingying Tian, Daniel A. Jiménez

Computer Science

2010 43rd Annual IEEE/ACM International Symposium…

2010

This paper introduces sampling dead block prediction, a technique that samples program counters (PCs) to determine when a cache block is likely to be dead, and shows how this technique can reduce the number of LLC misses over LRU and be used to significantly improve a cache with a default random replacement policy.

PIPP: promotion/insertion pseudo-partitioning of multi-core shared caches

Yuejian Xie, G. Loh

Computer Science, Engineering

ISCA '09

2009

This work proposes a new cache management approach that combines dynamic insertion and promotion policies to provide the benefits of cache partitioning, adaptive insertion, and capacity stealing all with a single mechanism.

329 citations, Highly Influential

Cache bursts: A new approach for eliminating dead blocks and increasing cache efficiency

Haiming Liu, M. Ferdman, Jaehyuk Huh, D. Burger

Computer Science

2008 41st IEEE/ACM International Symposium on…

2008

This paper proposes a new class of dead-block predictors that predict dead blocks based on bursts of accesses to a cache block, and evaluates three ways to increase cache efficiency by eliminating dead blocks early: replacement optimization, bypassing, and prefetching.

The ZCache: Decoupling Ways and Associativity

Daniel Sánchez, Christos Kozyrakis

Computer Science

2010 43rd Annual IEEE/ACM International Symposium…

2010

The zcache is presented, a cache design that allows much higher associativity than the number of physical ways, and it is shown that zcaches provide higher performance and better energy efficiency than conventional caches without incurring the overheads of designs with a large number of ways.

...
