Problem Memory Partition Algorithm

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

IEEE

Fully Tensorized GPU-Accelerated Multi-Population Evolutionary Algorithm for Constrained Multiobjective Optimization Problems

Abstract: Real-world constrained multiobjective optimization problems (CMOPs) are prevalent and often come with stringent time-sensitive requirements. However, most contemporary constrained ...

IEEE

Learning-Aided Evolutionary Algorithm for Solving Energy-Minimized Deadline-Constrained Task Scheduling Problem in Human-Cyber-Physical Systems

Abstract: This work addresses an energy-minimized deadline-constrained task scheduling problem in human-cyber-physical systems. It consists of three subproblems: processor allocation, task sequencing, ...

News Medical

MERLIN algorithm unlocks immune cell location memory in organs

A new AI-based method reconstructs spatial information about where immune cells were originally located in an organ, even after these cells have been removed from the tissue and analyzed individually.

Reuters

AI's memory chip champion has a value problem

LONDON, Feb 20 (Reuters Breakingviews) - Not long ago, memory chip makers were in crisis. A post-pandemic supply glut in 2023 pushed prices into freefall, wiping out operating profits across the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results