Mali Bifrost - Cache Clean

2 minute read

What Invokes Cache Clean?

  • When the power state changes (see kbase_pm_l2_update_state() / kbase_pm_shaders_update_state())
  • When a job completes (see jd_done_worker()) – unlikely to happen
  • When the GPU context is switched (see kbase_js_pull() / kbase_js_unpull()) – unlikely to happen

mali_kbase_jm_rb.c

1633 void kbase_backend_complete_wq(struct kbase_device *kbdev,
1634                         struct kbase_jd_atom *katom)
1635 {
1636     /*
1637      * If cache flush required due to HW workaround then perform the flush
1638      * now
1639      */
1640     kbase_backend_cache_clean(kbdev, katom);
1641 }

mali_kbase_device_hw.c

 872 void kbase_gpu_start_cache_clean_nolock(struct kbase_device *kbdev)
 873 {
 874     u32 irq_mask;
 875 
 876     lockdep_assert_held(&kbdev->hwaccess_lock);
 877 
 878     if (kbdev->cache_clean_in_progress) {
 879         /* If this is called while another clean is in progress, we
 880          * can't rely on the current one to flush any new changes in
 881          * the cache. Instead, trigger another cache clean immediately
 882          * after this one finishes.
 883          */
 884         kbdev->cache_clean_queued = true;
 885         return;
 886     }
 887 
 888     /* Enable interrupt */
 889     /** EE("GPU_IRQ_MASK - CLEAN_CACHES_COMPLETED"); */
 890     irq_mask = kbase_reg_read(kbdev, GPU_CONTROL_REG(GPU_IRQ_MASK));
 891     kbase_reg_write(kbdev, GPU_CONTROL_REG(GPU_IRQ_MASK),                                                                                                
 892                 irq_mask | CLEAN_CACHES_COMPLETED);
 893 
 894     KBASE_TRACE_ADD(kbdev, CORE_GPU_CLEAN_INV_CACHES, NULL, NULL, 0u, 0);
 895     kbase_reg_write(kbdev, GPU_CONTROL_REG(GPU_COMMAND),
 896                     GPU_COMMAND_CLEAN_INV_CACHES);
 897 
 898     kbdev->cache_clean_in_progress = true;
 899 }

Besides, the device driver configures the job slot to perform a cache clean and/or invalidate before and after the job executes, if required. This configuration is written right before the job chain is put on the slot. Although the device driver performs the write, the configuration is in fact dictated by the user-space app/runtime, which passes it in the atom structure as core_req.

PM Policy

mali_kbase_pm_policy.c

   32 static const struct kbase_pm_policy *const all_policy_list[] = {
   33 #ifdef CONFIG_MALI_NO_MALI
   34     &kbase_pm_always_on_policy_ops,
   35     &kbase_pm_coarse_demand_policy_ops,
   36 #if !MALI_CUSTOMER_RELEASE
   37     &kbase_pm_always_on_demand_policy_ops,
   38 #endif
   39 #else               /* CONFIG_MALI_NO_MALI */
   40     &kbase_pm_coarse_demand_policy_ops,
   41 #if !MALI_CUSTOMER_RELEASE
   42     &kbase_pm_always_on_demand_policy_ops,
   43 #endif  
   44     &kbase_pm_always_on_policy_ops
   45 #endif /* CONFIG_MALI_NO_MALI */
   46 };

The device driver manages the GPU power state by continuously reading the state from the GPU and updating it. For instance, if there are no in-flight jobs, the device driver tries to turn off the shader cores, and consequently the L2/tiler, to save power. The “pm_always_on” policy guarantees no power-related register I/O at run time.

GPU Protected Mode

  • The L2 must be powered down and the GPU must come out of fully coherent mode before entering protected mode.
  • When entering protected mode, we must also ensure that the GPU is not operating in coherent mode. This is to ensure that no protected memory can be leaked.

From the comments in the source code, I guess that protected mode prevents the data leakage that would otherwise be possible through cache coherency/flushing, but I could not find a caller that enters it.