Merge pull request #204 from RanderWang/dma_trace_apl

trace: refine dma trace algorithm for apl