Profiling/Tracing PX4 on POSIX

I want to analyze the performance of PX4 stack running on snapdragon flight. Which tools can I use?
Since, I am getting some I/O bound inefficiency, I want to get some information regarding scheduling various PX4 threads.

Thanks,
Jenil Jain