This repository was archived by the owner on Dec 22, 2021. It is now read-only.

Description
@zeux's example from the meeting (#369), which runs LLVM's MCA tool (llvm-mca) on Godbolt, is great. At the very least, it gives us LLVM's cost analysis and per-port cycle predictions for any given CPU (selected with the -mcpu flag). You can see it here.
If this is scalable and accurate, it could provide a very efficient way to act on @penzn's and @lars-t-hansen's suggestions for performance analyses associated with each PR, without adding any undue burden. In that spirit, which specific chips should we use as targets for our analyses? That way, when we add instructions, we know we're focusing our efforts on performance improvements that help the most users.
Which would you pick for the following four targets?
x86 (32-bit)?
x86_64?
ARM64?
ARMv7+neon?
Specific models or microarchitectures are most helpful when the tooling can report projected uops and port usage for them.
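
To make the per-target idea concrete, here is a minimal sketch of how a script could build one llvm-mca invocation per target. Everything here is an assumption for illustration: the target triples, the CPU names (placeholders, not recommendations, and each would need checking for a usable scheduling model in LLVM), the `kernel.<triple>.s` file naming, and the helper names `mca_command` and `all_commands`. Only the `-mtriple` and `-mcpu` options are real llvm-mca flags.

```python
import shlex

# Hypothetical target matrix. The CPU names are placeholders to show the shape
# of the data, NOT a proposal for which chips to pick.
TARGETS = {
    "x86 (32-bit)": ("i686-unknown-linux-gnu", "haswell"),
    "x86_64": ("x86_64-unknown-linux-gnu", "haswell"),
    "ARM64": ("aarch64-unknown-linux-gnu", "cortex-a57"),
    "ARMv7+neon": ("armv7-unknown-linux-gnueabihf", "cortex-a9"),
}


def mca_command(triple, cpu, asm_path):
    """Return the argv list for running llvm-mca on one assembly file."""
    return ["llvm-mca", f"-mtriple={triple}", f"-mcpu={cpu}", asm_path]


def all_commands():
    """Map each target name to a shell-quoted llvm-mca command line.

    Assumes the assembly was already generated per target and saved under
    the hypothetical name kernel.<triple>.s.
    """
    return {
        name: shlex.join(mca_command(triple, cpu, f"kernel.{triple}.s"))
        for name, (triple, cpu) in TARGETS.items()
    }


if __name__ == "__main__":
    for name, cmd in all_commands().items():
        print(f"{name}: {cmd}")
```

The script only prints the command lines; it does not run llvm-mca, so it works without LLVM installed. Once the group settles on actual chips, swapping the placeholder entries in `TARGETS` would be the only change needed.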