Many of the optimizations I had in mind for this language are already present in CC65 (such as static location evaluation). The only real difference is that my language would try to prevent you from shooting yourself in the foot (with regard to performance), whereas CC65 ensures that what you express in C will (eventually) happen on the CPU.
For now I am putting the brakes on this language idea and seeing how far I can go with CC65.
Thanks for all of the input!