Change function calling to use a more complex decision making process. This should both reduce token consumption & improve function selection accuracy. Context trimming only affects function selection, full context is still always used when forming responses.