by iacguy on 1/25/25, 9:35 PM with 3 comments
by fragmede on 1/25/25, 9:40 PM
by hajimuz on 1/26/25, 8:06 AM
by inhumantsar on 1/25/25, 10:57 PM
at my last place we relied heavily on APM traces in prod and kept the sampling rate relatively low. you can add metadata and send custom trace entries to take the place of most log entries. likewise the most valuable metrics could be derived from APM data easily enough. we kept shipping the must-have logs and custom metrics but their volume was surprisingly low once the noise was gone.
also, I don't think I'd ever let datadog slurp everything from AWS cloudwatch again. the amount of pure useless noise there far outweighs the value. a few targeted collectors from services like WAF and AWS SSO were all we really needed in the end.
TL;DR: be ruthless and cut cut cut. you might be surprised at just how much is never used. also, avoid letting datadog integrate with lambda or ECS, since those come with a per-unit cost which is fairly extravagant.