3 Sources
[1]
I built a Python utility using Claude to automate my image editing workflow, and it saves me hours every week
Abhinav pivoted from a career in banking to pursue his first love in writing. Even while working full-time, he continued contributing as an editor-at-large, a role he has held for more than 7 years. A lifelong tech enthusiast who has built three gaming and productivity powerhouse PCs since 2018, he follows the semiconductor industry closely, from NVIDIA and AMD to ARM. His MSc dissertation explored how artificial intelligence will reshape the future of work, reflecting his curiosity about the wider social impact of emerging technologies.
Before the rise of generative AI tools, fixing a batch of images was messy business. It meant bouncing between three browser tabs, logging into tools you had forgotten you had an account on, and occasionally being asked for your credit card information before you could download the files you'd spent a while working on. It was a tedious affair for what is actually quite a simple task.
Things are a little different now. Since the rise of Anthropic's Claude, building a custom utility tailored to a specific workflow has been reduced to a couple of prompts, provided you know what you're doing and have a little patience for the process. Naturally curious as I am, I decided to check if I could build a Python tool that takes any image and processes it to the exact specifications I need, regardless of how sloppy a photographer I was that day. Here's how I put the tool together, and what it took to get there.
It began with the vision of a tool
Around three specifications, and one very specific frustration
It may surprise no one, but my workflow involves processing a significant volume of images for work: screenshots, product images, reference captures, benchmarks... you get the idea. Anything that's fit for publication needs to meet a set of criteria before it's usable anywhere on the internet, including a minimum height of 1080 pixels (1080p) and, ideally, a 16:9 aspect ratio.
Neither of those requirements is particularly complicated to fulfill, but doing both, consistently, across a batch of files in various formats, using tools that weren't built with that specific combination in mind, is exactly the kind of trouble that keeps me migrating from Microsoft's awful Photos app to Canva, and from Canva to whatever image upscaling utility isn't looking to make a quick $2.99 that day.
So, naturally, I found myself daydreaming about a drag-and-drop utility that accepts WEBP, JPG, JPEG, and PNG files, automatically upscales them to 1080p whenever the height doesn't meet the threshold, and applies intelligent automatic cropping to 16:9. Perhaps the most underrated aspect of being in the era of vibe-coding is that sometimes, a description like that is the first step towards building the solution. The next, and final, step is to find a model that can deliver it.
I took my "brief" to Claude
And it understood exactly what I was looking for... eventually
The prompt that I delivered was barely any more structured than the description above. It included the three key requirements, a note about the file formats I needed supported, and a preference for a drag-and-drop interface over a file picker. Claude mapped out the approach first, which included using Tkinter for the GUI, Pillow for image processing, and the TkinterDnD2 library to handle drag-and-drop natively on my Windows 11 system. The logic flow checked out, and I asked Opus 4.8 to proceed.
The first Python utility arrived much quicker than I had anticipated, and after working through a few (or a dozen) setup issues, it ran exactly as intended. The upscaling was handled by Lanczos resampling, which is a high-quality interpolation method for resizing visual media.
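The two rules described above (upscale when the height is below 1080, then crop to 16:9) can be sketched roughly as follows. This is a minimal illustration, not the utility Claude generated: the function names are hypothetical, and a plain center crop stands in for whatever "intelligent" cropping the real tool performs. Only the Pillow calls (`Image.open`, `resize` with `Image.Resampling.LANCZOS`, `crop`, `save`) are real library API, and they assume Pillow 9.1 or newer.

```python
MIN_HEIGHT = 1080   # minimum publishable height from the article
ASPECT = (16, 9)    # target aspect ratio from the article


def upscaled_size(width, height, min_height=MIN_HEIGHT):
    """Return (w, h) scaled up proportionally so that h >= min_height."""
    if height >= min_height:
        return width, height
    return round(width * min_height / height), min_height


def center_crop_box(width, height, aspect=ASPECT):
    """Return a (left, top, right, bottom) box with the target aspect ratio.

    Assumption: a simple center crop, not content-aware cropping.
    """
    aw, ah = aspect
    if width * ah > height * aw:        # too wide: trim the sides
        new_w = round(height * aw / ah)
        left = (width - new_w) // 2
        return left, 0, left + new_w, height
    new_h = round(width * ah / aw)      # too tall (or already exact)
    top = (height - new_h) // 2
    return 0, top, width, top + new_h


def process_image(src, dst):
    """Upscale with Lanczos if needed, then center-crop to 16:9."""
    from PIL import Image               # lazy import; requires Pillow
    with Image.open(src) as img:
        size = upscaled_size(*img.size)
        if size != img.size:
            img = img.resize(size, Image.Resampling.LANCZOS)
        img.crop(center_crop_box(*img.size)).save(dst)
```

Keeping the size arithmetic separate from the Pillow calls makes the rules easy to check without any image files. Note that saving an image with transparency to a JPEG path would need a conversion to RGB first, which this sketch leaves out.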
For most use-cases, Lanczos resampling is more than adequate. I was, however, still curious to see if it could be improved, and asked Claude whether there was a better upscaler available, perhaps something closer to what modern AI tools use under the hood. Claude came back with a suggestion to use Real-ESRGAN instead, which is exactly why I didn't stop there.
The first upscaler was fine, but the second one was seriously impressive
Lanczos got the job done, but R-ESRGAN made me stop and look twice
In place of Lanczos, Claude proposed using Real-ESRGAN, an open-source model that reconstructs image details using a deep neural network trained on degraded image pairs. In that way, it functions differently from a conventional interpolation method. This is the one detail that piqued my interest. While Lanczos itself is a strong algorithm, providing clean and sharp results for most upscaling tasks, asking it to recover a low-resolution image means relying on interpolation between existing pixels, which is essentially educated guesswork about the missing details. R-ESRGAN, on the other hand, recognizes textures, edges, and structures and rebuilds the image from there.
To test both techniques, I deliberately compressed the test image down to 500x272 pixels. The Lanczos output scaled up as expected but carried the compression noise with it, producing an image that was large but retained all the original's limitations. The Real-ESRGAN output did a far more impressive job by reconstructing the foliage texture, sharpening the panel lines on the car, and recovering road details.
The one area where it obviously struggled was the finer details: the logo on the controller, the text on the all-in-one cooling unit, and the license plate on the car all came back as reconstructed noise rather than legible text, a known limitation of most upscalers when the source detail is too degraded to recover precisely. In practice, of course, it isn't reasonable to expect ground-breaking upscaling from such a low-resolution source image. Obviously, this wasn't done for editorial use, but rather as a stress test for the upscalers. The images did end up making it to at least one publication, however!
Vibe coding your everyday annoyances away is severely underrated
As someone with several folders full of vibe-coded utilities that each solve a distinct problem in my workflow, I absolutely see the allure and the merit in the approach. Anyone with a clearly defined problem can now arrive at a working solution. Charles Kettering once said that a problem well-stated is a problem half-solved, and I can't think of a philosophy that applies more neatly to vibe coding than that. The other half, as I learned across a dozen setup errors and one stubborn library, is patience. That being said, the barrier to building something useful has never been lower, and for me, that's something worth getting excited about.
[2]
Claude's no-code canvas replaces hours of Python debugging in minutes
Aggy is a veteran writer and editor in the technology and gaming space. He has served as a Managing Editor for high-traffic digital publications and as an editor and consultant for over a dozen sites. Aggy's published work spans a wide and respected array of tech and gaming outlets, including WePC, Screen Rant, How-To Geek, Android Police, PC Invasion, and Try Hard Guides. Beyond editorial work, Aggy's direct experience in the tech sphere extends to app development. Aggy has published two games under Tales and is always eager to learn and do more. He also likes working on computers and researching in his spare time. He knows about Windows, Linux, Audio, Video, and much more.
Cleaning massive, disorganized spreadsheets or parsing through thousands of lines of raw server logs is annoying. You can do it yourself, make a program to do it, or you can just give it to Claude and ask it to fix your problems. Claude has a built-in execution canvas that handles smaller tasks. It's a sandboxed processing environment that lives right inside your chat window, so you can drop in files and use plain language to make the fixes you need. This is one of the ways Claude works better than usual.
Claude is a lot more reliable than you would think
Claude has a built-in canvas that builds apps and visuals from text
I like Claude's no-code interface because it means anyone can work with it without knowing how to code. Instead of opening a terminal or writing SQL queries, you just describe what you want, and the interface handles the rest. This works because there's a full code execution engine running directly inside the chat window. You describe a goal in plain English, making sure to be specific about what you want, and the system writes and runs the code behind the scenes. You end up seeing the result in the chat, usually exactly as you wanted it. There's no Python environment to configure, no matplotlib syntax to remember, and no pandas documentation to dig through.
It is great for people who have a simple project to get through and just want to see a final result, or those who want to see a prototype of their idea before they build it. Luckily, this is better than many other AI code builders, because it lets you see what you're asking for. When you send a message, Claude figures out what's needed and runs everything inside a secure sandboxed container. You never see a terminal. You never deal with dependency errors. The background work happens entirely out of sight.
I am pretty used to asking for code, copying it into my editor, running it, hitting an error, pasting the stack trace back into the chat, and doing it all over again. The new interface is much better. Unlike the regular chatbot interface, this isn't a good way to have it teach you how to code, but it's a good way to have code written. Claude runs its own code, reads its own error logs, fixes what broke, and keeps going until it has something worth showing you. If a data pipeline fails mid-run or a chart doesn't render correctly, Claude patches it on its own before you ever see the result.
Using it is pretty simple
Drop your files into the chat to get instant results
You can drag your data files straight into Claude's chat window to handle them. It handles messy Excel spreadsheets, CSVs, JSON, plain text server logs, and PDFs. You can upload up to twenty files per conversation, at up to thirty megabytes each. Once they're uploaded, just describe what you want done in plain English. You don't need to wrangle boilerplate code, import libraries, or build regex patterns just to parse a log file. Tell it to clean up a disorganized marketing spreadsheet, pull specific error codes from a server log, or merge several data sources into one table. It might feel like cheating, but it's writing the same code you would; you're just not the one typing it. Then, Claude hands your instructions off to a built-in code execution engine running quietly in the background.
Depending on the task, it'll use either a JavaScript environment with libraries like PapaParse and Lodash, or a Python container stocked with pandas, numpy, matplotlib, and friends. It writes, runs, and debugs the scripts itself. You never touch a dependency or a config file. I used to lose hours to pandas type errors alone. Having something else handle that execution loop is a huge relief.
When it's done, the results come back to you directly in the browser. I've seen plenty of charts and heat maps, but I have also had it produce just a clean spreadsheet, a formatted one, or a proper CSV. From there, you can keep modifying it. You can filter by date range, tweak a chart's styling, and reshape the data. The whole thing lets you clean spreadsheets, parse logs, and convert file formats without reading or writing a single line of Python.
This isn't the perfect answer
You still need to watch out for token limits and logic bugs
While Claude's no-code environment is genuinely useful, it would be a lie to say it is perfect. The biggest limitation is data size. The system seems to struggle with datasets larger than its context window, which means it may only partially process what you've uploaded. So you want to avoid heavy Excel spreadsheets, server logs, or text-heavy PDFs. These eat through the available memory fast. When that memory fills up, the system doesn't stop and tell you; it just quietly starts dropping the oldest information to make room for new inputs. This means it forgets earlier files or parameters. If you're trying to analyze hundreds of thousands of rows, the model might only work through a portion of them, producing skewed or incomplete results unless you've pre-segmented the data yourself.
Data privacy is another issue you should take seriously. Uploading company files to a public cloud environment can conflict with corporate compliance policies.
Things that you upload are processed on external servers and may be retained for 30 days or longer if your settings allow the data to be used for model training. That's a serious problem for organizations subject to GDPR, HIPAA, or SOC 2. It's easy to assume these tools are locked down by default, but they're not. You have to check to be sure. Unless your organization is running through an Enterprise tier with Data Processing Agreements and Zero Data Retention in place, dragging a sensitive spreadsheet into the chat window is a potential data breach.
Then there's the subtler issue of hallucinated logic. Claude can generate code that runs without errors but gets the business logic completely wrong. You've got to be really careful with bigger projects. It can miss a requirement, and because nothing triggers an error, the omission goes unnoticed. All of this means you can't treat the AI as a black box that you just trust. You have to review what it produces and adjust it where you can.
Don't use this for every big thing, just little ones
Relying entirely on AI for every workflow isn't the right move. Token limits can cause context issues with heavy datasets, and uploading proprietary company data to a public cloud carries real security risks. If your organization deals with highly sensitive financial records or massive data volumes, a local dev environment is still the better call. That said, if you need to quickly prototype dashboards or clean up messy spreadsheets without wrestling with dependencies, Claude is one of the fastest ways to get it done.
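Pre-segmenting a large file, as the limitations section above suggests, can be done locally before anything is uploaded. The sketch below is one possible approach using only Python's standard `csv` module. The function name `split_csv` and the default of 5,000 rows per part are hypothetical choices for illustration, not limits documented by Anthropic.

```python
import csv
from pathlib import Path


def split_csv(src, out_dir, rows_per_file=5000):
    """Split src into numbered CSV parts, repeating the header in each part.

    rows_per_file is an arbitrary illustrative default, not a Claude limit.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    parts = []
    with open(src, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader, None)
        if header is None:
            return parts                    # empty file: nothing to split
        writer = handle = None
        for i, row in enumerate(reader):
            if i % rows_per_file == 0:      # time to start a new part
                if handle:
                    handle.close()
                name = f"{Path(src).stem}_part{len(parts) + 1:03d}.csv"
                path = out_dir / name
                handle = open(path, "w", newline="", encoding="utf-8")
                writer = csv.writer(handle)
                writer.writerow(header)
                parts.append(path)
            writer.writerow(row)
        if handle:
            handle.close()
    return parts
```

Each part carries its own header row, so it can be analyzed on its own. Comparing the total row count across parts with the original is a cheap way to confirm nothing was dropped along the way.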
[3]
Anthropic made Claude worse for a month -- this is how they got caught
I love using Claude, to the point where I cancelled ChatGPT, Perplexity, and Gemini because Claude did everything I needed. But if you're like me and have been using Claude for a while, you would've noticed the responses feeling sloppier. The model seemed to forget what it was doing mid-task, and code quality dropped massively. When I went looking for answers, none came from Anthropic. Frustrated users kept complaining across platforms, and the company behind Claude remained silent -- until one AMD executive made that silence impossible to maintain.
Users noticed the change before Anthropic did
Benchmark drops, strange behavior, and mounting complaints
The complaints about Claude's degrading quality started trickling in around early March 2026. Developers on Reddit, GitHub, and Hacker News reported that Claude Code -- Anthropic's AI-powered coding tool -- had gotten noticeably worse. The model was reading through code less carefully before making changes, leaving tasks halfway, and producing fixes that were technically correct but an architectural nightmare.
Among these frustrated users was Stella Laurenzo, senior director of AMD's AI group and the engineer who previously built Google's OpenXLA infrastructure. On April 2, she filed a detailed GitHub issue that ended up becoming the starting point for the entire controversy. Her complaint wasn't based on gut feeling either. She and her team had analyzed 6,852 Claude Code sessions covering 17,871 thinking blocks and 234,760 tool calls. What they found was that Claude's median thinking depth had collapsed by roughly 73% since early February. The read-to-edit ratio, a measure of how much Claude studies code before touching it, also fell from 6.6 reads per edit to just 2. Edits made without reading any code first jumped from 6.2% to 33.7%.
The conclusion: Claude cannot be trusted to perform complex engineering tasks. Period.
Period            Thinking Visible    Thinking Redacted
Jan 30 - Mar 4    100%                0%
Mar 5             98.5%               1.5%
Mar 7             75.3%               24.7%
Mar 8             41.6%               58.4%
Mar 10-11         -                   >99%
Mar 12+           0%                  100%
Every senior engineer on her team had independently noticed the same pattern, which made it especially hard to dismiss. She also noticed a clear behavioral shift from Claude being research-first and cautious to being edit-first and hasty. The GitHub issue explains this by saying that when thinking is shallow, the model defaults to the cheapest action available: edit without reading, stop without thinking, dodge responsibility for failure.
The silence became part of the story
Why the lack of communication frustrated users even more
The degraded output is one thing, but Anthropic's response, or rather the lack of it, made things worse. For weeks, the company offered no blog post, no status page update, no email to subscribers, and no formal acknowledgment of the problem. Individual engineers made informal comments on social media, but the company as a whole said nothing meaningful while charging customers $20 to $200 per month for a tool that had suddenly become significantly worse.
In the meantime, third-party benchmark data kept piling on. BridgeMind reported that Claude Opus 4.6's accuracy on their hallucination benchmark had dropped from 88.3% to 68.3%, sending it from second place all the way down to tenth on the leaderboard. The exact methodology behind the test is contested, but it was consistent with the broader narrative that Claude had suddenly become worse, and no one knew why.
One month, multiple problems
The bugs, regressions, and odd behavior that piled up fast
On April 23, Anthropic finally released a detailed report on what was going on.
Long story short, the problems were being caused by three separate product-layer changes that had stacked on top of each other between March and April, each affecting a different part of the user base on a different schedule. The weights themselves never changed, but the infrastructure around them was affected.
The first change happened on March 4, when Anthropic changed Claude Code's default reasoning effort from high to medium. Anthropic claims it was done because high-effort mode was causing the UI to appear frozen during long thinking periods. What actually ended up happening is that a lot of users suddenly started noticing the intelligence drop but didn't know why, and Anthropic had shipped no formal warning in advance. It took until April 7, over a month later, for the company to revert the change.
The second issue was a caching bug that arrived on March 26. Anthropic built an optimization to clear Claude's older reasoning history from sessions that were idle for over an hour. However, a bug in this routine caused the cache clearing to fire on every single turn for the rest of the session, not just once. This meant that every follow-up question you asked reduced Claude's reasoning history, and over time, longer chats started showing the forgetfulness and bizarre tool choices users had been reporting. It also caused a separate wave of complaints about usage limits draining faster than usual.
Last but not least, the third change shipped on April 16 alongside Opus 4.7. Anthropic added a verbosity instruction to Claude Code's system prompt that kept tool calls to less than 25 words and final responses to less than 100 words. It seemed safe during weeks of internal testing, but later investigation showed a 3% quality drop across both Opus 4.6 and Opus 4.7. It was reverted four days later on April 20.
Because each change hit different users at different times, the overall effect looked more like a subtle, inconsistent degradation.
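The caching bug described above, where a cleanup meant to run once after an idle hour instead ran on every later turn, can be pictured with a toy model. This is purely illustrative and is not Anthropic's code: the `Session` class, the `stale` flag, and the idea that a missing reset was the cause are all assumptions made for the sketch. Only the one-hour idle threshold comes from the article.

```python
IDLE_LIMIT = 3600  # seconds; the one-hour idle threshold from the report


class Session:
    """Toy chat session that prunes reasoning history after going idle."""

    def __init__(self, buggy):
        self.buggy = buggy
        self.reasoning = []     # stand-in for retained reasoning history
        self.last_turn = 0.0
        self.stale = False      # hypothetical "session went idle" flag

    def turn(self, now, thought):
        if now - self.last_turn > IDLE_LIMIT:
            self.stale = True
        if self.stale:
            self.reasoning.clear()      # drop older reasoning
            if not self.buggy:
                self.stale = False      # intended: prune once, then resume
        self.reasoning.append(thought)
        self.last_turn = now
        return len(self.reasoning)      # history size after this turn


def simulate(buggy, times=(0, 10, 4000, 4010, 4020)):
    """Run one session over the given timestamps and report history sizes."""
    session = Session(buggy)
    return [session.turn(t, f"thought@{t}") for t in times]
```

In this model, `simulate(False)` shows the history rebuilding after the single idle gap, while `simulate(True)` never holds more than one entry once the gap has occurred, which mirrors the forgetfulness in long chats that users reported.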
The internet did the investigation
How users, benchmarks, and community testing exposed the issues
Stella Laurenzo's GitHub post might be the most significant factor behind Anthropic not only being caught, but also being forced to investigate and fix the issue. She didn't just complain -- she built an analysis pipeline, instrumented her sessions, and produced a data-backed report that Anthropic's internal teams couldn't ignore. It was also detailed and reproducible enough that other engineers could look at their own session logs and recognize the same patterns.
The Hacker News thread debated whether Anthropic's stated rationale for the changes was real or just a cover for cost-cutting. Anthropic's internal staff had also been using a different build of Claude Code than what shipped to paying customers, meaning the dogfooding that's supposed to catch these issues never caught them. I love Claude, but these mistakes were painful.
In its response, Anthropic has committed to fixing both problems. A larger share of internal staff will now use the exact public build. The company will run broader per-model evaluation suites for every system prompt change. And as a gesture of goodwill, Anthropic reset usage limits for all subscribers on April 23. Regardless, it took a senior AMD director building a custom analysis pipeline to force the conversation.
The lesson for anyone depending on a black-box AI service for professional work is clear: if you don't measure your sessions, you may never find out if the tool got quietly worse.
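Measuring sessions in the spirit of the two metrics quoted earlier (reads per edit, and the share of edits made without a prior read) might start from something like the sketch below. It is not Laurenzo's pipeline, whose exact method the article does not describe. The event format, the tool names in `READ_TOOLS` and `EDIT_TOOLS`, and the definition of a "blind" edit as one touching a file not yet read in the same session are all assumptions.

```python
from collections import defaultdict

# Hypothetical tool names; real session logs may label these differently.
READ_TOOLS = {"read", "grep", "glob"}
EDIT_TOOLS = {"edit", "write"}


def session_metrics(events):
    """Compute reads-per-edit and blind-edit share from tool-call events.

    Each event is a dict with "session", "tool", and "path" keys
    (an assumed log format, for illustration only).
    """
    reads = edits = blind_edits = 0
    seen = defaultdict(set)             # session id -> files read so far
    for ev in events:
        tool, sess, path = ev["tool"], ev["session"], ev["path"]
        if tool in READ_TOOLS:
            reads += 1
            seen[sess].add(path)
        elif tool in EDIT_TOOLS:
            edits += 1
            if path not in seen[sess]:  # edited without reading first
                blind_edits += 1
    if edits == 0:
        return {"reads_per_edit": None, "blind_edit_share": None}
    return {
        "reads_per_edit": reads / edits,
        "blind_edit_share": blind_edits / edits,
    }
```

Tracking these two numbers per week would make a shift like the one reported (6.6 reads per edit falling to 2) show up as a trend rather than a hunch.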
Anthropic Claude's no-code canvas transforms Python utility creation, enabling users to automate image editing and data processing workflows in minutes instead of hours. But a month-long performance degradation went unaddressed, with AMD's Stella Laurenzo exposing a 73% drop in thinking depth that eroded trust in Claude Code's engineering capabilities.
Anthropic Claude has emerged as a tool that fundamentally changes how developers approach workflow automation. Users are building custom Python utilities through simple conversational prompts, bypassing the traditional cycle of writing code, debugging errors, and wrestling with dependency issues. One developer automated an entire image editing workflow by describing three specifications to the AI model: automatic upscaling to 1080p, intelligent cropping to 16:9 aspect ratio, and support for multiple file formats including WEBP, JPG, JPEG, and PNG [1].
Source: XDA-Developers
The resulting Python utility leveraged Tkinter for the GUI, Pillow for image processing, and TkinterDnD2 for drag-and-drop functionality. When asked about better upscaling options, Anthropic Claude suggested replacing Lanczos resampling with Real-ESRGAN, an open-source model that reconstructs image details using a deep neural network trained on degraded image pairs. This kind of iterative refinement through natural language represents a shift in how generative AI tools enable code generation without requiring deep technical expertise [1].
Claude's built-in execution canvas handles data processing tasks through a sandboxed environment that lives directly inside the chat window. Users can drop files up to thirty megabytes each, with support for up to twenty files per conversation, including Excel spreadsheets, CSVs, JSON, plain text server logs, and PDFs [2]. The no-code canvas writes, executes, and debugs scripts autonomously, using either a JavaScript environment with libraries like PapaParse and Lodash, or a Python container equipped with pandas, numpy, and matplotlib.
Source: How-To Geek
This approach eliminates the traditional debugging loop where developers copy code into an editor, encounter errors, paste stack traces back into the chat, and repeat the process. Claude runs its own code, reads error logs, and fixes issues before presenting results. For data analysis tasks like cleaning disorganized spreadsheets or parsing server logs, users simply describe their needs in plain English and receive formatted outputs directly in the browser [2].
While Anthropic Claude demonstrated impressive capabilities, a serious quality crisis emerged in early March 2026 when users began reporting degraded performance. Stella Laurenzo, senior director of AMD's AI group and former Google OpenXLA infrastructure engineer, filed a detailed GitHub issue on April 2 after analyzing 6,852 Claude Code sessions covering 17,871 thinking blocks and 234,760 tool calls. Her team discovered that Claude's median thinking depth had collapsed by approximately 73% since early February. The read-to-edit ratio fell from 6.6 reads per edit to just 2, while edits made without reading any code first jumped from 6.2% to 33.7% [3].
Source: MakeUseOf
The performance degradation manifested as the AI model forgetting tasks mid-execution, producing architecturally problematic fixes, and shifting from a research-first approach to an edit-first pattern. BridgeMind reported that Claude Opus 4.6's accuracy on their hallucination benchmark dropped from 88.3% to 68.3%, sending it from second place to tenth on the leaderboard [3].
What amplified user frustration wasn't just the performance degradation itself, but Anthropic's complete silence while charging customers $20 to $200 per month. For weeks, the company offered no blog post, status page update, or formal acknowledgment as complaints mounted across Reddit, GitHub, and Hacker News. Individual engineers made informal social media comments, but no official communication addressed the widespread quality concerns [3].
Anthropic finally released a detailed report on April 23 revealing three separate product-layer changes that had stacked between March and April. The first occurred on March 4, when Claude Code's default reasoning effort changed from high to medium, causing noticeable intelligence drops without advance warning; Anthropic didn't revert it until April 7, over a month after implementation. The second, introduced on March 26, was an optimization meant to clear older reasoning history from sessions idle for over an hour, but a bug made the clearing fire on every turn for the rest of the session, producing forgetfulness in longer chats. The third, shipped on April 16, was a verbosity instruction in Claude Code's system prompt that was reverted on April 20 [3].
The incident raises questions about transparency in AI model deployment and the balance between optimization and user experience. As developers increasingly rely on generative AI tools for workflow automation and Python utility creation, the expectation for consistent performance and clear communication becomes critical for maintaining trust in these systems.
Summarized by
Navi