
Binary Files Corrupted During Upload or Processing

Files uploaded to or processed by your application come out corrupted. PDFs are unreadable, images show as broken, ZIP archives can't be extracted, and Excel files fail to open. The files may appear to be roughly the right size, but their contents are garbled.

Binary file corruption typically happens when the application treats binary data as text (applying encoding transformations that destroy the data), when streams are not properly piped, or when files are read and written with mismatched encoding settings.

This issue is particularly insidious because it may only affect certain file types or file sizes, making it hard to reproduce consistently during testing.
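
To make the "binary treated as text" failure concrete, here is a minimal Python sketch. It assumes a handler that decodes incoming bytes as UTF-8 and re-encodes them before saving; the function name text_round_trip is hypothetical and only simulates that mistake on the 8-byte PNG signature.

```python
# The first 8 bytes of every PNG file. 0x89 is not a valid UTF-8 start byte.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def text_round_trip(data: bytes) -> bytes:
    """Simulate a handler that wrongly treats binary data as UTF-8 text."""
    # Invalid byte sequences are replaced with U+FFFD, which loses the original byte.
    as_text = data.decode("utf-8", errors="replace")
    return as_text.encode("utf-8")


corrupted = text_round_trip(PNG_SIGNATURE)
print("original :", PNG_SIGNATURE.hex())
print("corrupted:", corrupted.hex())
print("identical:", corrupted == PNG_SIGNATURE)
```

The single invalid byte comes back as the three-byte encoding of U+FFFD, so the header no longer matches what an image decoder expects. Files made up only of ASCII bytes survive this round trip unchanged, which is one reason the problem can show up for some file types and not others.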

Error Messages You Might See

Error: Invalid PDF structure
The file is corrupted and cannot be opened
Error: incorrect header check (zlib)
CairoError: invalid image surface
Corrupted ZIP: unexpected end of data

Common Causes

  • UTF-8 encoding applied to binary data — Reading binary files with encoding: 'utf-8' instead of as a Buffer or bytes object
  • Base64 double-encoding — File data is base64 encoded during upload and encoded again during storage, so decoding it once yields base64 text instead of the original bytes
  • Multipart parsing misconfigured — The file upload middleware converts binary data to strings instead of preserving raw buffers
  • Stream not set to binary mode — HTTP response or file stream uses text mode instead of binary mode
  • Chunked transfer truncation — Large files are cut off because the receiving end closes the connection before all chunks arrive
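
The base64 double-encoding cause in the list above can be reproduced with the standard library alone. This is a sketch under an assumed pipeline (a client that encodes once and a storage layer that encodes again); the payload bytes are made up for illustration.

```python
import base64

# Illustrative payload: a PDF-like header followed by bytes that are not valid text.
original = b"%PDF-1.7\n\x00\xff\xfe binary payload"

# Assumed pipeline: the client encodes once, then the storage layer encodes again.
encoded_once = base64.b64encode(original)
encoded_twice = base64.b64encode(encoded_once)

# A reader that decodes only once gets base64 text back, not the file.
decoded_once = base64.b64decode(encoded_twice)
print("decoded once equals original:", decoded_once == original)

# Decoding as many times as the data was encoded restores the original bytes.
restored = base64.b64decode(decoded_once)
print("decoded twice equals original:", restored == original)
```

If a stored file opens in a text editor as a long run of letters, digits, "+", "/" and "=", an extra encoding layer like this is a likely suspect.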

How to Fix It

  1. Always use binary mode for file operations — In Node.js, call fs.readFile without an encoding option so it returns a Buffer; in Python, open files with 'rb' for reading and 'wb' for writing
  2. Verify multipart parsing configuration — Ensure multer, busboy, or your upload library preserves raw binary buffers
  3. Test with actual binary files — Upload a real PDF, image, and ZIP file and verify they open correctly after download. Compare checksums of the original and downloaded copies
  4. Use streams correctly — Pipe readable streams directly to writable streams without intermediate string conversion
  5. Check Content-Type headers — Ensure the response sets application/octet-stream or the correct MIME type, not text/plain
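
Steps 1 and 4 above can be sketched together in Python: open both ends in binary mode and stream chunks from reader to writer with no string conversion in between. The helper names copy_binary and demo are hypothetical, and the 64 KiB chunk size is an arbitrary choice.

```python
import shutil
import tempfile
from pathlib import Path


def copy_binary(src: Path, dst: Path, chunk_size: int = 64 * 1024) -> None:
    """Copy src to dst byte for byte, streaming in fixed-size chunks."""
    # 'rb' and 'wb' keep the data as bytes; no encoding is ever applied.
    with open(src, "rb") as reader, open(dst, "wb") as writer:
        shutil.copyfileobj(reader, writer, chunk_size)


def demo() -> bool:
    """Round-trip a payload containing every byte value and check it survives."""
    payload = bytes(range(256)) * 1024
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "source.bin"
        dst = Path(tmp) / "copy.bin"
        src.write_bytes(payload)
        copy_binary(src, dst)
        return dst.read_bytes() == payload


print("copy is byte-identical:", demo())
```

Streaming in chunks keeps memory use flat for large files, and a payload containing all 256 byte values is a quick way to catch any step that quietly treats the data as text.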

Frequently Asked Questions

Why are my uploaded files corrupted?

The most common cause is reading binary files as text (UTF-8). Ensure your upload handler preserves raw binary buffers. In Node.js, don't pass an encoding option to fs.readFile for binary files.

How do I verify file integrity after upload?

Compare MD5 or SHA256 checksums of the original file and the stored file. If they differ, the data was modified during transit or storage. Use crypto.createHash in Node.js or hashlib in Python.
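
A minimal sketch of that checksum comparison using hashlib in Python. The helper names sha256_of_file, files_match, and demo are hypothetical; the demo writes one intact copy and one copy damaged by a UTF-8 text round trip to show both outcomes.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1024 * 1024) -> str:
    """Return the SHA-256 hex digest of a file, read in binary chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def files_match(original: Path, stored: Path) -> bool:
    """True when both files have the same SHA-256 digest."""
    return sha256_of_file(original) == sha256_of_file(stored)


def demo() -> tuple:
    """Compare an original against an intact copy and a text-mangled copy."""
    data = b"\x89PNG\r\n\x1a\n" + bytes(range(256))
    with tempfile.TemporaryDirectory() as tmp:
        original = Path(tmp) / "original.bin"
        intact = Path(tmp) / "intact.bin"
        damaged = Path(tmp) / "damaged.bin"
        original.write_bytes(data)
        intact.write_bytes(data)
        # Simulate corruption: decode as UTF-8 text, then write it back out.
        damaged.write_bytes(data.decode("utf-8", errors="replace").encode("utf-8"))
        return files_match(original, intact), files_match(original, damaged)


print(demo())
```

Running the same comparison at each hop (client, upload handler, storage, download) narrows down which stage alters the bytes.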
