Channel: User HoldOffHunger - Stack Overflow


Answer by HoldOffHunger for How does the Linux command `file` recognize the encoding of my files?

September 24, 2022, 12:08 pm


TLDR: Magic File Doesn't Support UTF-8 BOM Markers

(and that's the main charset you need to care about)

The source code is on GitHub, so anyone can search it. After a quick search, strings like BOM, ef bb bf, and feff do not appear at all. That means reading the UTF-8 Byte Order Mark is not supported. Files made in other applications that use or preserve the BOM will all be reported as "charset=unknown" by file.

In addition, none of the config files mentioned in the magic file manpage are part of magic file v. 4.17. In fact, /etc/magicfile/ doesn't exist at all, so I don't see any way to configure it.

If you're stuck trying to get the actual charset encoding and magic file is all you have, you can check whether you have a BOM-marked UTF-8 file at the Linux CLI with:

hexdump -n 3 -C "$path_to_filename"

If the output shows the byte sequence ef bb bf, then you are 99% likely in possession of a BOM-marked UTF-8 file. This is not a 100% certainty, but it is far more useful than magic file, which has no handling whatsoever for Byte Order Marks.
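If you need this check in a script rather than by eye, here is a minimal sketch of the same idea. The function name has_utf8_bom is my own invention for illustration, not something that ships with file or hexdump, and it uses od in place of hexdump only because od's output is easier to compare as a string. Keep in mind that a missing BOM does not mean the file is not UTF-8, since a BOM is optional in UTF-8.

```shell
#!/bin/sh
# Hypothetical helper (not part of file/libmagic or hexdump):
# exits 0 if the file starts with the UTF-8 BOM bytes ef bb bf, 1 otherwise.
has_utf8_bom() {
    # Dump only the first 3 bytes as hex, without the address column,
    # then strip spaces and newlines so the result is a bare hex string.
    first_bytes=$(od -An -tx1 -N3 "$1" | tr -d ' \n')
    [ "$first_bytes" = "efbbbf" ]
}

# Example usage (path_to_filename is whatever file you want to check):
#   if has_utf8_bom "$path_to_filename"; then
#       echo "starts with a UTF-8 BOM"
#   fi
```

Because it returns an exit status instead of printing bytes, it can go straight into an if or a find loop.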
