Uniform Type Identifier

Source: Wikipedia, the free encyclopedia.

A Uniform Type Identifier (UTI) is a text string used on software provided by Apple Inc. to uniquely identify a given class or type of item. Apple provides built-in UTIs to identify common system objects – document or image file types, folders and application bundles, streaming data, clipping data, movie data – and allows third party developers to add their own UTIs for application-specific or proprietary uses. Support for UTIs was added in the Mac OS X 10.4 operating system, integrated into the Spotlight desktop search technology, which uses UTIs to categorize documents. One of the primary design goals of UTIs was to eliminate the ambiguities and problems associated with inferring a file's content from its MIME type, filename extension, or type or creator code.[1]

UTIs use a reverse-DNS naming structure. Names may include the ASCII characters A–Z, a–z, 0–9, hyphen ("-"), and period ("."), and all Unicode characters above U+007F.[1] Colons and slashes are prohibited for compatibility with Macintosh and POSIX file path conventions. UTIs support multiple inheritance, allowing files to be identified with any number of relevant types, as appropriate to the contained data. UTIs are case-insensitive.[2]

Background

One of the difficulties in maintaining a user-accessible operating system is establishing connections between data types and the applications or processes that can effectively use such data. For example, a file that contains picture data in a particular compression format can only be opened and processed in applications that are capable of handling picture data, and those applications must be able to identify which compression type was used in order to extract and work with that data. In early computer systems – particularly DOS, its variants, and some versions of Windows – file associations are maintained by file extensions. The three to four character code following a file name instructs the system to open the file in particular applications.

Beginning with System 1,[3] Macintosh operating systems have attached type codes and creator codes as part of the file metadata. These four-character codes were designed to specify both the application that created the file (the creator code) and the specific type of the file (the type code) so that other applications could easily open and process the file data. However, while type and creator codes extended the flexibility of the system — a particular type of file was not restricted to opening in a particular application — they suffered many of the same problems as file extensions. Type and creator codes could be lost when files were transferred across non-Macintosh systems (such as Unix-based servers), and the plethora of type codes made identification problematic.

In addition, the classic Mac OS did not recognize file extensions at all, leading to unrecognized file errors when files were transferred from DOS/Windows systems. OPENSTEP, which formed the basis of Mac OS X, used extensions, and early versions of Mac OS X followed suit. This led to some controversy with users and developers coming to OS X from NeXT or Windows origins advocating for continued use of file extensions, and those coming from Classic Mac OS urging Apple to replace or supplement file extensions with type and creators.[4]

Other file identification types exist: for example, MIME types are used for identifying data that is transferred over the web. However, Apple's UTI system was designed to create a flexible file association system that would describe data hierarchically and allow for better categorization and searching, standardize data descriptions across contexts, and provide a uniform method of expanding data types. For instance, the public.jpeg and public.png UTIs inherit from the public.image UTI, allowing users to search narrowly for JPEG images or PNG images or broadly for any kind of image merely by changing the specificity of the UTI used in the search. Further, application developers who design new data types can easily extend the UTIs available. For example, a new image format developed by a company may have a UTI of com.company.proprietary-image and be specified to inherit from the public.image type.

Apple's macOS continues to support other forms of file association, and contains utilities for translating between them, but will use UTIs by preference where available.

UTI structure

Apple maintains the public.* domain as a set base data types for all UTIs. Other UTIs are associated with these base UTIs by conformance, a system similar to class inheritance. UTIs that conform to other UTIs share a basic types, and in general any application that works with data of a more general UTI should be able to work with data of any UTI that conforms to that general UTI.

Apple public UTIs

The most basic public UTIs in the Apple hierarchy are as follows:

Identifier Conforms to Comment
public.item base class in the physical hierarchy
public.content base class for all document content
public.data public.item base class for all files, byte streams, pasteboard, etc.
public.image public.data, public.content base class for all images

UTIs are even used to identify other file type identifiers:

Identifier Conforms to Comment
public.filename-extension public.case-insensitive-text Filename extension
public.mime-type public.case-insensitive-text MIME type
com.apple.ostype public.text Four-character code (type OSType)
com.apple.nspboard-type public.text NSPasteboard type

Dynamic UTIs can be created as needed by applications; these have the prefix dyn. and take the form of "a UTI-compatible wrapper around an otherwise unknown filename extension, MIME type, OSType, and so on."[1]

Third-party UTIs

Apple provides a large collection of system-declared Uniform Type Identifiers. Third-party applications can add UTIs to the database maintained by macOS by "exporting" UTIs declared within the application package. Because new UTIs can be declared to "conform to" existing system UTIs, and declarations can associate the new UTIs with file extensions, an exported declaration alone can provide the operating system with enough information to enable new functions, such as enabling Quick Look for new file types.

List of common third-party UTIs

Description UTI Extensions Conforms to MIME types Reference URL
OPML document org.opml.opml .opml public.xml text/xml, text/x-opml, application/xml http://dev.opml.org/spec2.html
Markdown document net.daringfireball.markdown[5] .md, .markdown public.plain-text text/markdown https://daringfireball.net/projects/markdown/
SQLite database vnd.sqlite3[6] .sqlite3, .sqlite, .db public.database, public.data application/vnd.sqlite3 https://www.sqlite.org/fileformat2.html
POSIX Paths document cc.utis.paths-file .paths public.utf8-plain-text not defined https://github.com/utiscc/DotPathsFileSpec
Pasteboard Types org.nspasteboard.TransientType, org.nspasteboard.ConcealedType, org.nspasteboard.AutoGeneratedType, org.nspasteboard.source not a file type N/A N/A http://nspasteboard.org

Looking up a UTI

To get the UTI of a given file, use the mdls (meta data list, part of Spotlight) command in the Terminal.

mdls -name kMDItemContentType -name kMDItemContentTypeTree -name kMDItemKind FILE

References

  1. ^ a b c "Uniform Type Identifiers Overview". Guides and Sample Code. Apple Inc. October 29, 2007. Retrieved September 12, 2016.
  2. ^ "Uniform Type Identifiers — a reintroduction - Tech Talks - Videos". Apple Developer. Retrieved May 17, 2022.
  3. ^ "Folklore.org: The Grand Unified Model (2) - The Finder". www.folklore.org. Retrieved April 12, 2018.
  4. ^ "Mac OS X 10.1 File Name Extension Guidelines - Cocoabuilder". www.cocoabuilder.com. Retrieved April 12, 2018.
  5. ^ "Uniform Type Identifier For Markdown". Daring Fireball. Retrieved August 21, 2019.
  6. ^ "SQLite database file format media type at IANA". Internet Assigned Numbers Authority. IANA. Retrieved August 21, 2019.