Top Qs
Timeline
Chat
Perspective

Path (computing)

Text that identifies an item in a computer file system From Wikipedia, the free encyclopedia

Remove ads

A path (or filepath, file path, pathname, or similar) is a string that uniquely identifies an item in a hierarchical file system. Generally, a path is composed of directory names, special format specifiers, and optionally a filename, all separated by delimiters. This delimiter can vary by operating system, but popular, modern systems use the slash /, backslash \, or colon :.

The case-sensitivity of individual path components will vary based on operating system, or based on options specified at the time of a file system's creation or first use. In practice, this means that for a case-sensitive system, path components named component1 and Component1 can coexist at the same level in the hierarchy, whereas for a case-insensitive file system, they cannot (an error will occur). macOS and Windows' native file systems are case-insensitive by default, whereas typical Linux file systems are case-sensitive.[1][2][3]

A path can be either relative or absolute. A relative path is a path in relation to another, most often the working directory. An absolute path indicates a location regardless of the current directory; that is, it specifies all path components starting from the file system's root, and does not depend on context like a relative path does.

Paths are also essential for locating hierarchically-organized network resources, as seen in URLs and UNC paths.

Remove ads

History

Multics first introduced a hierarchical file system with directories (separated by ">") in the mid-1960s.[4]

Around 1970, Unix introduced the slash / as its directory separator.

Originally, MS-DOS did not support directories. When adding the feature, using the Unix standard of a slash was not a good option since many existing commands used a slash as the switch prefix (i.e., dir /w). In contrast, Unix uses the dash - as the switch prefix. The backslash \ was ultimately chosen instead for its similarity to the slash and not conflicting with existing commands. This convention continued into Windows. However, some areas of Windows do accept or understand Unix-style slashes also, such as PowerShell.[5][6]

Remove ads

Summary of systems

Summarize
Perspective

The following table describes the syntax of paths in notable operating systems:

More information System, Root dir. ...
Remove ads

In programming languages

Most programming languages use the path representation of the underlying system, but some may also be system-independent.

For instance, this C code is system-dependent and may fail on opposing systems:

uxFile = fopen("project/readme.txt", "r") // Fails on Windows
winFile = fopen("C:\\Program Files\\bin\\config.bat", "r") // Fails on Unix
  • In Java, the File.separator field stores the system-dependent separator.[20] Some functions preclude the need for the separator entirely.
import java.io.File;
import java.nio.file.Path;
import java.nio.file.Paths;
// ...
File file = new File("path" + File.separator + "file.txt");
Path path = Paths.get("path", "file.txt");
  • In Python, the pathlib module offers system-independent path operations.[21]
from pathlib import Path

with (Path("path") / "to" / "file.txt").open() as open_file:
    ...
  • In Seed7, all paths use the Unix path convention, regardless of operating system. Under Windows, the path /c/Users/ would be mapped to C:\Users\.[22]

In Unix

Summarize
Perspective

Most Unix-like systems use a similar syntax.[23] POSIX allows treating a path beginning with two slashes in an implementation-defined manner,[24] though in other cases systems must treat consecutive slashes as one.[25]

Many applications on Unix-like systems (for example, scp, rcp, and rsync) use resource definitions such as hostname:/directorypath/resource, or URI schemes with the service name (here 'smb'), like smb://hostname/directorypath/resource.

In macOS

When macOS was being developed, it inherited some pathname choices from Classic Mac OS and the Unix-like NeXTSTEP. The classic Mac OS uses a : while Unix and Unix-like systems use a / as the path delimiter. As a solution, to preserve compatibility for software and familiarity for users, and to allow disk file systems to be used both by the classic Mac OS and macOS, some portions of macOS convert between colons and slashes in pathnames;[26] for example, the HFS+ file system, from the classic Mac OS, converts colons in file names to slashes and, when reading a directory, converts slashes in filenames to colons,[27] as and the Carbon toolkit converts colons in pathnames to slashes and slashes in path names to colons, and converts them back when providing filenames and pathnames to the caller.[27]

Remove ads

In DOS and Windows

Summarize
Perspective
Thumb
Screenshot of a Windows Command Prompt shell showing filenames in a directory

DOS and Windows have no single root directory; a root exists for each storage drive, indicated with a drive letter or through UNC.

Directory and file name comparisons are case-insensitive: "test.TXT" would match "Test.txt".[28]

Windows understands the following kinds of paths:

  • Local paths, such as C:\File.
  • Universal naming convention (UNC).
  • DOS device paths, such as \\?\C:\File or \\.\UNC\Server\Volume\File. The first, \\?\ skips path normalization. The second, \\.\ uses the raw device namespace.[28][29]

In the Windows API, file I/O functions automatically convert / into \ (except when using the \\?\ prefix). Unless the \\?\ prefix is used, paths are limited to the length defined by MAX_PATH, which is 260.[30]

PowerShell allows slash-interoperability for backwards-compatibility:[31]

PS C:\>Get-Content -Path "C:/path/to/file.txt"

Here is some text within a file

Yen/won character error

Japanese and Korean versions of Windows often displayed the '¥' character or the '' character instead of the directory separator. This is because while in ANSI codepages, the character at 0x5C was the backslash, and in Japanese and Korean codepages, 0x5C was the yen and won signs, respectively. Therefore, when the character for a backslash was used, other glyphs appeared.[32]

Universal Naming Convention

The Microsoft Universal Naming Convention (UNC, uniform naming convention, or network path), is a syntax to describe the location of a network resource, such as a shared file, directory, or printer. A UNC path has the general form:

\\ComputerName\SharedFolder\Resource

Some Windows interfaces allow or require UNC syntax for WebDAV share access, rather than a URL. The UNC syntax is extended with optional components to denote use of SSL and TCP/IP port number. Thus, the WebDAV URL of https://hostname[:port]/SharedFolder/Resource becomes \\hostname[@SSL][@port]\SharedFolder\Resource.[33]

When viewed remotely, the "SharedFolder" may have a name different from what a program on the server sees when opening "\SharedFolder". Instead, the SharedFolder name consists of an arbitrary name assigned to the folder when defining its "sharing".

Since UNCs start with two backslashes, and the backslash is also used for escape sequences and in regular expressions, cases of leaning toothpick syndrome may arise. An escaped string for a regular expression matching a UNC begins with 8 backslashes \\\\\\\\ because the string and regular expression both require escaping. This can be simplified by using raw strings, such as @"\\\\" in C#, r'\\\\' in Python, or qr{\\\\} in Perl.

Remove ads

See also

  • basename – Shell command for extracting the last name from a path
  • Device file – Interface to a device driver that appears in a file system as if it were an ordinary file
  • dirname – Shell command in Unix systems
  • Distributed file system – Type of decentralized filesystem
  • Filename – Text string used to uniquely identify a computer file
  • Filesystem Hierarchy Standard – Linux standard for directory structure
  • Fully qualified file name – Unambiguous name in computer code
  • PATH (variable) – Computer environment variable
  • URL – Web address to a particular file or page
Remove ads

Notes

  1. Limited compatibility, may not work in all Windows programs
  2. In order, the fields are ddnn:[g,m]:filename.type;version
  3. &, %, and @ can also be used to reference the root of the current user, the library, and the current directory respectively
  4. When filesystems with filename extensions are mounted, . characters are changed to /, as in the Japan/gif example above
  5. Short for Tandem Advanced Command Language
  6. [prefix] may be a number (0–31),  * (boot volume) or @ (AppleShare home directory)
Remove ads

References

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.

Remove ads