wikipedia File descriptor

NOTE:

1、这篇文章总结的非常好

In Unix and related computer operating systems, a file descriptor (FD, less frequently fildes) is an abstract indicator (handle) used to access a file or other input/output resource, such as a pipe or network socket.

NOTE:

1、"tag-philosophy-everything is a file descriptor-is a good abstraction"

File descriptors form part of the POSIX application programming interface. A file descriptor is a non-negative integer, generally represented in the C programming language as the type int (negative values being reserved to indicate "no value" or an error condition).

Standard POSIX file descriptors

Each Unix process (except perhaps a daemon) should expect to have three standard POSIX file descriptors, corresponding to the three standard streams:

Integer value	Name	unistd.h symbolic constant	stdio.h file stream
0	Standard input	`STDIN_FILENO`	`stdin`
1	Standard output	`STDOUT_FILENO`	`stdout`
2	Standard error	`STDERR_FILENO`	`stderr`

Overview

In the traditional implementation of Unix, file descriptors index into a per-process file descriptor table maintained by the kernel, that in turn indexes into a system-wide table of files opened by all processes, called the file table. This table(指代的是file table) records the mode with which the file (or other resource) has been opened: for reading, writing, appending, and possibly other modes. It also indexes into a third table called the inode table that describes the actual underlying files. To perform input or output, the process passes the file descriptor to the kernel through a system call, and the kernel will access the file on behalf of the process. The process does not have direct access to the file or inode tables.

NOTE :

一、在APUE的3.10 File Sharing也描述了这部分内容；需要注意的是：the data structures used by the kernel for all I/O.即所有的IO都是采用的类似于上述的结构；并且上述结构需要和Process control block 一起来理解才能够很好的对Unix OS的IO有一个整体的认知；

二、需要注意，上面这段话中提及 file descriptor table 和 file table 时，前面分别加上了修饰语： per-process 和 system-wide ；这两个修饰语是非常重要的，需要将它们和the data structures used by the kernel for all I/O一起来进行理解；因为 file descriptor table 的scope是process，即每个process都有一套自己的 file descriptor table ，所以每个process的file descriptor都是从0开始增长；显然比较两个process的file descriptor是没有意义的（处理0,1,2，因为它们都已经被默认绑定到STDIN_FILENO ,STDOUT_FILENO ,STDERR_FILENO ）；而file table的scope是system，即所有的process都将共享file table；

三、每次调用open系统调用，都会创建一个file table entry

四、"This table(指代的是file table) records the mode with which the file (or other resource) has been opened: for reading, writing, appending, and possibly other modes."

上述mode指的是"file status flags"

On Linux, the set of file descriptors open in a process can be accessed under the path /proc/PID/fd/, where PID is the process identifier.

In Unix-like systems, file descriptors can refer to any Unix file type named in a file system. As well as regular files, this includes directories, block and character devices (also called "special files"), Unix domain sockets, and named pipes. File descriptors can also refer to other objects that do not normally exist in the file system, such as anonymous pipes and network sockets.

NOTE: Everything is a file ；从kernel实现的角度来看看待everything in Unix is file，Unix-like system是monolithic kernel，上面提到的这些device或者file都是由kernel来进行维护，它们都有对应的kernel structure；我们通过file descriptor来引用这些kernel structure，我们只能够通过system call来对这些kernel structure进行操作；

The FILE data structure in the C standard I/O library usually includes a low level file descriptor for the object in question on Unix-like systems. The overall data structure provides additional abstraction and is instead known as a file handle.

File descriptors for a single process, file table and inode table. Note that multiple file descriptors can refer to the same file table entry (e.g., as a result of the dup system call[3]:104 and that multiple file table entries can in turn refer to the same inode (if it has been opened multiple times; the table is still simplified because it represents inodes by file names, even though an inode can have multiple names). File descriptor 3 does not refer to anything in the file table, signifying that it has been closed.

NOTE: 上述的三层对应关系存在着多种可能的情况，再加上OS提供的fork机制（子进程继承父进程的file descriptor和file table entry），各种IO操作（比如dup，read，write）等等都导致了问题的复杂性；

比如存在着这些可能的情况： - dup，同一进程中，多个file descriptor指向了同一个file table entry

fork后，父进程，子进程的同一个file descriptor共享同一个file table entry（因为file descriptor table是每个进程私有的，所以这种情况其实类似于第一种情况，即多个file descriptor指向了同一个file table entry）

上面描述了file descriptor和file table entry之间的对应关系，下面描述file table entry和iNode之间的关系：是有可能存在多个不同的file table entry指向了同一个iNode的；

显然OS的这种设计，就导致当一个文件被多个不同的process进行share的时候，而每个process都可以执行一系列的IO操作，这就导致了可能存在的数据冲突问题；

总的来说，按照OS的这总结构设计，以及OS提供的各种操作，是可以总结出可能的所有情形的；

Operations on file descriptors

NOTE:

1、下面的总结非常好

The following lists typical operations on file descriptors on modern Unix-like systems. Most of these functions are declared in the <unistd.h> header, but some are in the <fcntl.h> header instead.

Creating file descriptors

open()
creat()
socket()
accept()
socketpair()
pipe()
opendir()
open_by_handle_at() (Linux)
signalfd() (Linux)
eventfd() (Linux)
timerfd_create() (Linux)
memfd_create() (Linux)
userfaultfd() (Linux)

Deriving file descriptors

dirfd()
fileno()

Operations on a single file descriptor

read(), write()
readv(), writev()
pread(), pwrite()
recv(), send()
recvmsg(), sendmsg() (including allowing sending FDs)
sendfile()
lseek()
fstat()
fchmod()
fchown()
fdopen()
ftruncate()
fsync()
fdatasync()
fstatvfs()
dprintf()
vmsplice() (Linux)

Operations on multiple file descriptors

select(), pselect()
poll()
epoll() (for Linux)
kqueue() (for BSD-based systems).
sendfile()
splice(), tee() (for Linux)

Operations on the file descriptor table

The fcntl() function is used to perform various operations on a file descriptor, depending on the command argument passed to it. There are commands to get and set attributes associated with a file descriptor, including F_GETFD, F_SETFD, F_GETFL and F_SETFL.

close()
closefrom() (BSD and Solaris only; deletes all file descriptors greater than or equal to specified number)
dup() (duplicates an existing file descriptor guaranteeing to be the lowest number available file descriptor)
dup2() (the new file descriptor will have the value passed as an argument)
fcntl (F_DUPFD)

Operations that modify process state

fchdir() (sets the process's current working directory based on a directory file descriptor)
mmap() (maps ranges of a file into the process's address space)

File locking

flock()
fcntl() (F_GETLK, F_SETLK) and F_SETLKW
lockf()

Sockets

connect()
bind()
listen()
accept() (creates a new file descriptor for an incoming connection)
getsockname()
getpeername()
getsockopt()
setsockopt()
shutdown() (shuts down one or both halves of a full duplex connection)

Miscellaneous

ioctl() (a large collection of miscellaneous operations on a single file descriptor, often associated with a device)

Upcoming operations

A series of new operations on file descriptors has been added to many modern Unix-like systems, as well as numerous C libraries, to be standardized in a future version of POSIX.[5] The at suffix signifies that the function takes an additional first argument supplying a file descriptor from which relative paths are resolved, the forms lacking the at suffix thus becoming equivalent to passing a file descriptor corresponding to the current working directory. The purpose of these new operations is to defend against a certain class of TOCTTOU attacks.

openat()
faccessat()
fchmodat()
fchownat()
fstatat()
futimesat()
linkat()
mkdirat()
mknodat()
readlinkat()
renameat()
symlinkat()
unlinkat()
mkfifoat()
fdopendir()

File descriptors as capabilities

NOTE: 一、初次阅读下面这段话的时候，我是比较疑惑的: Linux中file descriptor的scope是process，也就是说file descriptor仅仅是在一个process内有效的，那"passed between processes"有什么意义呢？后来查阅了一些资料，发现:

1、是有意义的，Linux进行了特殊的实现，在Pass-file-descriptor章节对此进行了讨论。

2、由于Linux采用的是"File descriptors as capabilities"，那当passing一个file descriptor的时候，同时也就passing了"capabilities"。

Unix file descriptors behave in many ways as capabilities. They can be passed between processes across Unix domain sockets using the sendmsg() system call. Note, however, that what is actually passed is a reference to an "open file description" that has mutable state (the file offset, and the file status and access flags). This complicates the secure use of file descriptors as capabilities, since when programs share access to the same open file description, they can interfere(冲突、妨碍) with each other's use of it by changing its offset or whether it is blocking or non-blocking, for example.[6][7] In operating systems that are specifically designed as capability systems, there is very rarely any mutable state associated with a capability itself.

A Unix process' file descriptor table is an example of a C-list.