天天看點

linux系統引導過程

linux系統引導過程

linux-0.11引導時,将依次運作BIOS程式、bootsect.s、setup.s和head.s,完成引導過程後進入到main函數運作。BIOS完成硬體的檢查與初始化等工作後,從硬碟的MBR中讀取bootsect代碼;bootsect程式主要用于讀取setup和system子產品(包含head.s)代碼到記憶體中,然後跳轉到setup執行;setup首先讀取記憶體、硬碟等裝置參數到記憶體中供後續程式使用,然後設定gdt、idt表後,最後設定機器進入保護模式并跳轉到head繼續執行;head首先進一步完善gdt、idt表的設定,然後建構頁目錄和頁表并啟動分頁記憶體管理,最後跳轉到main函數執行。

BIOS

BIOS主要過程:

  1. 當電腦的電源開啟後,BIOS程式會從主機闆的ROM晶片運作;
  2. BIOS進行加電自檢(POST),測試和初始化CPU、RAM、DMA、晶片組、鍵盤、軟碟、硬碟等裝置;
  3. BIOS從引導裝置(硬碟、軟碟、CD光牒等)的主引導記錄(MBR)中加載引導程式,再由引導程式加載操縱系統。

BIOS中斷調用:

BIOS還為作業系統提供了運作時服務,即BIOS中斷調用,中斷可通過彙編int指令調用。作業系統依賴BIOS提供的中斷調用(輸出/輸出中斷調用)加載核心,然後由核心将系統從16位實模式轉換到32位保護模式運作。

BIOS中斷向量表:

中斷 描述
INT 00h CPU:除零錯,或商不合法時觸發
INT 01h CPU:單步陷阱,TF标記為打開狀态時,每條指令執行後觸發
INT 02h CPU:非可屏蔽中斷,如引導自我測試時發生記憶體錯誤。
INT 03h CPU:第一個未定義的中斷向量,約定俗成僅用于調試程式
INT 04h CPU:算數溢出。通常由INTO指令在置溢出位時觸發。
INT 05h 在按下Shift-Print Screen或BOUND指令檢測到範圍異常時觸發。
INT 06h CPU:非法指令。
INT 07h CPU:沒有數學協處理器時嘗試執行浮點指令觸發。
INT 08h IRQ0:可程式設計中斷控制器每 55 毫秒觸發一次,即每秒 18.2 次。
INT 09h IRQ1:每次鍵盤按下、按住、釋放。
INT 0Ah IRQ2:
INT 0Bh IRQ3:COM2/COM4。
INT 0Ch IRQ4:COM1/COM3。
INT 0Dh IRQ5:硬碟控制器(PC/XT 下)或 LPT2。
INT 0Eh IRQ6:需要時由軟碟控制器調用。
INT 0Fh IRQ7:LPT1。
INT 10h 顯示服務 - 由BIOS或作業系統設定以供軟體調用。AH=00h設定顯示模式AH=01h設定遊标形态AH=02h設定光标位置AH=03h擷取光标位置與形态AH=04h擷取光标位置AH=05h設定顯示頁AH=06h清除或卷軸框畫面(上)AH=07h清除或卷軸框畫面(下)AH=08h讀取遊标處字元與屬性AH=09h更改遊标處字元與屬性AH=0Ah更改遊标處字元AH=0Bh設定邊界顔色AH=0Eh在TTY模式下寫字元AH=0Fh擷取目前顯示模式AH=13h寫字元串
INT 11h 傳回裝置清單。
INT 12h 擷取正常記憶體容量。
INT 13h 低端磁盤服務。AH=00h複位磁盤驅動器。AH=01h檢查磁盤驅動器狀态。AH=02h讀扇區。AH=03h寫扇區。AH=04h校驗扇區。AH=05h格式化磁道。AH=08h擷取驅動器參數。AH=09h初始化硬碟驅動器參數。AH=0Ch尋道。AH=0Dh複位硬碟控制器。AH=15h擷取驅動器類型。AH=16h擷取軟驅中盤片的狀态。
INT 14h 序列槽通信例程。AH=00h初始化序列槽。AH=01h寫出字元。AH=02h讀入字元。AH=03h狀态。
INT 15h 其它(系統支援例程)。AH=4FH鍵盤攔截。AH=83H事件等待。AH=84H讀遊戲杆。AH=85HSysRq 鍵。AH=86H等待。AH=87H塊移動。AH=88H擷取擴充記憶體容量。AH=C0H擷取系統參數。AH=C1H擷取擴充 BIOS 資料區塊。AH=C2H指針裝置功能。AH=E8h, AL=01h (AX = E801h)擷取擴充記憶體容量(自從 1994 年引入的新功能),可擷取到 64MB 以上的記憶體容量。AH=E8h, AL=20h (AX = E820h)查詢系統位址映射。該功能取代了 AX=E801h 和 AH=88h。
INT 16h 鍵盤通信例程。AH=00h讀字元。AH=01h讀輸入狀态。AH=02h讀 Shift 鍵(修改鍵)狀态。AH=10h讀字元(增強版)。AH=11h讀輸入狀态(增強版)。AH=12h讀 Shift 鍵(修改鍵)狀态(增強版)。
INT 17h 列印服務。AH=00h列印字元。AH=01h初始化列印機。AH=02h檢查列印機狀态。
INT 18h 執行錄音帶上的 BASIC 程式:“真正的”IBM 相容機在 ROM 裡内置 BASIC 程式,當引導失敗時由 BIOS 調用此例程解釋執行。(例:列印“Boot disk error. Replace disk and press any key to continue...”這類提示資訊)
INT 19h 加電自檢之後加載作業系統。
INT 1Ah 實時鐘服務。AH=00h讀取實時鐘。AH=01h設定實時鐘。AH=02h讀取實時鐘時間。AH=03h設定實時鐘時間。AH=04h讀取實時鐘日期。AH=05h設定實時鐘日期。AH=06h設定實時鐘鬧鈴。AH=07h重置實時鐘鬧鈴。
INT 1Bh Ctrl+Break,由 IRQ 9 自動調用。
INT 1Ch 預留,由 IRQ 8 自動調用。
INT 1Dh 不可調用:指向視訊參數表(包含視訊模式的資料)的指針。
INT 1Eh 不可調用:指向軟碟模式表(包含關于軟驅的大量資訊)的指針。
INT 1Fh 不可調用:指向視訊圖形字元表(包含從 80h 到 FFh 的 ASCII 字元的資料)的資訊。
INT 41h 位址指針:硬碟參數表(第一硬碟)。
INT 46h 位址指針:硬碟參數表(第二硬碟)。
INT 4Ah 實時鐘在鬧鈴時調用。
INT 70h IRQ8:由實時鐘調用。
INT 74h IRQ12:由滑鼠調用
INT 75h IRQ13:由數學協處理器調用。
INT 76h IRQ14:由第一個 IDE 控制器所調用
INT 77h IRQ15:由第二個 IDE 控制器所調用

bootsect.s

bootsect主要過程:

  1. bootsect被bios-startup加載到0x7c00記憶體處,然後bois-start跳轉到bootsect.s開始執行;
  2. bootsect将自身代碼從0x7c00複制到0x90000處,并跳轉到複制後的位址處繼續往下執行;
  3. bootsect讀取setup到0x90200處,讀取system到0x10000處;
  4. bootsect跳轉到setup執行

bootsect代碼:

!
! SYS_SIZE is the number of clicks (16 bytes) to be loaded.
! 0x3000 is 0x30000 bytes = 196kB, more than enough for current
! versions of linux
!
SYSSIZE = 0x3000
!
!	bootsect.s		(C) 1991 Linus Torvalds
!
! bootsect.s is loaded at 0x7c00 by the bios-startup routines, and moves
! iself out of the way to address 0x90000, and jumps there.
!
! It then loads 'setup' directly after itself (0x90200), and the system
! at 0x10000, using BIOS interrupts. 
!
! NOTE! currently system is at most 8*65536 bytes long. This should be no
! problem, even in the future. I want to keep it simple. This 512 kB
! kernel size should be enough, especially as this doesn't contain the
! buffer cache as in minix
!
! The loader has been made as simple as possible, and continuos
! read errors will result in a unbreakable loop. Reboot by hand. It
! loads pretty fast by getting whole sectors at a time whenever possible.

.globl begtext, begdata, begbss, endtext, enddata, endbss
.text
begtext:
.data
begdata:
.bss
begbss:
.text

SETUPLEN = 4				! nr of setup-sectors
BOOTSEG  = 0x07c0			! original address of boot-sector
INITSEG  = 0x9000			! we move boot here - out of the way
SETUPSEG = 0x9020			! setup starts here
SYSSEG   = 0x1000			! system loaded at 0x10000 (65536).
ENDSEG   = SYSSEG + SYSSIZE		! where to stop loading

! ROOT_DEV:	0x000 - same type of floppy as boot.
!		0x301 - first partition on first drive etc
ROOT_DEV = 0x306

entry start
start:
	mov	ax,#BOOTSEG
	mov	ds,ax
	mov	ax,#INITSEG
	mov	es,ax
	mov	cx,#256
	sub	si,si
	sub	di,di
	rep
	movw
	jmpi	go,INITSEG
go:	mov	ax,cs
	mov	ds,ax
	mov	es,ax
! put stack at 0x9ff00.
	mov	ss,ax
	mov	sp,#0xFF00		! arbitrary value >>512

! load the setup-sectors directly after the bootblock.
! Note that 'es' is already set up.

load_setup:
	mov	dx,#0x0000		! drive 0, head 0
	mov	cx,#0x0002		! sector 2, track 0
	mov	bx,#0x0200		! address = 512, in INITSEG
	mov	ax,#0x0200+SETUPLEN	! service 2, nr of sectors
	int	0x13			! read it
	jnc	ok_load_setup		! ok - continue
	mov	dx,#0x0000
	mov	ax,#0x0000		! reset the diskette
	int	0x13
	j	load_setup

ok_load_setup:

! Get disk drive parameters, specifically nr of sectors/track

	mov	dl,#0x00
	mov	ax,#0x0800		! AH=8 is get drive parameters
	int	0x13
	mov	ch,#0x00
	seg cs
	mov	sectors,cx
	mov	ax,#INITSEG
	mov	es,ax

! Print some inane message

	mov	ah,#0x03		! read cursor pos
	xor	bh,bh
	int	0x10
	
	mov	cx,#24
	mov	bx,#0x0007		! page 0, attribute 7 (normal)
	mov	bp,#msg1
	mov	ax,#0x1301		! write string, move cursor
	int	0x10

! ok, we've written the message, now
! we want to load the system (at 0x10000)

	mov	ax,#SYSSEG
	mov	es,ax		! segment of 0x010000
	call	read_it
	call	kill_motor

! After that we check which root-device to use. If the device is
! defined (!= 0), nothing is done and the given device is used.
! Otherwise, either /dev/PS0 (2,28) or /dev/at0 (2,8), depending
! on the number of sectors that the BIOS reports currently.

	seg cs
	mov	ax,root_dev
	cmp	ax,#0
	jne	root_defined
	seg cs
	mov	bx,sectors
	mov	ax,#0x0208		! /dev/ps0 - 1.2Mb
	cmp	bx,#15
	je	root_defined
	mov	ax,#0x021c		! /dev/PS0 - 1.44Mb
	cmp	bx,#18
	je	root_defined
undef_root:
	jmp undef_root
root_defined:
	seg cs
	mov	root_dev,ax

! after that (everyting loaded), we jump to
! the setup-routine loaded directly after
! the bootblock:

	jmpi	0,SETUPSEG

! This routine loads the system at address 0x10000, making sure
! no 64kB boundaries are crossed. We try to load it as fast as
! possible, loading whole tracks whenever we can.
!
! in:	es - starting address segment (normally 0x1000)
!
sread:	.word 1+SETUPLEN	! sectors read of current track
head:	.word 0			! current head
track:	.word 0			! current track

read_it:
	mov ax,es
	test ax,#0x0fff
die:	jne die			! es must be at 64kB boundary
	xor bx,bx		! bx is starting address within segment
rp_read:
	mov ax,es
	cmp ax,#ENDSEG		! have we loaded all yet?
	jb ok1_read
	ret
ok1_read:
	seg cs
	mov ax,sectors
	sub ax,sread
	mov cx,ax
	shl cx,#9
	add cx,bx
	jnc ok2_read
	je ok2_read
	xor ax,ax
	sub ax,bx
	shr ax,#9
ok2_read:
	call read_track
	mov cx,ax
	add ax,sread
	seg cs
	cmp ax,sectors
	jne ok3_read
	mov ax,#1
	sub ax,head
	jne ok4_read
	inc track
ok4_read:
	mov head,ax
	xor ax,ax
ok3_read:
	mov sread,ax
	shl cx,#9
	add bx,cx
	jnc rp_read
	mov ax,es
	add ax,#0x1000
	mov es,ax
	xor bx,bx
	jmp rp_read

read_track:
	push ax
	push bx
	push cx
	push dx
	mov dx,track
	mov cx,sread
	inc cx
	mov ch,dl
	mov dx,head
	mov dh,dl
	mov dl,#0
	and dx,#0x0100
	mov ah,#2
	int 0x13
	jc bad_rt
	pop dx
	pop cx
	pop bx
	pop ax
	ret
bad_rt:	mov ax,#0
	mov dx,#0
	int 0x13
	pop dx
	pop cx
	pop bx
	pop ax
	jmp read_track

/*
 * This procedure turns off the floppy drive motor, so
 * that we enter the kernel in a known state, and
 * don't have to worry about it later.
 */
kill_motor:
	push dx
	mov dx,#0x3f2
	mov al,#0
	outb
	pop dx
	ret

sectors:
	.word 0

msg1:
	.byte 13,10
	.ascii "Loading system ..."
	.byte 13,10,13,10

.org 508
root_dev:
	.word ROOT_DEV
boot_flag:
	.word 0xAA55

.text
endtext:
.data
enddata:
.bss
endbss:

           

setup.s

setup主要過程:

  1. 擷取光标位置、擴充記憶體大小、顯示卡頁參數、等參數,并儲存在記憶體0x90000-0x901FF處,以供核心相關程式使用。儲存的參數如下:
linux系統引導過程
  1. 将system代碼從記憶體0x10000-0x8ffff處移動到0x00000-0x7ffff處。(system代碼不超過512KB)
  2. 加載idt(中斷描述符表)與gdt(全局描述符表),開啟20位位址線;然後将機器的CR0控制寄存器的PE位置為1,使機器進入保護模式,并跳轉到gdt表代碼段的0偏移處運作,即運作System代碼子產品的head.s。

在setup執行階段,gdt表中定義了一個代碼段和一個資料段,兩個段位址空間完全重疊,段位址基址為0x00000;ldt表未定義任何段。

setup.s代碼:

!
!	setup.s		(C) 1991 Linus Torvalds
!
! setup.s is responsible for getting the system data from the BIOS,
! and putting them into the appropriate places in system memory.
! both setup.s and system has been loaded by the bootblock.
!
! This code asks the bios for memory/disk/other parameters, and
! puts them in a "safe" place: 0x90000-0x901FF, ie where the
! boot-block used to be. It is then up to the protected mode
! system to read them from there before the area is overwritten
! for buffer-blocks.
!

! NOTE! These had better be the same as in bootsect.s!

INITSEG  = 0x9000	! we move boot here - out of the way
SYSSEG   = 0x1000	! system loaded at 0x10000 (65536).
SETUPSEG = 0x9020	! this is the current segment

.globl begtext, begdata, begbss, endtext, enddata, endbss
.text
begtext:
.data
begdata:
.bss
begbss:
.text

entry start
start:

! ok, the read went well so we get current cursor position and save it for
! posterity.

	mov	ax,#INITSEG	! this is done in bootsect already, but...
	mov	ds,ax
	mov	ah,#0x03	! read cursor pos
	xor	bh,bh
	int	0x10		! save it in known place, con_init fetches
	mov	[0],dx		! it from 0x90000.

! Get memory size (extended mem, kB)

	mov	ah,#0x88
	int	0x15
	mov	[2],ax

! Get video-card data:

	mov	ah,#0x0f
	int	0x10
	mov	[4],bx		! bh = display page
	mov	[6],ax		! al = video mode, ah = window width

! check for EGA/VGA and some config parameters

	mov	ah,#0x12
	mov	bl,#0x10
	int	0x10
	mov	[8],ax
	mov	[10],bx
	mov	[12],cx

! Get hd0 data

	mov	ax,#0x0000
	mov	ds,ax
	lds	si,[4*0x41]
	mov	ax,#INITSEG
	mov	es,ax
	mov	di,#0x0080
	mov	cx,#0x10
	rep
	movsb

! Get hd1 data

	mov	ax,#0x0000
	mov	ds,ax
	lds	si,[4*0x46]
	mov	ax,#INITSEG
	mov	es,ax
	mov	di,#0x0090
	mov	cx,#0x10
	rep
	movsb

! Check that there IS a hd1 :-)

	mov	ax,#0x01500
	mov	dl,#0x81
	int	0x13
	jc	no_disk1
	cmp	ah,#3
	je	is_disk1
no_disk1:
	mov	ax,#INITSEG
	mov	es,ax
	mov	di,#0x0090
	mov	cx,#0x10
	mov	ax,#0x00
	rep
	stosb
is_disk1:

! now we want to move to protected mode ...

	cli			! no interrupts allowed !

! first we move the system to it's rightful place

	mov	ax,#0x0000
	cld			! 'direction'=0, movs moves forward
do_move:
	mov	es,ax		! destination segment
	add	ax,#0x1000
	cmp	ax,#0x9000
	jz	end_move
	mov	ds,ax		! source segment
	sub	di,di
	sub	si,si
	mov 	cx,#0x8000
	rep
	movsw
	jmp	do_move

! then we load the segment descriptors

end_move:
	mov	ax,#SETUPSEG	! right, forgot this at first. didn't work :-)
	mov	ds,ax
	lidt	idt_48		! load idt with 0,0
	lgdt	gdt_48		! load gdt with whatever appropriate

! that was painless, now we enable A20

	call	empty_8042
	mov	al,#0xD1		! command write
	out	#0x64,al
	call	empty_8042
	mov	al,#0xDF		! A20 on
	out	#0x60,al
	call	empty_8042

! well, that went ok, I hope. Now we have to reprogram the interrupts :-(
! we put them right after the intel-reserved hardware interrupts, at
! int 0x20-0x2F. There they won't mess up anything. Sadly IBM really
! messed this up with the original PC, and they haven't been able to
! rectify it afterwards. Thus the bios puts interrupts at 0x08-0x0f,
! which is used for the internal hardware interrupts as well. We just
! have to reprogram the 8259's, and it isn't fun.

	mov	al,#0x11		! initialization sequence
	out	#0x20,al		! send it to 8259A-1
	.word	0x00eb,0x00eb		! jmp $+2, jmp $+2
	out	#0xA0,al		! and to 8259A-2
	.word	0x00eb,0x00eb
	mov	al,#0x20		! start of hardware int's (0x20)
	out	#0x21,al
	.word	0x00eb,0x00eb
	mov	al,#0x28		! start of hardware int's 2 (0x28)
	out	#0xA1,al
	.word	0x00eb,0x00eb
	mov	al,#0x04		! 8259-1 is master
	out	#0x21,al
	.word	0x00eb,0x00eb
	mov	al,#0x02		! 8259-2 is slave
	out	#0xA1,al
	.word	0x00eb,0x00eb
	mov	al,#0x01		! 8086 mode for both
	out	#0x21,al
	.word	0x00eb,0x00eb
	out	#0xA1,al
	.word	0x00eb,0x00eb
	mov	al,#0xFF		! mask off all interrupts for now
	out	#0x21,al
	.word	0x00eb,0x00eb
	out	#0xA1,al

! well, that certainly wasn't fun :-(. Hopefully it works, and we don't
! need no steenking BIOS anyway (except for the initial loading :-).
! The BIOS-routine wants lots of unnecessary data, and it's less
! "interesting" anyway. This is how REAL programmers do it.
!
! Well, now's the time to actually move into protected mode. To make
! things as simple as possible, we do no register set-up or anything,
! we let the gnu-compiled 32-bit programs do that. We just jump to
! absolute address 0x00000, in 32-bit protected mode.

	mov	ax,#0x0001	! protected mode (PE) bit
	lmsw	ax		! This is it!
	jmpi	0,8		! jmp offset 0 of segment 8 (cs)

! This routine checks that the keyboard command queue is empty
! No timeout is used - if this hangs there is something wrong with
! the machine, and we probably couldn't proceed anyway.
empty_8042:
	.word	0x00eb,0x00eb
	in	al,#0x64	! 8042 status port
	test	al,#2		! is input buffer full?
	jnz	empty_8042	! yes - loop
	ret

gdt:
	.word	0,0,0,0		! dummy

	.word	0x07FF		! 8Mb - limit=2047 (2048*4096=8Mb)
	.word	0x0000		! base address=0
	.word	0x9A00		! code read/exec
	.word	0x00C0		! granularity=4096, 386

	.word	0x07FF		! 8Mb - limit=2047 (2048*4096=8Mb)
	.word	0x0000		! base address=0
	.word	0x9200		! data read/write
	.word	0x00C0		! granularity=4096, 386

idt_48:
	.word	0			! idt limit=0
	.word	0,0			! idt base=0L

gdt_48:
	.word	0x800		! gdt limit=2048, 256 GDT entries
	.word	512+gdt,0x9	! gdt base = 0X9xxxx
	
.text
endtext:
.data
enddata:
.bss
endbss:

           

head.s

head.s被編譯生成目标檔案後,會核心其它程式一起被連結為system子產品,head.s代碼位于system子產品最開始的部分,也是system子產品最先執行的程式代碼。system代碼被bootsect程式加載到0x10000位址處,并由setup程式移動到0位址處,是以,head.s代碼位于0位址處。setup執行完後,将機器設定位保護模式,并跳轉到head.s代碼處執行,head.s是32位程式。

head主要過程:

  1. 設定idt:為idt表設定256個表項,每個表項被初始化指向預設的中斷處理過程(ignore_int)。ignore_int過程不執行任何具有實際意義的操作,相當于空過程。後續執行的核心程式可根據需要在idt表中注冊中斷門、陷阱門或系統門。
  2. 設定gdt:為gdt表設定256個表項,為使用的表項初始為0值。gdt表象可指向代碼段、資料段、LDT,以及任務狀态段。後續執行的核心程式可根據需要在gdt中設定相關段。
  3. 設定頁目錄和頁表:頁目錄和頁表被放置在0位址處,然後将頁目錄表位址寫入CR3寄存器,并置CR0寄存器的PG位為1,PG位為1意為啟用分頁管理機制。
  4. 跳轉到mian函數執行

head.s代碼:

/*
 *  linux/boot/head.s
 *
 *  (C) 1991  Linus Torvalds
 */

/*
 *  head.s contains the 32-bit startup code.
 *
 * NOTE!!! Startup happens at absolute address 0x00000000, which is also where
 * the page directory will exist. The startup code will be overwritten by
 * the page directory.
 */
.text
.globl _idt,_gdt,_pg_dir,_tmp_floppy_area
_pg_dir:
startup_32:
	movl $0x10,%eax
	mov %ax,%ds
	mov %ax,%es
	mov %ax,%fs
	mov %ax,%gs
	lss _stack_start,%esp
	call setup_idt
	call setup_gdt
	movl $0x10,%eax		# reload all the segment registers
	mov %ax,%ds		# after changing gdt. CS was already
	mov %ax,%es		# reloaded in 'setup_gdt'
	mov %ax,%fs
	mov %ax,%gs
	lss _stack_start,%esp
	xorl %eax,%eax
1:	incl %eax		# check that A20 really IS enabled
	movl %eax,0x000000	# loop forever if it isn't
	cmpl %eax,0x100000
	je 1b
/*
 * NOTE! 486 should set bit 16, to check for write-protect in supervisor
 * mode. Then it would be unnecessary with the "verify_area()"-calls.
 * 486 users probably want to set the NE (#5) bit also, so as to use
 * int 16 for math errors.
 */
	movl %cr0,%eax		# check math chip
	andl $0x80000011,%eax	# Save PG,PE,ET
/* "orl $0x10020,%eax" here for 486 might be good */
	orl $2,%eax		# set MP
	movl %eax,%cr0
	call check_x87
	jmp after_page_tables

/*
 * We depend on ET to be correct. This checks for 287/387.
 */
check_x87:
	fninit
	fstsw %ax
	cmpb $0,%al
	je 1f			/* no coprocessor: have to set bits */
	movl %cr0,%eax
	xorl $6,%eax		/* reset MP, set EM */
	movl %eax,%cr0
	ret
.align 2
1:	.byte 0xDB,0xE4		/* fsetpm for 287, ignored by 387 */
	ret

/*
 *  setup_idt
 *
 *  sets up a idt with 256 entries pointing to
 *  ignore_int, interrupt gates. It then loads
 *  idt. Everything that wants to install itself
 *  in the idt-table may do so themselves. Interrupts
 *  are enabled elsewhere, when we can be relatively
 *  sure everything is ok. This routine will be over-
 *  written by the page tables.
 */
setup_idt:
	lea ignore_int,%edx
	movl $0x00080000,%eax
	movw %dx,%ax		/* selector = 0x0008 = cs */
	movw $0x8E00,%dx	/* interrupt gate - dpl=0, present */

	lea _idt,%edi
	mov $256,%ecx
rp_sidt:
	movl %eax,(%edi)
	movl %edx,4(%edi)
	addl $8,%edi
	dec %ecx
	jne rp_sidt
	lidt idt_descr
	ret

/*
 *  setup_gdt
 *
 *  This routines sets up a new gdt and loads it.
 *  Only two entries are currently built, the same
 *  ones that were built in init.s. The routine
 *  is VERY complicated at two whole lines, so this
 *  rather long comment is certainly needed :-).
 *  This routine will beoverwritten by the page tables.
 */
setup_gdt:
	lgdt gdt_descr
	ret

/*
 * I put the kernel page tables right after the page directory,
 * using 4 of them to span 16 Mb of physical memory. People with
 * more than 16MB will have to expand this.
 */
.org 0x1000
pg0:

.org 0x2000
pg1:

.org 0x3000
pg2:

.org 0x4000
pg3:

.org 0x5000
/*
 * tmp_floppy_area is used by the floppy-driver when DMA cannot
 * reach to a buffer-block. It needs to be aligned, so that it isn't
 * on a 64kB border.
 */
_tmp_floppy_area:
	.fill 1024,1,0

after_page_tables:
	pushl $0		# These are the parameters to main :-)
	pushl $0
	pushl $0
	pushl $L6		# return address for main, if it decides to.
	pushl $_main
	jmp setup_paging
L6:
	jmp L6			# main should never return here, but
				# just in case, we know what happens.

/* This is the default interrupt "handler" :-) */
int_msg:
	.asciz "Unknown interrupt\n\r"
.align 2
ignore_int:
	pushl %eax
	pushl %ecx
	pushl %edx
	push %ds
	push %es
	push %fs
	movl $0x10,%eax
	mov %ax,%ds
	mov %ax,%es
	mov %ax,%fs
	pushl $int_msg
	call _printk
	popl %eax
	pop %fs
	pop %es
	pop %ds
	popl %edx
	popl %ecx
	popl %eax
	iret


/*
 * Setup_paging
 *
 * This routine sets up paging by setting the page bit
 * in cr0. The page tables are set up, identity-mapping
 * the first 16MB. The pager assumes that no illegal
 * addresses are produced (ie >4Mb on a 4Mb machine).
 *
 * NOTE! Although all physical memory should be identity
 * mapped by this routine, only the kernel page functions
 * use the >1Mb addresses directly. All "normal" functions
 * use just the lower 1Mb, or the local data space, which
 * will be mapped to some other place - mm keeps track of
 * that.
 *
 * For those with more memory than 16 Mb - tough luck. I've
 * not got it, why should you :-) The source is here. Change
 * it. (Seriously - it shouldn't be too difficult. Mostly
 * change some constants etc. I left it at 16Mb, as my machine
 * even cannot be extended past that (ok, but it was cheap :-)
 * I've tried to show which constants to change by having
 * some kind of marker at them (search for "16Mb"), but I
 * won't guarantee that's all :-( )
 */
.align 2
setup_paging:
	movl $1024*5,%ecx		/* 5 pages - pg_dir+4 page tables */
	xorl %eax,%eax
	xorl %edi,%edi			/* pg_dir is at 0x000 */
	cld;rep;stosl
	movl $pg0+7,_pg_dir		/* set present bit/user r/w */
	movl $pg1+7,_pg_dir+4		/*  --------- " " --------- */
	movl $pg2+7,_pg_dir+8		/*  --------- " " --------- */
	movl $pg3+7,_pg_dir+12		/*  --------- " " --------- */
	movl $pg3+4092,%edi
	movl $0xfff007,%eax		/*  16Mb - 4096 + 7 (r/w user,p) */
	std
1:	stosl			/* fill pages backwards - more efficient :-) */
	subl $0x1000,%eax
	jge 1b
	xorl %eax,%eax		/* pg_dir is at 0x0000 */
	movl %eax,%cr3		/* cr3 - page directory start */
	movl %cr0,%eax
	orl $0x80000000,%eax
	movl %eax,%cr0		/* set paging (PG) bit */
	ret			/* this also flushes prefetch-queue */

.align 2
.word 0
idt_descr:
	.word 256*8-1		# idt contains 256 entries
	.long _idt
.align 2
.word 0
gdt_descr:
	.word 256*8-1		# so does gdt (not that that's any
	.long _gdt		# magic number, but it works for me :^)

	.align 3
_idt:	.fill 256,8,0		# idt is uninitialized

_gdt:	.quad 0x0000000000000000	/* NULL descriptor */
	.quad 0x00c09a0000000fff	/* 16Mb */
	.quad 0x00c0920000000fff	/* 16Mb */
	.quad 0x0000000000000000	/* TEMPORARY - don't use */
	.fill 252,8,0			/* space for LDT's and TSS's etc */

           

相關知識

裝置号

linux0.11程式中的硬碟裝置命名方式為主裝置号+次裝置号,常見的主裝置号有:1-記憶體、2-磁盤、3-硬碟、4-ttyx、5-tty、6-并行口、7-非命名管道1。由于1個硬碟中可以有1—4個分區,是以1,硬碟還依據分區的不同用次裝置号進行指定分區。是以硬碟的邏輯裝置号由以下方式構成:裝置号=主裝置号*256+次裝置号。

分段記憶體管理

32位保護模式下,linux-0.11使用分段管理記憶體。資料段和代碼段描述資訊被儲存在gdt/ldt中,段描述資訊稱為段描述符,包含段基址、段限長、讀寫保護等資訊。代碼運作時,gdt/ldt表位址會被儲存在GDTR/LDTR寄存器中。通路段時,可通過段寄存器中儲存的段選擇符(包含段的索引資訊)通路gdt/ldt中的段描述符,讀取段基址資訊。擷取段基址後,使用段基址加上偏移位址可計算出實際通路的記憶體位址。

程式代碼段和資料段的描述符格式

主要資訊包括段基址、段限長、描述符特權級(DPL)、讀寫通路保護等。

  • 描述符特權級(DPL):描述了通路段需要的請求特權等級(RPL),當RPL<=DPL時,代碼才可以通路段内容。
linux系統引導過程

段選擇符格式

包含描述符索引、表訓示器(TI位)和請求者特權級(RPL位)。

  • 描述符索引:GDT/LDT表的表項索引号。
  • 表訓示器(TI位):指定選擇符引用的描述符表,值為0表示指定GDT表,值為1表示指定LDT表。
  • 請求者特權級(RPL位):用于保護機制,Linux中有兩個特權級,較高特權級的核心(RPL=0)代碼和較低特權級的使用者代碼(RPL=3)。
linux系統引導過程

相關寄存器

CR0:CR0中的PE位,訓示是否啟用保護模式,啟用保護模式時,使用分段記憶體管理。

GDTR/LDTR:儲存GDT/LDT表位址,使用lgdt/lldt指令加載GDT/LDT表位址到GDTR/LDTR寄存器中。

分頁記憶體管理

保護模式下,啟動分頁記憶體管理後,經過分段機制計算得到的位址不再是實體位址,而是虛拟位址,拟位址由幾個部分構成:虛拟位址=頁目錄項索引+頁表項索引+頁内偏移構成。根據頁目錄項索引和頁表項索引查詢頁目錄和頁表後可獲得實體頁号,進一步可計算實體位址:實體位址=實體頁号+業内偏移。

頁目錄與頁表

頁目錄:存儲包括頁表位址在内的頁表描述資訊,每個頁表描述資訊為一個頁目錄項。

頁表:存儲包括實體頁位址在内的實體頁描述資訊,每個頁表描述資訊為一個頁表項。

線性位址到實體位址之間的變換

linux系統引導過程

CR0:CR0中的PG位訓示是否開啟分頁機制。

CR3:存儲頁目錄位址。

[參考資料]

繼續閱讀