- mds failover code
- connection and recovd subsystem - refined handling of replies/timeout with levels: - requests are delayed until the request level is lower than or equals to the connection level - much updated network documentation - updated file system recovery documentation - server maintains lists of open files and handles "re-opening" maintains list in the metadata client info structures. - flags on requests to indicate their disposition after a reply, e.g. retain until commit, retain until explicitly canceled etc. - new failure instrumentation to drop a reply, but execute the request. - handling of re-sent creation requests - move file attribute updates on mds to close, remove from write - reconnection routine in llight. - work through recovery list more orderly: - retain list in sent order - handle according to disposition of request - return integers not void - add direct (0-copy) I/O support -- doesn't compile on 2.4.9 - failure handling in client reintegration code - replay handling in server reintegration code - add names to client systems to understand debugging/tracing output better - remove most lists from the client structure: the multiple lists introduced request reordering. We now use one list and flag the requests. - re-addressing of connections: invoked by the client recovery scripts - don't reallocate reply buffers if they were already there and not consumed in case of re-sending requests. - introduce a request replay function: I want this to be merged with ptlrpc_queue wait soon. - small support routines for continuing delayed requests, restarting requests for which replies were lost, etc. - try to get negative errors back even when Portals errors return positive problems. - make last committed and received 64 bit in network packets. - write test programs that: - keep files open - do I/O every second - include 5 basic regression cases for failover recovery: runfailure-client-mds.sh - simplify ha_assist.sh -- the secondary ha_assist program does the work
Showing
- lustre/doc/.cvsignore 1 addition, 0 deletionslustre/doc/.cvsignore
- lustre/include/linux/lustre_idl.h 14 additions, 13 deletionslustre/include/linux/lustre_idl.h
- lustre/include/linux/lustre_mds.h 10 additions, 1 deletionlustre/include/linux/lustre_mds.h
- lustre/include/linux/lustre_net.h 20 additions, 10 deletionslustre/include/linux/lustre_net.h
- lustre/include/linux/obd_support.h 2 additions, 1 deletionlustre/include/linux/obd_support.h
- lustre/lib/mds_updates.c 1 addition, 0 deletionslustre/lib/mds_updates.c
- lustre/llite/commit_callback.c 0 additions, 2 deletionslustre/llite/commit_callback.c
- lustre/llite/file.c 15 additions, 15 deletionslustre/llite/file.c
- lustre/llite/recover.c 104 additions, 7 deletionslustre/llite/recover.c
- lustre/llite/rw.c 56 additions, 0 deletionslustre/llite/rw.c
- lustre/llite/super.c 3 additions, 1 deletionlustre/llite/super.c
- lustre/mdc/mdc_reint.c 29 additions, 9 deletionslustre/mdc/mdc_reint.c
- lustre/mdc/mdc_request.c 9 additions, 4 deletionslustre/mdc/mdc_request.c
- lustre/mds/handler.c 53 additions, 4 deletionslustre/mds/handler.c
- lustre/mds/mds_reint.c 52 additions, 2 deletionslustre/mds/mds_reint.c
- lustre/osc/osc_request.c 2 additions, 0 deletionslustre/osc/osc_request.c
- lustre/ptlrpc/client.c 185 additions, 64 deletionslustre/ptlrpc/client.c
- lustre/ptlrpc/connmgr.c 1 addition, 0 deletionslustre/ptlrpc/connmgr.c
- lustre/ptlrpc/events.c 1 addition, 9 deletionslustre/ptlrpc/events.c
- lustre/ptlrpc/niobuf.c 12 additions, 21 deletionslustre/ptlrpc/niobuf.c
Loading
Please register or sign in to comment