Module Utilities

General utility functions, mostly concerning manipulation of Python objects and the file system.

Author: I.A. Bond

Organization: WFAU, IfA, University of Edinburgh

Contributors: R.S. Collins, N.J.G. Cross, N.C. Hambly, E. Sutorius

Classes

ParsedFile
Behaves like a file object, except that when iterating over file lines only non-blank, non-comment lines are returned and any EOL characters are removed together with trailing white-space.
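The filtering described above can be sketched as a plain generator. This is an illustration only, not the wsatools implementation: `iter_parsed_lines` is a hypothetical name, and the `#` comment marker is an assumption, since the summary does not say which marker ParsedFile recognises.

```python
def iter_parsed_lines(lines, comment_marker="#"):
    """Yield non-blank, non-comment lines without EOL or trailing white-space."""
    for line in lines:
        stripped = line.rstrip()  # removes "\n", "\r\n" and trailing white-space
        if not stripped:
            continue  # blank line
        if stripped.lstrip().startswith(comment_marker):
            continue  # comment line (marker is an assumption)
        yield stripped


sample = ["# header\n", "\n", "alpha 1  \n", "   \n", "beta 2\r\n"]
parsed = list(iter_parsed_lines(sample))
```

Any iterable of strings works as input, including an open text file.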

WordWrapper
Formats long strings so that they neatly fit within a certain width without words being split across lines.
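For comparison, the standard library's `textwrap` module gives a similar effect. The sketch below uses `textwrap.fill` as a stand-in; it does not show how WordWrapper itself is called, and the width of 24 is an arbitrary choice for the example.

```python
import textwrap

message = "Formats long strings so that they fit within a certain width"

# Wrap at word boundaries so that no line exceeds 24 characters.
wrapped = textwrap.fill(message, width=24)
lines = wrapped.splitlines()
```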

Ratings
Ratings is mostly like a dictionary, with extra features: the value corresponding to each key is the 'score' for that key, and all keys are ranked in terms of their scores.
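A minimal sketch of the idea, assuming that a higher score means a better rank. `RatingsSketch` and its `ranked` method are hypothetical names chosen for this example; the real Ratings interface is not shown in this summary.

```python
class RatingsSketch(dict):
    """Dictionary of key -> score with a helper that ranks keys by score."""

    def ranked(self):
        # Highest score first (an assumption about the ranking direction).
        return sorted(self, key=self.get, reverse=True)


scores = RatingsSketch(a=3, b=7, c=5)
scores["d"] = 1  # behaves like an ordinary dictionary
ranking = scores.ranked()
```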

Functions

int

_getUserWidth()
Private helper function used by the WordWrapper class.

list

arbSort(unsortedList, kwdList, key=<function <lambda> at 0x26be410>, isFullKeySet=True)
Arbitrarily sorts a list of tuples of form (keyword, value) by the order defined in the sequence of specified keywords.
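The ordering can be sketched with a position lookup. `arb_sort_sketch` is a hypothetical stand-in that ignores the `key` and `isFullKeySet` arguments; it assumes every keyword in the list appears in the keyword sequence and raises `KeyError` otherwise.

```python
def arb_sort_sketch(pairs, keyword_order):
    """Sort (keyword, value) tuples by each keyword's position in keyword_order."""
    position = {keyword: index for index, keyword in enumerate(keyword_order)}
    return sorted(pairs, key=lambda pair: position[pair[0]])


pairs = [("dec", 2.5), ("id", 7), ("ra", 10.1)]
ordered = arb_sort_sketch(pairs, ["id", "ra", "dec"])
```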

ensureDirExist(aDir)
If the supplied directory does not exist then create it.
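In current Python the same effect is available from `os.makedirs` with `exist_ok=True`. The sketch below is a stand-in, not the wsatools code; creating intermediate directories as well is an assumption, since the summary does not say whether ensureDirExist does so.

```python
import os
import tempfile


def ensure_dir_exists(path):
    """Create the directory (and any missing parents) if it is absent."""
    os.makedirs(path, exist_ok=True)


base = tempfile.mkdtemp()
target = os.path.join(base, "products", "2024")
ensure_dir_exists(target)
ensure_dir_exists(target)  # second call finds the directory and does nothing
created = os.path.isdir(target)
```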

str

expandNumberRange(numberRange, useTens=False)
Given a human readable compact number range string, expand it to a complete sequence of numbers in a CSV string.

generator(str)

extractColumn(filePathName, colNum)
Gobble all entries in the given column of a space-separated text file into a list.

list(list(dataType))

extractColumns(filePathName, columnList=None, numRows=None, dataType=<type 'str'>)
Extracts from the given file the data in the given list of columns as a list of strings for every column.

dict(str:list(int, int))

getDiskSpace(disks)
Gets the available disk space for supplied list of disks.

set

getDuplicates(anIterable)
Returns the set of items for which the groupBy method returns the same item more than once.
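The result can be illustrated with `collections.Counter`, which is a different technique from the groupBy approach the description mentions. `get_duplicates_sketch` is a hypothetical name and assumes hashable items.

```python
from collections import Counter


def get_duplicates_sketch(an_iterable):
    """Return the set of items that occur more than once."""
    counts = Counter(an_iterable)
    return {item for item, count in counts.items() if count > 1}


duplicates = get_duplicates_sketch([1, 2, 1, 3, 3, 3])
```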

list(int)

getListIndices(aList, value)
Returns a list of all indices where a value occurs in the given list.
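A minimal equivalent, using `enumerate`. `list_indices` is a hypothetical stand-in written for this example.

```python
def list_indices(a_list, value):
    """Return every index at which value occurs in a_list."""
    return [index for index, item in enumerate(a_list) if item == value]


found = list_indices(["J", "H", "K", "H"], "H")
```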

str

getNextDisk(sysc, spacePerDisk=None, byPercentFree=False, preAllocMem=0)
Gets the next available disk which is less than 99% full.

int

getNumberItems(numberRange, useTens=False)
Calculates the number of items expressed by a human-readable string of number ranges.

int

getSystemMemoryKb()
Returns: Amount of available memory in kilobytes.

list(tuple(int, int))

groupByCounts(keyCounts, groupSize)
Taking an ordered list of counts of a particular key, e.g.

str

joinDict(aDict, joinStr=' = ', sepStr=', ')
Like string.join, but operates on the contents of a dictionary instead of a list.
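A sketch using the default separators from the signature. `join_dict_sketch` is a hypothetical name; writing each pair as key, joinStr, value and keeping the dictionary's iteration order are assumptions.

```python
def join_dict_sketch(a_dict, join_str=" = ", sep_str=", "):
    """Join each key/value pair with join_str, then join the pairs with sep_str."""
    return sep_str.join(
        f"{key}{join_str}{value}" for key, value in a_dict.items()
    )


text = join_dict_sketch({"ra": 10.5, "dec": -3})
```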

str

joinNested(aSeq, joinStr=', ', subJoinStr=None, seqIndex=None)
Like string.join, but can handle nested (or un-nested) sequences of string-castable objects.

dict

invertDict(aDict, forceReturnList=False)
Inverts the dictionary in such a way that if the input dictionary's values are lists, each item of this list will become a key with the input dictionary's key as value.
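The inversion can be sketched as follows. `invert_dict_sketch` is a hypothetical stand-in that does not model `forceReturnList`; when two input keys share an item, this sketch simply keeps the later key, which may differ from the real behaviour.

```python
def invert_dict_sketch(a_dict):
    """Map each value (or each item of a list value) back to its key."""
    inverted = {}
    for key, value in a_dict.items():
        # Treat a scalar value as a one-item list.
        items = value if isinstance(value, (list, tuple)) else [value]
        for item in items:
            inverted[item] = key
    return inverted


inverted = invert_dict_sketch({"wide": ["J", "H"], "deep": ["K"], "misc": "Y"})
```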

mx.DateTime

makeDateTime(time=None)
Returns an archive date/time data type, defaulting to the current time if no input argument is given.

str

makeMssqlTimeStamp()
Creates a timestamp using makeTimeStamp and formats appropriately for use in ingest strings for Microsoft SQL Server (and handles a bug in the datetime object creation).

str

makeTimeStamp()
Returns: An archive time stamp as a string, as opposed to the internal date time type.

bool

moreThanOneIn(values)
Generator test function. Equivalent to the memory-hungry len(list(values)) > 1, or to sum(1 for _ in values) > 1, but does not iterate through all items.
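One way to get this early exit is `itertools.islice`, which pulls at most two items from the iterable. `more_than_one_in` below is a stand-in sketch, not the wsatools source; the `noisy` generator exists only to record how many items were consumed.

```python
from itertools import islice


def more_than_one_in(values):
    """True if the iterable yields at least two items; reads at most two."""
    return len(list(islice(values, 2))) > 1


consumed = []


def noisy(count):
    """Generator that records every item it hands out."""
    for number in range(count):
        consumed.append(number)
        yield number


result = more_than_one_in(noisy(1000))
```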

str

multiSub(text, subs)
Performs multiple string substitutions on the given string.
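A sketch under the assumption that `subs` is a sequence of (old, new) pairs applied in order; the summary does not state the actual format. `multi_sub_sketch` is a hypothetical name.

```python
def multi_sub_sketch(text, subs):
    """Apply each (old, new) replacement to text, one after another."""
    for old, new in subs:
        text = text.replace(old, new)
    return text


result = multi_sub_sketch("a-b_c", [("-", " "), ("_", " ")])
```

Because the replacements run sequentially, a later pair can act on the output of an earlier one.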

noInterrupt(*args, **kwds)
Disables keyboard interrupts whilst in this context.
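A context manager of this kind can be sketched with the `signal` module. `no_interrupt_sketch` is a hypothetical stand-in: it assumes that interrupts are ignored rather than deferred, and it must run in the main thread, since `signal.signal` is only permitted there.

```python
import signal
from contextlib import contextmanager


@contextmanager
def no_interrupt_sketch():
    """Ignore SIGINT (Ctrl-C) inside the with-block, then restore the handler."""
    previous = signal.signal(signal.SIGINT, signal.SIG_IGN)
    try:
        yield
    finally:
        signal.signal(signal.SIGINT, previous)


before = signal.getsignal(signal.SIGINT)
with no_interrupt_sketch():
    inside = signal.getsignal(signal.SIGINT)  # handler while protected
after = signal.getsignal(signal.SIGINT)
```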

list(list)

npop(aList, nMax=2, mode='topbot')
Divides aList into nMax sublists, populating them with items taken successively from the top and the bottom of the list.

str

numberRange(numbers, sep=', ', useTens=False)
Given a sequence of integers it returns that sequence as a string representation of an ordered range of unique numbers.

list(X)

orderedSet(seq, excludeList=None)
Returns a list of the given sequence in the original order, but with duplicates removed.
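A minimal sketch, assuming hashable items and assuming that entries of `excludeList` are left out of the result. `ordered_set_sketch` is a hypothetical name.

```python
def ordered_set_sketch(seq, exclude=None):
    """Return the items of seq in first-seen order, without duplicates."""
    seen = set(exclude or ())  # excluded items count as already seen
    result = []
    for item in seq:
        if item not in seen:
            seen.add(item)
            result.append(item)
    return result


unique = ordered_set_sketch([3, 1, 3, 2, 1, 4], exclude=[2])
```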

float

parseFloat(value)
Parses a string value and converts to float.

generator(list(X))

splitList(longList, chunkSize=2, noSingles=False)
Splits a list into a list of equal-sized chunks.
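Chunking by slices can be sketched as a generator, matching the documented return type. `split_list_sketch` is a hypothetical stand-in that does not model `noSingles`; in this sketch the final chunk is simply shorter when the list length is not a multiple of the chunk size.

```python
def split_list_sketch(long_list, chunk_size=2):
    """Yield consecutive slices of long_list, each chunk_size items long."""
    for start in range(0, len(long_list), chunk_size):
        yield long_list[start:start + chunk_size]


chunks = list(split_list_sketch([1, 2, 3, 4, 5], chunk_size=2))
```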

generator(X)

unpackList(combinedList)
Given a list of lists, return a single sequence containing all of the elements of the combined list, as a generator.
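A sketch of the flattening, one level deep. `unpack_list_sketch` is a hypothetical name; `itertools.chain.from_iterable` would give the same sequence.

```python
def unpack_list_sketch(combined_list):
    """Yield every element of every sub-list, in order."""
    for sub_list in combined_list:
        for item in sub_list:
            yield item


flat = list(unpack_list_sketch([[1, 2], [3], [], [4, 5]]))
```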

naturalSorted(strings)
Sort strings naturally.
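Natural sorting is commonly done by splitting each string into text and digit runs and comparing the digit runs as integers. The sketch below shows that approach; it is an assumption that naturalSorted and naturalSortKey work this way, and the snake_case names are hypothetical.

```python
import re


def natural_sort_key(text):
    """Split text into alternating text and integer parts for comparison."""
    return [
        int(part) if part.isdigit() else part
        for part in re.split(r"(\d+)", text)
    ]


def natural_sorted(strings):
    """Sort so that 'file2' comes before 'file10'."""
    return sorted(strings, key=natural_sort_key)


names = natural_sorted(["file10.fits", "file2.fits", "file1.fits"])
```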

naturalSortKey(key)

Variables

__package__ = 'wsatools'
