Data Structures in C#
This is not an exhaustive list, just listing the most commonly used ones
Why it matters ?
Data structures gives ways to collect and organise data so that various operations can be performed on those data
The data structure has an impact on the performance of an algorithm
What is an algorithm ?
It is a set of predefined instructions to be performed in a given order to accomplish a task
The performance of an algorithm can be measured by :
Time complexity
Space complexity
The Big O notation
Asymptotic Notation (!!simplified here!!)
Algorithm complexity is rough estimation of the number of steps performed by given computation depending on the size of the input data measured through asymptotic notation
Examples:
Linear complexity O(n) – all elements are processed once (or constant number of times)
Quadratic complexity O(n2) – each of the elements is processed n times
Complexity | Notation | Description |
---|---|---|
constant | O(1) | Constant number of operations, whatever the size of the collection |
logarithmic | O(log n) | Logarithmic increase of operations |
linear | O(n) | Number of operations proportional to the input data size |
quadratic cubic |
O(n²) O(n³) |
Number of operations proportional to the square (cube )of the size of the input data |
exponential |
O(2n), O(kn), O(n!) |
Exponential number of operations, fast growing |
Array : T[]
https://msdn.microsoft.com/en-us/library/9b9dty7d.aspx
Stores multiple variables of the same type
It can be single or multi dimensional
The size of the array is fixed at instantiation time
It is zero based indexed and allow random access (by index)
Use when fixed number of elements should be processed by index
ArrayList
https://msdn.microsoft.com/en-us/library/system.collections.arraylist%28v=vs.110%29.aspx
Wraps an Array and can store object of various types
It accepts null as a valid value and allows duplicate elements
List<T>
https://msdn.microsoft.com/en-us/library/6sh2ey19%28v=vs.110%29.aspx
It is a collection of object of homogeneous type
List<T> class performs better in most cases and is type safe compared to ArrayList
Use when elements should be added and processed by index
LinkedList<T>
https://msdn.microsoft.com/en-us/library/he2s3bh7%28v=vs.110%29.aspx
Doubly linked node-pointer based collection of data
Relies on the LinkedListNode<T> class
Each node point to the previous and next node
Allows duplicate and null values
No random access possible, accessing a member requires traversing the list
Use when elements should be added at both sides of the list
Dictionary<TKey, TValue>
https://msdn.microsoft.com/en-us/library/xfhwa508(v=vs.110).aspx
Represents a collection of keys and values
Implemented as a hash table, retrieving a value by key fast, close to O(1)
Keys must be not null & unique, values can be null
Iteration returns a KeyValuePair<TKey, TValue> object
Use when key-value pairs should be added fast and searched fast by key
Hashtable
https://msdn.microsoft.com/en-us/library/system.collections.hashtable%28v=vs.110%29.aspx
Represents a collection of key/value pairs that are organized based on the hash code of the key
A key cannot be null, but a value can be
The objects used as keys by a Hashtable are required to override the Object.GetHashCode and the Object.Equals methods = You need to provide hash implementation
Iteration returns a DictionaryEntry object
HashSet<T>
https://msdn.microsoft.com/en-us/library/bb359438%28v=vs.110%29.aspx
It contains no duplicate elements
Elements are stored in no particular order
It is like a Dictionary<TKey, TValue> , but in a HashSet the value is the Key
Use to group unique values, to perform fast add and check operations
Stack<T>
https://msdn.microsoft.com/en-us/library/3278tedw%28v=vs.110%29.aspx
Last In First Out (LIFO) collection
Accepts null value, allows duplicate elements
Used to access the information in reverse order
Main operations : Push, Pop, Peek
Use to implement LIFO behavior
Queue<T>
https://msdn.microsoft.com/en-us/library/7977ey2c%28v=vs.110%29.aspx
First In First Out (FIFO) collection
Accepts null value and allows duplicate object
Objects are inserted at one end and removed from the other
Main operations : Enqueue, Dequeue, Peek
Use to implement FIFO behavior
Intro to Linq
(Language-Integrated Query)
Linq
Standard patterns for querying and updating data
Linq to Objects : to query collection of objects
Linq to XML : to query xml documents
Linq to ADO.Net : to query data sets, entities or sql databases (Linq to SQL)
Linq is a set of extension methods that helps developers query and manipulate collections
Linq to Objects queries
var query = from Student student in arrList
where student.Scores[0] > 95
orderby student.Scores[0]
select student;
var query = arrList.Where(x => x.Scores[0] > 95).OrderBy(x => x.Scores[0]);
Lambda expression query (method syntax)
SQL like query (query syntax)
https://msdn.microsoft.com/en-us/library/bb397937.aspx
Sources and links
- About the big O notation : blog post
- http://bigocheatsheet.com/#data-structures
- https://msdn.microsoft.com/en-us/default.aspx
- https://msdn.microsoft.com/en-us/library/ms379570%28v=vs.80%29.aspx
- https://msdn.microsoft.com/en-us/library/ms379571%28v=vs.80%29.aspx
- http://www.studytonight.com/data-structures/time-complexity-of-algorithms
-
http://stackoverflow.com/questions/169973/when-should-i-use-a-list-vs-a-linkedlist
Data Structures in C#
By hkoundi
Data Structures in C#
- 829