StringPattern
With this gem, you can easily generate strings supplying a very simple pattern. Even generate random words in English or Spanish.
Also, you can validate if a text fulfills a specific pattern or even generate a string following a pattern and returning the wrong length, value... for testing your applications. Perfect to be used in test data factories.
Also you can use regular expressions (Regexp) to generate strings: /[a-z0-9]{2,5}\w+/.gen
To do even more take a look at nice_hash gem
Installation
Add this line to your application's Gemfile:
gem 'string_pattern'
And then execute:
$ bundle
Or install it yourself as:
$ gem install string_pattern
Usage
What is a string pattern?
A pattern is a string where we supply these elements "a-b:c" where a is min_length, b is max_length (optional) and c is a set of symbol_type
min_length: minimum length of the string
max_length (optional): maximum length of the string. If not provided, the result will be with the min_length provided
symbol_type: The type of the string we want.
x: from a to z (lowercase)
X: A to Z (capital letters)
L: A to Z and a to z
T: National characters defined on StringPattern.national_chars
n or N: for numbers. 0 to 9
$: special characters, $%&#... (includes blank space)
_: blank space
*: all characters
0: empty string will be accepted. It needs to be at the beginning of the symbol_type string
@: It will generate a valid email following the official algorithm. It cannot be used with other symbol_type
W: for English words, capital and lower. It cannot be used with other symbol_type
w: for English words only lower and words separated by underscore. It cannot be used with other symbol_type
P: for Spanish words, capital and lower. It cannot be used with other symbol_type
p: for Spanish words only lower and words separated by underscore. It cannot be used with other symbol_type
How to generate a string following a pattern
To generate a string following a pattern you can do it using directly the StringPattern class or the generate method in the class, be aware you can always use also the alias method: gen
require 'string_pattern'
p StringPattern.generate "10:N"
p StringPattern.gen "5:X"
p "4:Nx".gen
p :"10:T".generate
p [:"3:N", "fixed", :"3:N"].gen
p "(,3:N,) ,3:N,-,2:N,-,2:N".split(',').generate
p gen "3:N"
Generating unique strings
If you want to generate for example 1000 strings and be sure all those strings are different you can use:
StringPattern.dont_repeat = true
1000.times {
puts :"6-20:L/N/".gen
}
StringPattern.cache_values = Hash.new()
Using dont_repeat all the generated string during the current run will be unique.
In case you just want one particular string to be unique but not the rest then add to the pattern just in the end the symbol: &
The pattern needs to be a symbol object.
1000.times {
puts :"6-20:L/N/&".gen
puts :"10:N".gen
}
Generate words randomly in English or Spanish
To generate a string of the length you want that will include only real words, use the symbol types:
- W: generates English words following CamelCase ('ExampleOutput')
- w: generates English words following snake_case ('example_output')
- P: generates Spanish words following CamelCase ('EjemploSalida')
- p: generates Spanish words following snake_case ('ejemplo_salida')
require 'string_pattern'
puts '10-30:W'.gen
puts '10-30:w'.gen
puts '10-30:P'.gen
puts '10-30:p'.gen
If you want to use a different word separator than "_" when using 'w' or 'p':
require 'string_pattern'
StringPattern.word_separator = ' '
puts '10-30:w'.gen
puts '10-30:p'.gen
The word list is loaded on the first request to generate words, after that the speed to generate words increases amazingly. 85000 English words and 250000 Spanish words. The vocabularies are a sample of public open sources.
Generate strings using Regular Expressions (Regexp)
Take in consideration this feature is not supporting all possibilities for Regular expressions but it is fully functional. If you find any bug or limitation please add it to issues: https://github.com/MarioRuiz/string_pattern/issues
In case you want to change the default maximum for repetitions when using * or +: StringPattern.default_infinite = 30
. By default is 10.
If you want to translate a regular expression into an StringPattern use the method we added to Regexp class: to_sp
Examples:
/[a-z0-9]{2-5}\w+/.to_sp
#> ["2-5:nx", "1-10:Ln_"]
#regular expression for UUID v4
/[0-9A-F]{8}-[0-9A-F]{4}-4[0-9A-F]{3}-[89AB][0-9A-F]{3}-[0-9A-F]{12}/.to_sp
If you want to generate a random string following the regular expression, you can do it like a normal string pattern:
regexp = /[0-9A-F]{8}-[0-9A-F]{4}-4[0-9A-F]{3}-[89AB][0-9A-F]{3}-[0-9A-F]{12}/
puts StringPattern.generate(regexp)
puts generate(regexp)
puts regexp.generate
puts regexp.gen
String patterns
How to generate one or another string
In case you need to specify that the string is generated selecting one or another fixed string or pattern, you can do it by using Array of patterns and in the position you want you can add an array with the possible values
p ["uno:", :"5:N", ['.red','.green', :'3:L'] ].gen
Take in consideration that this is only available to generate successful strings but not for validation
Custom characters
Also, it's possible to provide the characters we want. To do that we'll use the symbol_type [characters]
If we want to add the character ] we have to write ]]
Examples
p "4:[asDF9]".gen
p "2-20:[$#6]Xx".gen
p "4:[asDF]]9]".gen
Required characters or symbol types
We'll use the symbol / to specify which characters or symbols we want to be included on the resulting string as required values /symbols or characters/
If we need to add the character / we'll use //
Examples:
"4:XN/x/".gen
"6-15:[/23abCD/$Æ]NX".gen
"4-9:[/$5/]XN/x/".generate
Excluded characters
If we want to exclude a few characters in the result, we'll use the symbol %characters%
If you need to exclude the character %, you should use %%
Examples:
"2-20:[%83%ABC]N".gen
"10:L/n/[%0WXYzZ%]".gen
Not fulfilling a pattern
If we want our resulting string doesn't fulfill the pattern we supply, then we'll use the symbol ! at the beginning
Examples:
"!4:XN/x/".gen
"!10:N".gen
Generate a string with specific expected errors
Usually, for testing purposes you need to generate strings that don't fulfill a specific pattern, then you can supply as a parameter expected_errors (alias: errors)
The possible values you can specify is one or more of these ones: :length, :min_length, :max_length, :value, :required_data, :excluded_data, :string_set_not_allowed
:length: wrong length, minimum or maximum
:min_length: wrong minimum length
:max_length: wrong maximum length
:value: wrong resultant value
:required_data: the output string won't include all necessary required data. It works only if required data supplied on the pattern.
:excluded_data: the resultant string will include one or more characters that should be excluded. It works only if excluded data supplied on the pattern.
:string_set_not_allowed: it will include one or more characters that are not supposed to be on the string.
Examples:
"10-20:N".gen errors: [:min_length]
"20:N".gen errors: [:length, :value]
"10:L/n/".gen errors: [:value]
Validate if a string is following a pattern
If you need to validate if a specific text is fulfilling the pattern you can use the validate method.
If a string pattern supplied and no other parameters supplied the output will be an array with the errors detected.
Possible output values, empty array (validation without errors detected) or one or more of: :min_length, :max_length, :length, :value, :string_set_not_allowed, :required_data, :excluded_data
In case an array of patterns supplied it will return only true or false
Examples:
StringPattern.validate((text: "This text will be validated", pattern: :"10-20:Xn")
"10:N".validate "333444"
:"10:N".validate("333444")
["5:L","3:xn","4-10:n"].validate "DjkljFFc343444390"
If we want to validate a string with a pattern and we are expecting to get specific errors, you can supply the parameter expected_errors (alias: errors) or not_expected_errors (aliases: non_expected_errors, not_errors).
In this case, the validate method will return true or false.
Examples:
"10:N".val "3445", errors: [:min_length]
"10:N/[09]/".validate "4434039440", errors: [:value]
"10-12:XN/x/".validate "FDDDDDAA343434", errors: [:max_length, :required_data]
Configure
SP_ADD_TO_RUBY
This gem adds the methods generate (alias: gen) and validate (alias: val) to the Ruby classes: String, Array, and Symbol.
Also adds the method generate (alias: gen) to Kernel. By default (true) it is always added.
In case you don't want to be added, just before requiring the library set:
SP_ADD_TO_RUBY = false
require 'string_pattern'
In case it is set to true (default) then you will be able to use:
require 'string_pattern'
"20-30:@".gen
"10:L/N/[/-./%d%]".validate("12ds6f--.s")
"20-40:@".validate(my_email)
gen "10:N"
"(,3:N,) ,3:N,-,2:N,-,2:N".split(",").generate
%w{( 3:N ) 1:_ 3:N - 2:N - 2:N}.gen
["1:L", "5-10:LN", "-", "3:N"].gen
national_chars
To specify which national characters will be used when using the symbol type: T, you use StringPattern.national_chars, by default is the English alphabet
StringPattern.national_chars = (('a'..'z').to_a + ('A'..'Z').to_a).join + "áéíóúÁÉÍÓÚüÜñÑ"
"10-20:Tn".gen
optimistic
If true it will check on the strings of the array positions supplied if they have the pattern format and assume in that case that is a pattern. If not it will assume the patterns on the array will be supplied as symbols. By default is set to true.
StringPattern.optimistic = false
["5:X","fixedtext", "3:N"].generate
[:"5:X","fixedtext", :"3:N"].generate
StringPattern.optimistic = true
["5:X","fixedtext", "3:N"].generate
[:"5:X","fixedtext", :"3:N"].generate
block_list
To specify which words will be avoided from the results
StringPattern.block_list = ['example', 'wrong', 'ugly']
StringPattern.block_list_enabled = true
"2-20:Tn".gen
Contributing
Bug reports and pull requests are welcome on GitHub at https://github.com/marioruiz/string_pattern.
License
The gem is available as open source under the terms of the MIT License.