Use JS markdown formatter
(cherry picked from commit 3941016)
This commit is contained in:
parent
dafa7e764d
commit
f068ef88a4
36
README.md
36
README.md
|
@ -24,7 +24,8 @@ ICU Normalization
|
|||
|
||||
Normalizes characters as explained [here](http://userguide.icu-project.org/transforms/normalization). It registers itself by default under `icu_normalizer` or `icuNormalizer` using the default settings. Allows for the name parameter to be provided which can include the following values: `nfc`, `nfkc`, and `nfkc_cf`. Here is a sample settings:
|
||||
|
||||
{
|
||||
```js
|
||||
{
|
||||
"index" : {
|
||||
"analysis" : {
|
||||
"analyzer" : {
|
||||
|
@ -35,14 +36,16 @@ Normalizes characters as explained [here](http://userguide.icu-project.org/trans
|
|||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
ICU Folding
|
||||
-----------
|
||||
|
||||
Folding of unicode characters based on `UTR#30`. It registers itself under `icu_folding` and `icuFolding` names. Sample setting:
|
||||
|
||||
{
|
||||
```js
|
||||
{
|
||||
"index" : {
|
||||
"analysis" : {
|
||||
"analyzer" : {
|
||||
|
@ -53,7 +56,8 @@ Folding of unicode characters based on `UTR#30`. It registers itself under `icu_
|
|||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
ICU Filtering
|
||||
-------------
|
||||
|
@ -64,7 +68,8 @@ language is wanted. See syntax for the UnicodeSet [here](http://icu-project.org/
|
|||
|
||||
The Following example exempts Swedish characters from the folding. Note that the filtered characters are NOT lowercased which is why we add that filter below.
|
||||
|
||||
{
|
||||
```js
|
||||
{
|
||||
"index" : {
|
||||
"analysis" : {
|
||||
"analyzer" : {
|
||||
|
@ -81,7 +86,8 @@ The Following example exempts Swedish characters from the folding. Note that the
|
|||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
ICU Collation
|
||||
-------------
|
||||
|
@ -94,7 +100,8 @@ Uses collation token filter. Allows to either specify the rules for collation
|
|||
|
||||
Here is a sample settings:
|
||||
|
||||
{
|
||||
```js
|
||||
{
|
||||
"index" : {
|
||||
"analysis" : {
|
||||
"analyzer" : {
|
||||
|
@ -105,11 +112,13 @@ Here is a sample settings:
|
|||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
And here is a sample of custom collation:
|
||||
|
||||
{
|
||||
```js
|
||||
{
|
||||
"index" : {
|
||||
"analysis" : {
|
||||
"analyzer" : {
|
||||
|
@ -126,7 +135,8 @@ And here is a sample of custom collation:
|
|||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
Optional options:
|
||||
* `strength` - The strength property determines the minimum level of difference considered significant during comparison.
|
||||
|
@ -159,7 +169,8 @@ ICU Tokenizer
|
|||
|
||||
Breaks text into words according to [UAX #29: Unicode Text Segmentation](http://www.unicode.org/reports/tr29/).
|
||||
|
||||
{
|
||||
```js
|
||||
{
|
||||
"index" : {
|
||||
"analysis" : {
|
||||
"analyzer" : {
|
||||
|
@ -169,7 +180,8 @@ Breaks text into words according to [UAX #29: Unicode Text Segmentation](http://
|
|||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
}
|
||||
```
|
||||
|
||||
|
||||
License
|
||||
|
|
Loading…
Reference in New Issue