-
-
Notifications
You must be signed in to change notification settings - Fork 417
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lark can't match µ character even though it is defined in the input #1478
Comments
For reference here is the grammar: // define rules for strength and strength ranges when they appear alone and with additional text // define structure for rules with no additional text // define non-terminals units as all possible unit rules then for each unit rule specify units // define all weight units // define all volume units // define all length units //define all mole units // define all iu units // define all percent units // define all time units // define all percent weight volume units // define all dosis units // list all colony forming units // list all kallikrein inactivator units // list all plaque forming units // list all becquerel unit // list all cell unit // list all area unit // list all currie unit // list all tissue culture infectious dose unit // list all hahnemannian units // list all ppm units // list all vector_genome units // list all anti xa unit //list all cell_culture_infectious_dose unit // list all equivalents unit // list all d antegen units // list all elisa unit // list all all alergan unit // list all effective response // list all relative potency unit // list all tcp unit // list all haemagglutination_inhibition_unit // list all oocysts unit // list all high activation unit // list all antigenic unut // list all tuberculin_units // list all antibody_micro_agglutination_lytic_reaction unit // list all speywood unit // list all galactosidase_units // list all pulsation units // list all katal unit // list all limit_of_flocculation_unit // list all germs unit // list all kallidinogenase_inactivator_unit // list all usp unit homeopathic_potency_unit: HOMEOPATHIC_POTENCY_UNIT | HOMEOPATHIC_POTENCY_X_UNIT | HOMEOPATHIC_POTENCY_C_UNIT // list all bioequivalent_allergy_units // list all fluorescent_focus_units // list all protein_nitrogen_units // list all helper terminals used to define strength rules // define general terminals any tips on improving the grammar are also aprreciated :) |
I am trying to write a lark parser which can extract parts of text of form number unit e.g 10 grams or 10gm etc. I am trying to parse the following input:
"10 μl"
I get the error
`10 μl
^
Expected one of:
* DECIMAL_HAHNEMANNIAN
* WEIGHT_GM
* DOSIS_TABLET
* WEIGHT_MG
* LENGTH_CM
* CELL_CULTURE_INFECTIOUS_DOSE
* MILLION_COLONY_FORMING_UNIT
* WEIGHT_NG
* SEPARATOR
* MICROKATAL
* SPEYWOOD_UNIT
* DOSIS_BLISTER
* DOSIS_GENERATOR
* VOLUME_L
* EFFECTIVE_RESPONSE_50
* EFFECTIVE_RESPONSE_60
* WEIGHT_MCG_BASE
* TISSUE_CULTURE_INFECTIOUS_DOSE
* SPACES
* VOLUME_ML
* HIGH_ACTIVATION_UNIT
* ANTIGENIC_UNIT
* LOG_ELISA_UNIT
* EFFECTIVE_RESPONSE_25
* MILI_EQUIVALENTS_UNIT
* BECQUEREL
* BIOEQUIVALENT_ALLERGY_UNIT
* DOSIS_SRT
* HOMEOPATHIC_POTENCY_C_UNIT
* IU_MIU
* DOSIS_GUM
* MOLE_MCMOL
* TIME_MIN
* GIGA_BECQUEREL
* HAEMAGGLUTINATION_INHIBITION_UNIT
* PERCENT
* DOSIS_BAG
* DOSIS_PACK
* VECTOR_GENOME_UNIT
* ANTI_XA_UNIT
* ALLERGAN_UNIT
* PROTEIN_NITROGEN_UNIT
* PERCENT_WEIGHT_PER_WEIGHT
* IU_IU
* TIME_DAY
* MOLE_MOL
* MILLION_CELL
* HOMEOPATHIC_POTENCY_X_UNIT
* DOSIS_CAPSULE
* MICROCURIE
* DOSIS_DOS
* D_ANTEGEN_UNIT
* GALACTOSIDASE_UNIT
* DOSIS_CONTAINER
* WEIGHT_PG
* KATAL
* WEIGHT_LBS
* VOLUME_DROP
* GERMS_UNIT
* CELL
* ANTIBODY_MICRO_AGGLUTINATION_LYTIC_REACTION_UNIT
* LOG_TISSUE_CULTURE_INFECTIOUS_DOSE
* TIME_HOUR
* HOMEOPATHIC_POTENCY_M_UNIT
* WEIGHT_MG_BASE
* LENGTH_MM
* OOCYST_UNIT
* RELATIVE_POTENCY_UNIT
* VOLUME_MCL
* KALLIDINOGENASE_INACTIVATOR_UNIT
* IU_KIU
* PERCENT_VOLUME_PER_VOLUME
* KALLIKREIN_INACTIVATOR_UNIT
* PERCENT_WEIGHT_PER_VOLUME
* LIMIT_OF_FLOCCULATION_UNIT
* EFFECTIVE_RESPONSE_120
* WEIGHT_TON
* WEIGHT_MCG
* LOG_CELL_CULTURE_INFECTIOUS_DOSE
* PARTS_PER_MILLION_UNIT
* TCP_UNIT
* HOMEOPATHIC_POTENCY_Q_UNIT
* DOSIS_KIT
* KILO_BECQUEREL
* WEIGHT_KGS
* TUBERCULIN_UNIT
* MILICURIE
* LOG_HAEMAGGLUTINATION_INHIBITION_UNIT
* DOSIS_LOZENGE
* RANGE_SEPARATOR
* MEGA_BECQUEREL
* USP_UNIT
* FLUORESCENT_FOCUS_UNIT
* DOSIS_SACHET
* DOSIS_BOTTLE
* EQUIVALENTS_UNIT
* ELISA_UNIT
* PERCENT_VOLUME_PER_WEIGHT
* DOSIS_VIAL
* SPORULATED_OOCYST_UNIT
* COLONY_FORMING_UNIT
* PULSATION_UNIT
* MOLE_MMOL
* CURIE
* AREA_CM_SQ
* DOSIS_STRIP
* BILLION_COLONY_FORMING_UNIT
* HOMEOPATHIC_POTENCY_UNIT
* PLAQUE_FORMING_UNIT
* WEIGHT_GM_BASE
* LOG_COLONY_FORMING_UNIT
* MOLE_NMOL
* DOSIS_SYRINGE
* DOSIS_CARTRIDGE
* EFFECTIVE_RESPONSE_70
* DOSIS_ACT
* DOSIS_CYLINDER
* LOG_PLAQUE_FORMING_UNIT
* VOLUME_GAL
* DOSIS_PIECE
None
My token VOLUME_MCL is defined as:
VOLUME_MCL: "mikroliter" | "microl" | "mcl" | "µl" | "ul"
`
The text was updated successfully, but these errors were encountered: