[HLP] Weird table extraction #1128
Kochise
started this conversation in
Ask for help with specific PDFs
Maybe have a look at these projects to be able to detect and create subgroups: https://github.com/MulongXie/UIED Regards.
Thanks for showing this example. Given the particular structure of that product sheet, some custom logic, taking advantage of the particular structural elements on the page, might work better than
Hi, I'm trying to extract data from:
https://www.se.com/uk/en/product/RM17JC00MW/modular-current-control-relay-8-a-1-co-overcurrent-function-250-v-ac-dc/
-> https://www.se.com/uk/en/product/download-pdf/RM17JC00MW?filename=Schneider+Electric_Harmony-Control-Relays_RM17JC00MW.pdf
Settings
First page
Console
lst_text(1)=['Product datasheet', 'Specifications', 'Modular current control relay, 8 A,', '1 CO, overcurrent function, 250 V', 'AC/DC', 'Local distributor code:', '389834764 RM17JC00MW', 'EAN Code: 3389119405034', 'Main', 'Range Of Product Harmony Control Relays', 'Relay Type Current control relay', 'Product Or Component Type Current control relay', 'Relay Name RM17JC', 'Relay Monitored Parameters Overcurrent detection', 'Time Delay Without', 'Switching Capacity In Va 1250 VA', 'Minimum Switching Current 10 mA at 5 V DC', 'Maximum Power Consumption In 3 VA', 'Va', 'Measurement Range 2...20 A AC', 'Utilisation Category AC-12 conforming to IEC 60947-5-1', 'AC-13 conforming to IEC 60947-5-1', 'AC-14 conforming to IEC 60947-5-1', 'AC-15 conforming to IEC 60947-5-1', 'DC-12 conforming to IEC 60947-5-1', 'DC-13 conforming to IEC 60947-5-1', 'DC-14 conforming to IEC 60947-5-1', 'Contacts Type And Composition 1 C/O', 'Complementary', 'Maximum Switching Voltage 250 V AC/DC', '[Us] Rated Supply Voltage AC/DC', 'Supply Voltage Limits 20.4…264 V AC/DC', 'Operating Voltage Tolerance - 15 % + 10 % Un', 'Maximum Power Consumption In 1 W', 'W', 'Control Circuit Frequency 40...70 Hz sinusoidal', 'Output Contacts 1 C/O', 'Nominal Output Current 5 A', 'Maximum Measuring Cycle 30 ms measurement cycle as true rms value', 'Hysteresis 15 % fixed of threshold setting', 'Delay At Power Up 0.5 s', 'Measurement Accuracy +/- 10 % of the full scale value', 'Repeat Accuracy +/- 0.5 % for input and measurement circuit', '16 Apr 2024 1', 'snoitacilppa', 'resu', 'cificeps', 'rof', 'stcudorp', 'eseht', 'fo', 'ytilibailer', 'ro', 'ytilibatius', 'gninimreted', 'rof', 'desu', 'eb', 'ot', 'ton', 'si', 'dna', 'rof', 'etutitsbus', 'a', 'sa', 'dednetni', 'ton', 'si', 'noitatnemucod', 'sihT', ':remialcsiD']
lst_table(1)=[[None, None, 'Local distributor code:\n389834764 RM17JC00MW'], [None, None, 'EAN Code: 3389119405034'], ['Range Of Product', 'Harmony Control', 'Relays'], ['Relay Type', 'Current control rel', 'ay'], ['Product Or Component Type', 'Current control rel', 'ay'], ['Relay Name', 'RM17JC', ''], ['Relay Monitored Parameters', 'Overcurrent detect', 'ion'], ['Time Delay', 'Without', ''], ['Switching Capacity In Va', '1250 VA', ''], ['Minimum Switching Current', '10 mA at 5 V DC', ''], ['Maximum Power Consumption In\nVa', '3 VA', ''], ['Measurement Range', '2...20 A AC', ''], ['Utilisation Category', 'AC-12 conforming\nAC-13 conforming\nAC-14 conforming\nAC-15 conforming\nDC-12 conforming\nDC-13 conforming\nDC-14 conforming', 'to IEC 60947-5-1\nto IEC 60947-5-1\nto IEC 60947-5-1\nto IEC 60947-5-1\nto IEC 60947-5-1\nto IEC 60947-5-1\nto IEC 60947-5-1'], ['Contacts Type And Composition', '1 C/O', ''], ['Complementary', '', ''], ['Maximum Switching Voltage', '250 V AC/DC', ''], ['[Us] Rated Supply Voltage', 'AC/DC', ''], ['Supply Voltage Limits', '20.4…264 V AC/D', 'C'], ['Operating Voltage Tolerance', '- 15 % + 10 % Un', ''], ['Maximum Power Consumption In\nW', '1 W', ''], ['Control Circuit Frequency', '40...70 Hz sinusoi', 'dal'], ['Output Contacts', '1 C/O', ''], ['Nominal Output Current', '5 A', ''], ['Maximum Measuring Cycle', '30 ms measureme', 'nt cycle as true rms value'], ['Hysteresis', '15 % fixed of thres', 'hold setting'], ['Delay At Power Up', '0.5 s', ''], ['Measurement Accuracy', '+/- 10 % of the full', 'scale value'], ['Repeat Accuracy', '+/- 0.5 % for input', 'and measurement circuit']]
Expected
If possible, get 2 tables with 2 columns each:
Main and Complementary
Perhaps also the header, with:
['Modular current control relay, 8 A, 1 CO, overcurrent function, 250 V AC/DC', 'Local distributor code:', '389834764 RM17JC00MW', 'EAN Code: 3389119405034'],
Regards.
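For what it's worth, here is a minimal sketch of one possible post-processing step for the output above, in the spirit of the "custom logic" suggested in the replies. It is not a pdfplumber feature: `merge_fragments` and `regroup` are hypothetical helper names, and the code only works on the `lst_table` and `lst_text` lists already printed in the console. It assumes every row has exactly three cells, that a row with a label but no value (like 'Complementary') starts a new section, and that rows before the first such title belong to "Main". The plain-text lines are used to decide whether two halves of a split value should be joined with or without a space.

```python
from itertools import zip_longest


def merge_fragments(left, right, page_text):
    """Join two halves of a value that a spurious column edge split apart."""
    if not left or not right:
        return left or right
    spaced, glued = f"{left} {right}", left + right
    # Prefer whichever variant actually occurs in the plain-text extraction.
    if spaced in page_text:
        return spaced
    if glued in page_text:
        return glued
    return spaced  # fallback when neither variant can be confirmed


def regroup(rows, text_lines, default_section="Main"):
    """Turn 3-column rows into header lines plus 2-column tables per section."""
    page_text = "\n".join(text_lines)
    header, sections, current = [], {}, default_section
    for row in rows:
        key, left, right = [(cell or "") for cell in row]
        # Multi-line cells are merged line by line.
        lines = [
            merge_fragments(l, r, page_text)
            for l, r in zip_longest(left.split("\n"), right.split("\n"), fillvalue="")
        ]
        value = "\n".join(lines)
        key = key.replace("\n", " ")
        if not key:
            # Rows without a label (distributor code, EAN code) go to the header.
            header.extend(line for line in lines if line)
        elif not value:
            # Assumption: a label with no value is a section title.
            current = key
            sections.setdefault(current, [])
        else:
            sections.setdefault(current, []).append([key, value])
    return header, sections
```

Calling `header, sections = regroup(lst_table, lst_text)` on the lists above should give `sections["Main"]` and `sections["Complementary"]` as two-column tables. The product title ('Modular current control relay, ...') is not part of `lst_table`, so it would still have to be taken from `lst_text`.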