If you're experiencing line breaks or formatting problems in the "Experience" section of Job Descriptions parsed from PDF files, this is a known challenge due to the complex nature of PDF layouts.
Why Line Breaks Happen in PDFs
PDFs are visually formatted documents, not inherently structured like Word or HTML files. Therefore:
-
Multicolumn layouts, tables, or unusual spacing can disrupt line continuity.
-
Bullet points or numbered lists often cause extra line breaks or merged content.
-
Headers and footers might interrupt the flow if not detected correctly.
RChilli Solutions and Workarounds
-
Enable
parsethroughpdflayout
Setting (AI Layout Model)
This API setting improves parsing for column-based and visually segmented PDFs.Include in your API request JSON:
Note:
-
Works best with single-page PDFs.
-
This is a beta feature; test across various layouts is recommended.
-
-
Use
formattedpdfhtml
to Preserve Original Layout
Returns the original resume/JD layout as HTML for visual reference and QA. -
Bullet Point Preservation with
bulletpoint
Setting
Maintains bullet points in parsed fields instead of merging them into plain text. -
Additional Formatting Settings to Consider
-
htmlresume
: Returns an HTML version of parsed content. -
templateoutput
: Generates a formatted resume template based on parsed fields.
-
Recommendations
-
Experiment with combinations of these settings to optimize parsing for your JD formats.
-
If issues persist, consider preprocessing complex PDFs into simpler formats like
.docx
before parsing.
Need Further Assistance?
If formatting inconsistencies continue, share sample files and your API request payload with the RChilli Support Team at support@rchilli.com for personalized guidance.
For detailed configuration options, refer to the official JD Parser API Settings documentation:
RChilli JD Parser API Dynamic Settings
Comments
0 comments
Article is closed for comments.